HE’S the man who taught the Scottish dialect to Alexa, the voice behind the Amazon Echo digital assistant, so that she could respond if you called out to her to turn on the big light, “gie’s a bit of Sydney Devine” or asked the address of nearest howff for a bevvy.
It involved many darkened rooms, subterfuge on an industrial scale in co-opting scores of unknowing participants, dozens of hard disks air-freighted to Silicon Valley and hundreds of thousands of pounds of expenditure. It was, in the argot, a pure, dead brilliant operation.
Back in the summer of 2105 Richard Thynne was looking for a well-paid temporary job. He has a post-graduate degree and was thinking about teaching but needed something to do over the summer before he went travelling. He approached the graduate recruitment company Freshminds in London.
“I was asked if I had three months on my hands, which I did,” he recalls. And so the cloak-and-dagger adventure began. “The person who interviewed me only knew it something to do with the technological aspects of data capture, not who the client was. That was only known to one person in the company.”
The next stage was a series of phone interrogations down the line from California, one from the person leading the project he now knows as Nehal Meshal, cryptically described by the company as a program manager.
It was only when he was selected and turned up again at Freshminds to join seven others, the final team, that he was told the client was Amazon. Not what they had been selected to do.
A week later he was in a boardroom in the company’s massive, 200,000-plus square foot glass and steel London headquarters in Holborn Viaduct. There he was told in detail about the Alexa project, but only after he had signed a non-disclosure agreement. He and his colleagues were to go round the country sampling regional accents so the digital assistant they planned to launch in the UK could understand diverse brogues and respond properly.
“No-one in Amazon in London knew what we were doing or why we were there. We operated from a blacked-out room. The US guys had hired a market research company here, Appen, but only one person in that company knew what was going on and he had been flown in from Australia, presumably so he couldn’t blab over beers to associates in London.”
It took them a week to be trained. “It was quite a complicated affair, with loads of equipment, 20 mics at different levels and heights in different parts of the room, with all the co-ordinates then fed into the computer to provide a 3D map of the room, along with photographs.” All of the data was then fired over to Silicon Valley.
It took a day to set up the equipment and volunteers, surrounded by a battery of microphones and speakers, were paid £40 to take part.
Voice recognition software works by analysing the airwaves you make when you speak, translating that into digital data. Samples of the waves are taken at frequent intervals – hundredths or thousandths of a second – the higher the sampling and precision rates the higher the quality. These samples of words are then compared with the same words already stored in digital templates and matched, modified, together with other elements of technological wizardry known only to the 1,000 and more developers who worked on Alexa.
Thynne and colleagues were paid £120 a day, plus £20 food expenses, and put up in good hotels. “This was no cheapskate, exploitative venture,” he says.
The first stop for Thynne and the team was Manchester. The bloke from Oz, Richard Ingram, was the trailblazer going on ahead to each of the designated cities. There he would book two flats or houses through Airbnb – the group had now been split into two teams of four – and line up participants for the study, who, of course, were not told why they were taking part.
“We had a spiel and if people asked what it was all about, I told them we were measuring how sound rebounded in rooms. It was a complete red herring,” says Thynne.
Those taking part had been asked to come with the titles of five books, five films and five favourite pieces of music in their heads. The sessions were in three parts.
“They had to read a series of prompts, press the button on the tablet when they were ready and speak them out. There were 40 or so, like ‘Alexa, find me the weather in Manchester’.”
But it wasn’t just Alexa, other names were sown to add confusion, like Siri and Google.
In the second phase the prompt left a blank at the end, such as “Alexa, find me a book by ...", with the person adding in their own favourite.
Finally each participant was asked to ask Alexa their own questions. To simulate a real household environment Thynne and the others would occasionally switch on a TV or a washing machine as the person was speaking. In all there were between 200 and 300 prompts and responses in each session by each one of the more than 1,000 people taking part, all if it recorded and shipped off to the States. “A mind-boggling amount of data,” says Thynne.
“Some people guessed what we were up to (Alexa had recently launched in the States) but the overwhelming number didn’t. We had a large box of hard disks and every two days we’d trot to the post office to send them to London,” he says, “and from there air-freighted to California.”
After Manchester it was Leeds for two weeks, followed by the fortnight in Glasgow, the same in Birmingham with the last stint back in London.
“In all of the places we didn’t just record local dialects, we also had some with Irish accents, Poles, others for whom English wasn’t the native language, a whole range.”
Clearly Amazon were pleased with the result. Alexa launched in the UK last September, with an English accent, and in a blizzard of TV and newspaper advertising. Thynne would like to make his own judgment, but despite all his hard work Amazon didn’t send him his own digital helpmate.
Why are you making commenting on The Herald only available to subscribers?
It should have been a safe space for informed debate, somewhere for readers to discuss issues around the biggest stories of the day, but all too often the below the line comments on most websites have become bogged down by off-topic discussions and abuse.
heraldscotland.com is tackling this problem by allowing only subscribers to comment.
We are doing this to improve the experience for our loyal readers and we believe it will reduce the ability of trolls and troublemakers, who occasionally find their way onto our site, to abuse our journalists and readers. We also hope it will help the comments section fulfil its promise as a part of Scotland's conversation with itself.
We are lucky at The Herald. We are read by an informed, educated readership who can add their knowledge and insights to our stories.
That is invaluable.
We are making the subscriber-only change to support our valued readers, who tell us they don't want the site cluttered up with irrelevant comments, untruths and abuse.
In the past, the journalist’s job was to collect and distribute information to the audience. Technology means that readers can shape a discussion. We look forward to hearing from you on heraldscotland.com
Comments & Moderation
Readers’ comments: You are personally liable for the content of any comments you upload to this website, so please act responsibly. We do not pre-moderate or monitor readers’ comments appearing on our websites, but we do post-moderate in response to complaints we receive or otherwise when a potential problem comes to our attention. You can make a complaint by using the ‘report this post’ link . We may then apply our discretion under the user terms to amend or delete comments.
Post moderation is undertaken full-time 9am-6pm on weekdays, and on a part-time basis outwith those hours.
Read the rules hereLast Updated:
Report this comment Cancel