A US radio journalist who lost his voice two years ago will soon return to the air, thanks to artificial intelligence. Jamie Dupree, 54, a political radio journalist with Cox Media Group, is unable to talk due to a rare neurological condition. A new voice was created for him by Scottish technology company CereProc. CereProc trained a neural network to predict how Mr Dupree would talk, using samples from his old voice recordings.
Typically, in order to create a voice for someone, the individual needs to read out a script for 30 hours in order to gather enough data. Then artificial intelligence is applied to either chop up words from the audio file and stick them back together on demand, or the technology is used to predict and imitate the person’s speech patterns.Both of these methods can cost tens of thousands of pounds, and take a month to produce just one voice. To speed up the process and make it more affordable, CereProc started developing its own neural networks in 2006. Today, its artificial intelligence system can generate a voice in just a few days for £500, once a user has recorded themselves reading the script on its website.
The neural networks, which contain between six to 10 layers each, work by slicing audio recordings of words down to phonetics. The artificial intelligence system slices each word read out by an individual into 100 tiny pieces, and does this with lots of common words until eventually it understands how basic phonetics work in that person’s voice and has an ordered sequence for all the pieces in each word. Courtesy: BBC health