
Robots Can Now Talk Like Humans Thanks to Google’s DeepMind Project

Google’s DeepMind researchers have been at it again, creating technology that will help change the world as we know it, and this time it comes in the form of WaveNet. WaveNet is a convolutional neural network that comes even closer to mimicking human speech, in both US English and Mandarin Chinese. The network can also switch between different voices and create unique musical fragments.





This type of text-to-speech (TTS) system differs from those in use today in that it is trained on raw audio waveforms from multiple speakers. It then uses the network to generate synthetic utterances one sample at a time, feeding each generated sample back into the network to generate the next one. One of DeepMind’s researchers comments, “As well as yielding more natural-sounding speech, using raw waveforms means that WaveNet can model any kind of audio, including music.”
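The sample-by-sample feedback loop described above can be illustrated with a minimal sketch. This is not DeepMind's code: the function `toy_next_sample_distribution` is a hypothetical stand-in for the trained network, and the choice of 8 quantized amplitude levels is an assumption made only to keep the example small. What it shows is the loop structure, where every generated sample becomes part of the input for the next prediction.

```python
import random


def toy_next_sample_distribution(history, levels=8):
    """Hypothetical stand-in for the trained network.

    Returns one probability per quantized amplitude level, given the
    samples generated so far. A real model would be a neural network;
    here we simply favor levels close to the previous sample.
    """
    last = history[-1] if history else levels // 2
    weights = [1.0 / (1 + abs(level - last)) for level in range(levels)]
    total = sum(weights)
    return [w / total for w in weights]


def generate(num_samples, levels=8, seed=0):
    """Generate a waveform one sample at a time (autoregressively)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(num_samples):
        # Predict a distribution over the next sample from everything so far
        probs = toy_next_sample_distribution(samples, levels)
        # Draw the next sample from that distribution
        next_sample = rng.choices(range(levels), weights=probs, k=1)[0]
        # Feed it back: it is part of the input for the following step
        samples.append(next_sample)
    return samples


print(generate(16))
```

Because each step depends on the previous output, the samples cannot be produced in parallel in this form, which hints at why generation of this kind is computationally expensive.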

 

WaveNet can also learn the characteristics of different voices, both female and male, including breathing and mouth movements. The researchers state, “To make sure [WaveNet] knew which voice to use for any given utterance, we conditioned the network on the identity of the speaker. Interestingly, we found that training on many speakers made it better at modeling a single speaker than training on that speaker alone, suggesting a form of transfer learning.”





One area where WaveNet may struggle is availability: it will be limited to Google products, at least for the time being, as the approach requires so much data and computing power. The network is said to need to generate around 16,000 samples for every second of audio to create realistic speech sounds. But WaveNet can still be used to model music audio and for speech recognition, and is something we will see much more of soon.
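To get a feel for the 16,000-samples-per-second figure, here is a rough back-of-the-envelope calculation. It assumes one network prediction per audio sample, which matches the sample-by-sample generation described earlier; the helper name `samples_needed` is made up for this illustration.

```python
SAMPLE_RATE_HZ = 16_000  # samples per second, the figure quoted in the article


def samples_needed(duration_seconds, sample_rate_hz=SAMPLE_RATE_HZ):
    """Number of audio samples (one prediction each) for a clip of this length."""
    return round(duration_seconds * sample_rate_hz)


print(samples_needed(1))  # 16000
print(samples_needed(5))  # 80000
```

So even a five-second sentence would take 80,000 predictions made one after another, which goes some way to explaining the computing power involved.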
