Google DeepMind researchers have been busy creating a new artificial intelligence called WaveNet that has the ability to mimic human speech like no other machine out there. The machine learns to mimic individual sound waves made by humans and is currently being tested in both US English and Mandarin Chinese, and so far, the results are pretty astonishing. Although it hasn’t yet achieved 100 percent perfection, it is pretty close.
Google DeepMind’s team are so confident in their system that they have published samples that you can listen to online to decide for yourself how human you think they sound. They have also applied WaveNet to speech recognition and the researchers commented, “We trained WaveNet with two loss teams, one to predict the next sample and one to classify the frame, the model generalized better than with a single loss and achieved 18.8 PER on the test set, which is to our knowledge the best score obtained from a model trained directly on raw audio on TIMIT.” So, look out the world, there may be talking, human-sounding robots coming sooner than you think!
More News To Read