2016-09-09 @ 18.21.52 In English

via Kaj Sotala

> This post presents WaveNet, a deep generative model of raw audio waveforms. We show that WaveNets are able to generate speech which mimics any human voice and which sounds more natural than the best existing Text-to-Speech systems, reducing the gap with human performance by over 50%.

> We also demonstrate that the same network can be used to synthesize other audio signals such as music, and present some striking samples of automatically generated piano pieces.

2016-08-09 @ 17.49.05 In English

(From 2012)

His arguments get flimsier and flimsier towards the end (because he wants to justify his belief), but it's an entertaining performance nevertheless.