Keiichi Tokuda

Duration: 52 mins 12 secs
Share this media item:
Embed this media item:


About this item
Keiichi Tokuda's image
Description: Human-like singing and talking machines: flexible speech synthesis in karaoke, anime, smart phones, video games, digital signage, TV and radio programs, etc.
 
Created: 2015-02-11 12:34
Collection: Speech Group
Division F Talks
Publisher: University of Cambridge
Copyright: University of Cambridge
Language: eng (English)
Keywords: Speech synthesis;
 
Abstract: This talk will give an overview of statistical approach to flexible speech synthesis. For constructing human-like talking machines, speech synthesis systems are required to have an ability to generate speech with arbitrary speaker’s voice, various speaking styles in different languages, varying emphasis and focus, and/or emotional expressions. The main advantage of the statistical approach is that such flexibility can easily be realized using mathematically well-defined algorithms. In this talk, the system architecture is outlined and then recent results and demos will be presented.

Biography

Keiichi Tokuda is a Professor in the Department of Computer Science at Nagoya Institute of Technology and currently he is visiting Google as sabbatical. He is also an Honorary Professor at the University of Edinburgh. He was an Invited Researcher at the National Institute of Information and Communications Technology (NICT), formally known as the ATR Spoken Language Communication Research Laboratories, Kyoto, Japan from 2000 to 2013, and was a Visiting Researcher at Carnegie Mellon University from 2001 to 2002. He has been working on statistical parametric speech synthesis after he proposed an algorithm for speech parameter generation from HMM in 1995. He received six paper awards and two achievement awards. He is an IEEE Fellow and an ISCA Fellow.
Available Formats
Format Quality Bitrate Size
MPEG-4 Video 1280x720    2.98 Mbits/sec 1.14 GB View Download
MPEG-4 Video 640x360    1.93 Mbits/sec 758.52 MB View Download
WebM 1280x720    2.68 Mbits/sec 1.03 GB View Download
WebM 640x360    448.42 kbits/sec 171.50 MB View Download
iPod Video 480x270    520.41 kbits/sec 198.97 MB View Download
MP3 44100 Hz 249.76 kbits/sec 95.58 MB Listen Download
Auto * (Allows browser to choose a format it supports)