Keiichi Tokuda
Duration: 52 mins 12 secs
Share this media item:
Embed this media item:
Embed this media item:
About this item
Description: | Human-like singing and talking machines: flexible speech synthesis in karaoke, anime, smart phones, video games, digital signage, TV and radio programs, etc. |
---|
Created: | 2015-02-11 12:34 |
---|---|
Collection: |
Speech Group
Division F Talks |
Publisher: | University of Cambridge |
Copyright: | University of Cambridge |
Language: | eng (English) |
Keywords: | Speech synthesis; |
Abstract: | This talk will give an overview of statistical approach to flexible speech synthesis. For constructing human-like talking machines, speech synthesis systems are required to have an ability to generate speech with arbitrary speaker’s voice, various speaking styles in different languages, varying emphasis and focus, and/or emotional expressions. The main advantage of the statistical approach is that such flexibility can easily be realized using mathematically well-defined algorithms. In this talk, the system architecture is outlined and then recent results and demos will be presented.
Biography Keiichi Tokuda is a Professor in the Department of Computer Science at Nagoya Institute of Technology and currently he is visiting Google as sabbatical. He is also an Honorary Professor at the University of Edinburgh. He was an Invited Researcher at the National Institute of Information and Communications Technology (NICT), formally known as the ATR Spoken Language Communication Research Laboratories, Kyoto, Japan from 2000 to 2013, and was a Visiting Researcher at Carnegie Mellon University from 2001 to 2002. He has been working on statistical parametric speech synthesis after he proposed an algorithm for speech parameter generation from HMM in 1995. He received six paper awards and two achievement awards. He is an IEEE Fellow and an ISCA Fellow. |
---|
Available Formats
Format | Quality | Bitrate | Size | |||
---|---|---|---|---|---|---|
MPEG-4 Video | 1280x720 | 2.98 Mbits/sec | 1.14 GB | View | Download | |
MPEG-4 Video | 640x360 | 1.93 Mbits/sec | 758.52 MB | View | Download | |
WebM | 1280x720 | 2.68 Mbits/sec | 1.03 GB | View | Download | |
WebM | 640x360 | 448.42 kbits/sec | 171.50 MB | View | Download | |
iPod Video | 480x270 | 520.41 kbits/sec | 198.97 MB | View | Download | |
MP3 | 44100 Hz | 249.76 kbits/sec | 95.58 MB | Listen | Download | |
Auto * | (Allows browser to choose a format it supports) |