This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based speech synthesizers. In the waveform generation stage, mixed excitation is constructed by state-dependent filtering of pulse trains and white noise sequences. In the training stage, filters and pulse trains are jointly optimized through a procedure that resembles analysis-by-synthesis speech coding algorithms, in which the likelihood of residual signals (derived from the same database used to train the HMM-based synthesizer) is maximized. Preliminary results show that the proposed excitation model eliminates the unnaturalness of synthesized speech, being comparable in quality to the best approaches thus far report...
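The waveform generation step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the use of FIR filters, the per-state segment layout, and the choice to reset filter memory at each state boundary are all assumptions made for illustration. It only shows the general idea of summing a filtered pulse train and a filtered white noise sequence, with a different filter pair for each HMM state.

```python
import numpy as np

def pulse_train(n_samples, f0_hz, sample_rate):
    """Return unit pulses spaced one (rounded) pitch period apart."""
    period = int(round(sample_rate / f0_hz))
    train = np.zeros(n_samples)
    train[::period] = 1.0
    return train

def mixed_excitation(pulses, noise, state_segments):
    """Build mixed excitation from state-dependent filters (hypothetical layout).

    state_segments is a list of (length, voiced_fir, unvoiced_fir) tuples,
    one per HMM state, whose lengths sum to len(pulses). Each segment is
    filtered independently, so filter memory is not carried across state
    boundaries (a simplification assumed here).
    """
    assert len(pulses) == len(noise)
    assert sum(length for length, _, _ in state_segments) == len(pulses)
    excitation = np.zeros(len(pulses))
    start = 0
    for length, voiced_fir, unvoiced_fir in state_segments:
        end = start + length
        # Causal FIR filtering: full convolution truncated to the segment length.
        voiced = np.convolve(pulses[start:end], voiced_fir)[:length]
        unvoiced = np.convolve(noise[start:end], unvoiced_fir)[:length]
        excitation[start:end] = voiced + unvoiced
        start = end
    return excitation

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    pulses = pulse_train(320, f0_hz=100.0, sample_rate=16000)
    noise = rng.standard_normal(320)
    # Two illustrative states: the first fully voiced, the second fully noisy.
    segments = [
        (160, np.array([1.0]), np.array([0.0])),
        (160, np.array([0.0]), np.array([1.0])),
    ]
    print(mixed_excitation(pulses, noise, segments)[:5])
```

In the approach the abstract describes, the filters and pulse positions would be learned from residual signals rather than fixed by hand as they are here.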
Most HMM-based TTS systems use a hard voiced/unvoiced classification to produce a discontinuous F0 s...
In the present paper, a hidden-semi Markov model (HSMM) based speech synthesis system is proposed. I...
A major cause of degradation of speech quality in HMM-based speech synthesis is the use of a simple ...
INTERSPEECH2010: 11th Annual Conference of the International Speech Communication Association, Septe...
The quality of speech generated from Hidden Markov Model (HMM)-based Speech Synthesis Syste...
HMM-based speech synthesis generally suffers from typical buzziness due to over-simplified excitati...
This paper describes a novel technique for producing smooth speech parametric representation evoluti...
Blizzard Challenge 2009 Workshop, September 4, 2009, Edinburgh, UK. This paper describes the NICT s...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
Speech synthesis has been applied in many kinds of practical applications. Currently, state...
This paper proposes the use of the Liljencrants-Fant model (LF-model) to represent the glottal source...
In source-filter models of speech production, the residual signal - what remains after passing the s...
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-b...
A statistical speech synthesis system based on the hidden Markov model (HMM) was recently proposed. ...
Although Hidden Markov Model based speech synthesis has been shown to perform well, there ...