HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, databases sometimes contain inherent voice qualities that need to be parametrized properly. One example is vocal fry, which typically occurs at the end of utterances. A popular mixed excitation vocoder for HMM-based speech synthesis is STRAIGHT. The standard STRAIGHT is optimized for modal voices and may not produce high quality with other voice types. Fortunately, due to the flexibility of STRAIGHT, different F0 and aperiodicity measures can be used in the synthesis without any inherent degradation in speech quality. We have replaced the STRAIGHT excitation with a representation based on a robust F0 measure and a carefully determ...
This paper describes a novel parameter generation algorithm for an HMM-based speech synthesis techni...
The Multi-Space Probability Distribution Hidden Markov model (MSD-HMM) is a discrete model that lear...
INTERSPEECH 2005: the 9th European Conference on Speech Communication and Technology, September 4-8, ...
Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of t...
Fundamental frequency, or F0, is critical for high-quality speech synthesis in HMM-based speech synth...
HMM-based synthesized voices are intelligible but not natural, especially in limited data condition b...
A uniform phase representation for the harmonic model in speech synthesis applications Gilles Degott...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
Although Hidden Markov Model-based speech synthesis has been shown to perform well, there ...
The quality of speech generated from Hidden Markov Model (HMM)-based Speech Synthesis Syste...
Parametric speech synthesis has received increased attention in recent years following the developme...
This paper proposes the use of the Liljencrants-Fant model (LF-model) to represent the glottal source...
Speech synthesis has been applied in many kinds of practical applications. Currently, state...
HMM-based speech synthesis generally suffers from typical buzziness due to over-simplified excitati...
A major cause of degradation of speech quality in HMM-based speech synthesis is the use of a simple ...