HMM-based speech synthesis generally suffers from typical buzzi-ness due to over-simplified excitation modeling of voiced speech. In order to alleviate this effect, several studies have proposed various new excitation models. No consensus has however been reached on what is the perceptual importance of the accurate modeling of the periodic and aperiodic components of voiced speech, and to what extent they separately contribute in improving naturalness. This pa-per considers a generalized mixed excitation modeling, common to various existing approaches, in which both periodic and aperiodic components coexist. At least three main factors may alter the quality of synthesis: periodic waveform, noise spectral weighting, and noise time envelope. ...
Parametric speech synthesis has received increased attention in recent years following the developme...
Fundamental frequency, or F0 is critical for high quality speech synthesis in HMM based speech synth...
Summarization: Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech syn...
Abstract—The quality of speech generated from Hidden Markov Model (HMM)-based Speech Synthesis Syste...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
This paper proposes the use of the Liljencrants-Fant model (LF-model) to represent the glottal sourc...
Decomposition of speech into periodic and aperiodic components is useful in analyzing and describing...
International audienceThe speech signal may be considered as the output of a time-varying vocal trac...
In hidden Markov model-based speech synthesis, speech is typically parameterized using source-filter...
The excitation for LPC speech synthesis usually consists of two separate signals- a delta-function p...
Most HMM-based TTS systems use a hard voiced/unvoiced classification to produce a discontinuous F0 s...
A major cause of degradation of speech quality in HMM-based speech synthesis is the use of a simple ...
International audienceThis paper assesses the ability of a HMM-based speech synthesis systems to mod...
Parametric speech synthesis has received increased attention in recent years following the developme...
Fundamental frequency, or F0 is critical for high quality speech synthesis in HMM based speech synth...
Summarization: Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech syn...
Abstract—The quality of speech generated from Hidden Markov Model (HMM)-based Speech Synthesis Syste...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
This paper proposes the use of the Liljencrants-Fant model (LF-model) to represent the glottal sourc...
Decomposition of speech into periodic and aperiodic components is useful in analyzing and describing...
International audienceThe speech signal may be considered as the output of a time-varying vocal trac...
In hidden Markov model-based speech synthesis, speech is typically parameterized using source-filter...
The excitation for LPC speech synthesis usually consists of two separate signals- a delta-function p...
Most HMM-based TTS systems use a hard voiced/unvoiced classification to produce a discontinuous F0 s...
A major cause of degradation of speech quality in HMM-based speech synthesis is the use of a simple ...
International audienceThis paper assesses the ability of a HMM-based speech synthesis systems to mod...
Parametric speech synthesis has received increased attention in recent years following the developme...
Fundamental frequency, or F0 is critical for high quality speech synthesis in HMM based speech synth...
Summarization: Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech syn...