This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source signal in HMM-based speech synthesis systems. These systems generally use a pulse train to model the periodicity of the excitation signal of voiced speech. However, this model produces a strong and uniform harmonic structure throughout the spectrum of the excitation which makes the synthetic speech sound buzzy. The use of a mixed band excitation and phase manipulation reduces this effect but it can result in degradation of the speech quality if the noise component is not weighted carefully. In turn, the LFwaveform has a decaying spectrum at higher frequencies, which is more similar to the real glottal source excitation signal. We conducted a...
Abstract—The quality of speech generated from Hidden Markov Model (HMM)-based Speech Synthesis Syste...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
The representation of the glottal source is of paramount impor-tance for describing para-linguistic ...
This paper proposes the use of the Liljencrants-Fant model (LF-model) to represent the glottal sourc...
Parametric speech synthesis has received increased attention in recent years following the developme...
A major cause of degradation of speech quality in HMM-based speech synthesis is the use of a simple ...
We have studied the analysis and synthesis of speech with a glottal-excited speech synthesizer. Inst...
This paper introduces a HMM-based speech synthesis system which uses a new method for the Separation...
HMM-based speech synthesis generally suffers from typical buzzi-ness due to over-simplified excitati...
Most of the degradation in current Statistical Parametric Speech Synthesis (SPSS) results from the f...
cote interne IRCAM: Degottex13aNone / NoneNational audienceIn current methods for voice transformati...
Recently, generative neural network models which operate directly on raw audio, such as WaveNet, hav...
cote interne IRCAM: Roebel12aNone / NoneNational audienceThe present article investigates into the u...
The quality of the vocoder plays a crucial role in the performance of parametric speech synthesis sy...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
Abstract—The quality of speech generated from Hidden Markov Model (HMM)-based Speech Synthesis Syste...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
The representation of the glottal source is of paramount impor-tance for describing para-linguistic ...
This paper proposes the use of the Liljencrants-Fant model (LF-model) to represent the glottal sourc...
Parametric speech synthesis has received increased attention in recent years following the developme...
A major cause of degradation of speech quality in HMM-based speech synthesis is the use of a simple ...
We have studied the analysis and synthesis of speech with a glottal-excited speech synthesizer. Inst...
This paper introduces a HMM-based speech synthesis system which uses a new method for the Separation...
HMM-based speech synthesis generally suffers from typical buzzi-ness due to over-simplified excitati...
Most of the degradation in current Statistical Parametric Speech Synthesis (SPSS) results from the f...
cote interne IRCAM: Degottex13aNone / NoneNational audienceIn current methods for voice transformati...
Recently, generative neural network models which operate directly on raw audio, such as WaveNet, hav...
cote interne IRCAM: Roebel12aNone / NoneNational audienceThe present article investigates into the u...
The quality of the vocoder plays a crucial role in the performance of parametric speech synthesis sy...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
Abstract—The quality of speech generated from Hidden Markov Model (HMM)-based Speech Synthesis Syste...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
The representation of the glottal source is of paramount impor-tance for describing para-linguistic ...