Speech parameter generation considering global variance (GV generation) is widely acknowledged to dramatically improve the quality of synthetic speech generated by HMM-based systems. However it is slower and has higher latency than the standard speech parameter generation algorithm. In addition it is known to produce artifacts, though existing approaches to prevent artifacts are effective. We present a simple new theoretical analysis of speech parameter generation considering global variance based on Lagrange multipliers. This analysis sheds light on one source of artifacts and suggests a way to reduce their occurrence. It also suggests an approximation to exact GV generation that allows fast, low latency synthesis. In a subjective evaluati...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
Parameter trajectory generation for HMM-based speech synthesis is practically achieved using only fi...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...
INTERSPEECH2005: the 9th European Conference on Speech Communication and technology, September 4-8, ...
This paper describes a novel parameter generation algorithm for an HMM-based speech synthesis techni...
Summarization: Linear Dynamical Models (LDMs) have been used in speech synthesis recently as an alte...
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synt...
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synt...
Abstract Speech synthesis has been applied in many kinds of practical applications. Currently, state...
Fast, low-artifact speech synthesis considering global variance Citation for published version: Shan...
Although Hidden Markov Model based speech synthesis has been proved to have good performance, there ...
This thesis proposes a new probabilistic model and speech parameter generation method for statistica...
Abstract—This paper presents a parameter generation method for hidden Markov model (HMM)-based stati...
ICASSP2009: IEEE International Conference on Acoustics, Speech, and Signal Processing, April 19-24...
Parametric speech synthesis has received increased attention in recent years following the developme...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
Parameter trajectory generation for HMM-based speech synthesis is practically achieved using only fi...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...
INTERSPEECH2005: the 9th European Conference on Speech Communication and technology, September 4-8, ...
This paper describes a novel parameter generation algorithm for an HMM-based speech synthesis techni...
Summarization: Linear Dynamical Models (LDMs) have been used in speech synthesis recently as an alte...
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synt...
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synt...
Abstract Speech synthesis has been applied in many kinds of practical applications. Currently, state...
Fast, low-artifact speech synthesis considering global variance Citation for published version: Shan...
Although Hidden Markov Model based speech synthesis has been proved to have good performance, there ...
This thesis proposes a new probabilistic model and speech parameter generation method for statistica...
Abstract—This paper presents a parameter generation method for hidden Markov model (HMM)-based stati...
ICASSP2009: IEEE International Conference on Acoustics, Speech, and Signal Processing, April 19-24...
Parametric speech synthesis has received increased attention in recent years following the developme...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
Parameter trajectory generation for HMM-based speech synthesis is practically achieved using only fi...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...