The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synthesis proved to be effective against the over-smoothing problem. However, the correlation between dimensions of parameter vector is not sufficiently considered in the current GV model. For some parameters, e.g., Line Spectral Pairs (LSP), the difference of adjacent LSPs has strong influence on the spectral envelope. Considering this important feature, the paper proposes a GV modeling on the difference of adjacent LSPs, i.e., GV on frequency domain delta LSP. By improving the GV likelihood on frequency domain delta LSP, the over-smoothing effect of generated parameter trajectory is better alleviated than conventional one. The result of a perce...
Summarization: Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech syn...
Handling variable ambient noise is a challenging task for au-tomatic speech recognition (ASR) system...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synt...
This paper describes a novel parameter generation algorithm for an HMM-based speech synthesis techni...
INTERSPEECH2005: the 9th European Conference on Speech Communication and technology, September 4-8, ...
Summarization: Linear Dynamical Models (LDMs) have been used in speech synthesis recently as an alte...
Speech parameter generation considering global variance (GV generation) is widely acknowledged to dr...
ICASSP2009: IEEE International Conference on Acoustics, Speech, and Signal Processing, April 19-24...
Abstract Speech synthesis has been applied in many kinds of practical applications. Currently, state...
A minimum generation error (MGE) criterion had been proposedto solve the issues related to maximum l...
This paper proposes a technique for constructing independent parameter tying structures of mean and ...
Abstract—This paper presents a parameter generation method for hidden Markov model (HMM)-based stati...
Although Hidden Markov Model based speech synthesis has been proved to have good performance, there ...
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-b...
Summarization: Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech syn...
Handling variable ambient noise is a challenging task for au-tomatic speech recognition (ASR) system...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synt...
This paper describes a novel parameter generation algorithm for an HMM-based speech synthesis techni...
INTERSPEECH2005: the 9th European Conference on Speech Communication and technology, September 4-8, ...
Summarization: Linear Dynamical Models (LDMs) have been used in speech synthesis recently as an alte...
Speech parameter generation considering global variance (GV generation) is widely acknowledged to dr...
ICASSP2009: IEEE International Conference on Acoustics, Speech, and Signal Processing, April 19-24...
Abstract Speech synthesis has been applied in many kinds of practical applications. Currently, state...
A minimum generation error (MGE) criterion had been proposedto solve the issues related to maximum l...
This paper proposes a technique for constructing independent parameter tying structures of mean and ...
Abstract—This paper presents a parameter generation method for hidden Markov model (HMM)-based stati...
Although Hidden Markov Model based speech synthesis has been proved to have good performance, there ...
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-b...
Summarization: Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech syn...
Handling variable ambient noise is a challenging task for au-tomatic speech recognition (ASR) system...
This paper proposes the use of the Liljencrants-Fant model (LFmodel) to represent the glottal source...