This paper describes a speaker-adaptive HMM-based speech synthesis system. The new system, called “HTS-2007,” employs speaker adaptation (CSMAPLR+MAP), feature-space adaptive training, mixed-gender modeling, and full-covariance modeling using CSMAPLR transforms, in addition to several other techniques that have proved effective in our previous systems. Subjective evaluation results show that the new system generates significantly better quality synthetic speech than speaker-dependent approaches with realistic amounts of speech data, and that it bears comparison with speaker-dependent approaches even when large amounts of speech data are available. In addition, a comparison study with several speech synthesis techniques shows the new syste...
In conventional speech synthesis, large amounts of phonetically balanced speech data recorded in hig...
The European Community’s Seventh Framework Programme (FP7/2007-2013) under Grant agreement 213845 (t...
This paper describes a speaker-independent/adaptive HMM-based speech synthesis system developed for ...
This paper describes an HMM-based speech synthesis system developed by the HTS working group for th...
For the 2008 Blizzard Challenge, we used the same speaker-adaptive approach to HMM-based speech synt...
As speech synthesis techniques become more advanced, we are able to consider building high-quality v...
In this paper we analyze the effects of several factors and configuration choices encountered during...
A statistical parametric approach to speech synthesis based on hidden Markov models (HMMs) has grown...
ICASSP2008: IEEE International Conference on Acoustics, Speech, and Signal Processing, March 30 - ...
For the 2009 Blizzard Challenge we have built an unsupervised version of the HTS-2008 speaker-adapti...
Statistical parametric, especially Hidden Markov Model-based, text-to-speech (TTS) synthesis has rec...
The synthesis of child speech presents challenges both in the collection of data and in the building...
Text-to-speech (TTS) synthesis is the artificial production of human speech. This paper rev...