This paper presents a framework for custom-tailoring voice font in data-driven TTS systems. Three criteria for unit pruning, the prosodic outlier criterion, the importance criterion and the combination of the two, are proposed. The performance of voice fonts in different sizes which are pruned with the three criteria is evaluated by simulating speech synthesis over large amount of texts and estimating the naturalness with an objective measure at the same time. The result shows that the combined criterion performs the best among the three. The pre-estimated curve for naturalness vs. database size might be used as a reference for custom-tailoring voice font. The naturalness remains almost unchanged when 50 % of instances are pruned off with t...
This paper describes the first TTS evaluation campaign designed for Spanish. Seven research institut...
In numerous domains, the usage of synthetic speech is conditioned upon the ability of speech synthes...
The paper deals with the process of designing a phonetically and prosodically rich speech corpus for...
In unit selection based speech synthesizer, defining a good unit set is crucial to the speech qualit...
International audienceVoice corpus plays a crucial role in the quality of the synthetic speech gener...
International audienceTTS voice building generally relies on a script extracted from a big text corp...
Improving the naturalness of synthetic speech is an essential task in developing a text-to-speech (T...
Nowadays, with emerging new voice corpora, voice corpus reduction in expressive TTS becomes more imp...
International audienceText-to-Speech (TTS) systems rely on a grapheme-to-phoneme converter which is ...
In concatenative text-to-speech (TTS) synthesis systems unit selection aims to reduce the number of ...
This research aims to construct a high-quality Japanese TTS (Text-to-Speech) system that has high fl...
Current text-to-speech (TTS) systems are increasingly faced with mixed language tex-tual input. Most...
International audienceUnit selection speech synthesis systems generally rely on target and concatena...
Text to speech synthesis (TTS) is the production of artificial speech by a machine for the given tex...
This paper describes the first TTS evaluation campaign designed for Spanish. Seven research institut...
In numerous domains, the usage of synthetic speech is conditioned upon the ability of speech synthes...
The paper deals with the process of designing a phonetically and prosodically rich speech corpus for...
In unit selection based speech synthesizer, defining a good unit set is crucial to the speech qualit...
International audienceVoice corpus plays a crucial role in the quality of the synthetic speech gener...
International audienceTTS voice building generally relies on a script extracted from a big text corp...
Improving the naturalness of synthetic speech is an essential task in developing a text-to-speech (T...
Nowadays, with emerging new voice corpora, voice corpus reduction in expressive TTS becomes more imp...
International audienceText-to-Speech (TTS) systems rely on a grapheme-to-phoneme converter which is ...
In concatenative text-to-speech (TTS) synthesis systems unit selection aims to reduce the number of ...
This research aims to construct a high-quality Japanese TTS (Text-to-Speech) system that has high fl...
Current text-to-speech (TTS) systems are increasingly faced with mixed language tex-tual input. Most...
International audienceUnit selection speech synthesis systems generally rely on target and concatena...
Text to speech synthesis (TTS) is the production of artificial speech by a machine for the given tex...
This paper describes the first TTS evaluation campaign designed for Spanish. Seven research institut...
In numerous domains, the usage of synthetic speech is conditioned upon the ability of speech synthes...
The paper deals with the process of designing a phonetically and prosodically rich speech corpus for...