We describe a corpus-based approach to improving synthesized speech quality and present two useful cost functions for unit selection. The first is a pitch-synchronous cross-correlation measure, used as a concatenation cost, which reduces the noise caused by phase mismatch at concatenation points. The second is a discontinuous cost function, applied to both internal and concatenation costs, which eliminates unnecessary cost calculations. An evaluation showed that incorporating the pitch-synchronous cross-correlation cost gave better results than a conventional cost function. In addition, an opinion test assessing the naturalness of the synthesized speech indicated that the proposed method scored 0.7 points higher on a seven-point MOS (Mean Opinion Score) scale than the conventional system. This paper ...
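The abstract does not give the formula for the pitch-synchronous cross-correlation cost, so the following is only a minimal sketch of the general idea, not the authors' definition. It assumes that each unit is cut at a pitch mark, that the pitch period in samples is known at the join, and that the cost is one minus the zero-lag normalized cross-correlation between the last pitch period of the left unit and the first pitch period of the right unit. The function names and the `1 - correlation` mapping are hypothetical.

```python
import math


def normalized_cross_correlation(a, b):
    """Zero-lag normalized cross-correlation of two frames, in [-1, 1]."""
    n = min(len(a), len(b))
    a, b = a[:n], b[:n]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    # Silent frames carry no phase information; treat them as uncorrelated.
    return num / den if den > 0.0 else 0.0


def phase_mismatch_cost(left_unit, right_unit, left_period, right_period):
    """Hypothetical concatenation cost in [0, 2].

    Compares the last pitch period of the left unit with the first pitch
    period of the right unit. A value near 0 means the waveforms line up
    in phase; a value near 2 means they are in opposite phase.
    """
    n = min(left_period, right_period, len(left_unit), len(right_unit))
    if n <= 0:
        raise ValueError("pitch periods and units must be non-empty")
    tail = left_unit[-n:]   # final pitch period before the join
    head = right_unit[:n]   # first pitch period after the join
    return 1.0 - normalized_cross_correlation(tail, head)


if __name__ == "__main__":
    period = 40  # samples per pitch period (illustrative value)
    left = [math.sin(2 * math.pi * k / period) for k in range(2 * period)]
    in_phase = [math.sin(2 * math.pi * k / period) for k in range(2 * period)]
    anti_phase = [math.sin(2 * math.pi * k / period + math.pi)
                  for k in range(2 * period)]
    print(phase_mismatch_cost(left, in_phase, period, period))
    print(phase_mismatch_cost(left, anti_phase, period, period))
```

In this sketch a candidate whose first pitch period continues the phase of the preceding unit receives a low cost, while a phase-inverted candidate receives a high one, which is the kind of mismatch the abstract says causes noise at concatenation points.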
This paper presents a method for selecting speech units for polyphone concatenative speech synthesis...
In unit selection-based concatenative speech synthesis, join cost (also known as concatenation cost)...
Undoubtedly, state-of-the-art unit selection-based concatenative speech systems produce very high qu...
In concatenative text-to-speech (TTS) synthesis systems unit selection aims to reduce the number of ...
This research aims to construct a high-quality Japanese TTS (Text-to-Speech) system that has high fl...
ICASSP2002: IEEE International Conference on Acoustics, Speech and Signal Processing, May 13-17, 2...
For concatenative speech synthesis based on non-uniform unit selection, the key to improving the synth...
In this paper, a novel method for the selection of synthesis units is proposed. The monosyllables are...
The objective of this project is to develop a Chinese Text-to-Speech technique using concatenative s...
Synthetic speech has been developed steadily for the past few decades. The objective of the project...
One approach to the generation of natural-sounding synthesized speech waveforms is to select and con...
Our research goal is to construct a Japanese TTS (Text-to-Speech) system that can output various ki...
ICSLP2002: the 7th International Conference on Spoken Language Processing, September 16-20, 2002, ...
The primary objective of this paper is to provide an overview of existing Concatenative Text-To-Spee...