In our previous papers, we proposed join cost functions derived from spectral distances, which correlate well with perceptual scores obtained for a range of concatenation discontinuities. To further validate their ability to predict concatenation discontinuities, we have chosen the three best spectral distances and evaluated them subjectively in a listening test. The unit sequences for the synthesis stimuli are obtained from a state-of-the-art unit selection text-to-speech system: rVoice from Rhetorical Systems Ltd. In this paper, we report listeners' preferences for each of the three join cost functions.
This project aims to contribute to current research on the quality of speech synthesis by conductin...
Our goal is to automatically learn a perceptually optimal target cost function for a unit selection ...
In concatenative text-to-speech (TTS) synthesis systems, unit selection aims to reduce the number of ...
In our previous papers, we have proposed join cost functions derived from spectral distances, which ...
In our previous papers, we have proposed join cost functions derived from spectral distances, which ...
In unit selection-based concatenative speech synthesis, join cost (also known as concatenation cost)...
Undoubtedly, state-of-the-art unit selection-based concatenative speech systems produce very high qu...
Undoubtedly, state-of-the-art unit selection-based concatenative speech systems produce very high qu...
In our previous papers, we have proposed join cost functions derived from spectral distances, which ...
The quality of unit selection based concatenative speech synthesis mainly depends on how well two su...
In unit selection based concatenative speech systems, join cost, which measures how well two units c...
In unit selection based concatenative speech systems, join cost, which measures how well two units c...
Unit selection synthesis predominates today, but is not yet of a quality to rival natural speech. U...
We introduce a new method for computing join cost in unit-selection speech synthesis which uses a li...
A significant problem with unit selection based speech synthesis is the listener perception of soun...