In speech synthesis the unit inventory is decided using phonological and phonetic expertise. This process is resource intensive and potentially sub-optimal. In this paper we investigate how acoustic clustering, together with lexicon constraints, can be used to build a self-organised inventory. Six English speech synthesis systems were built using two frameworks, unit selection and parametric HTS for three inventory conditions: 1) a traditional phone set, 2) a system using orthographic units, and 3) a self-organised inventory. A listening test showed a strong preference for the classic system, and for the orthographic system over the self-organised system. Results also varied by letter to sound complexity and database coverage. This suggests...
The purpose of this research is to determine the best method for deciding on an optimal set of conca...
The paper focuses on the unit selection approach to speech synthesis, discussing drawbacks mainly re...
This paper presents a method for selecting speech units for polyphone concatenative speech synthesis...
In speech synthesis the inventory of units is decided by inspection and on the basis of phonological...
This paper describes a new method for synthesizing speech by concatenating sub-word units from a dat...
We describe a concatenative speech synthesiser for British English which uses the HADIFIX [8] invent...
While developing lexical resources for a particular language variety (Viennese), we experimented wit...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
One approach to the generation of natural-sounding synthesized speech waveforms is to select and con...
This paper presents a new technique for speech synthesis by unit selection. The technique works by ...
Initial attempts at performing text-to-speech conversion based on standard orthographic units are pr...
We describe a concatenative speech synthesiser for British English which uses the HADIFIX inventory ...
Unit-selection speech-synthesis systems have been proposed. In most of the unit-selection speech-syn...
In this paper, a novel method for the selection of synthesis unit is proposed. The monosyllables are...
Unit selection synthesis predominates today, but is not yet of a quality to rival natural speech. U...
The purpose of this research is to determine the best method for deciding on an optimal set of conca...
The paper focuses on the unit selection approach to speech synthesis, discussing drawbacks mainly re...
This paper presents a method for selecting speech units for polyphone concatenative speech synthesis...
In speech synthesis the inventory of units is decided by inspection and on the basis of phonological...
This paper describes a new method for synthesizing speech by concatenating sub-word units from a dat...
We describe a concatenative speech synthesiser for British English which uses the HADIFIX [8] invent...
While developing lexical resources for a particular language variety (Viennese), we experimented wit...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
One approach to the generation of natural-sounding synthesized speech waveforms is to select and con...
This paper presents a new technique for speech synthesis by unit selection. The technique works by ...
Initial attempts at performing text-to-speech conversion based on standard orthographic units are pr...
We describe a concatenative speech synthesiser for British English which uses the HADIFIX inventory ...
Unit-selection speech-synthesis systems have been proposed. In most of the unit-selection speech-syn...
In this paper, a novel method for the selection of synthesis unit is proposed. The monosyllables are...
Unit selection synthesis predominates today, but is not yet of a quality to rival natural speech. U...
The purpose of this research is to determine the best method for deciding on an optimal set of conca...
The paper focuses on the unit selection approach to speech synthesis, discussing drawbacks mainly re...
This paper presents a method for selecting speech units for polyphone concatenative speech synthesis...