A complete inventoly of accented and unaccented diphones, produced by a professional male speaker, was made. In order to evaluate the quality of synthetic speech using both types of diphones, we asked subjects to judge the naturalness and fluency of different versions of an utterance. Two experiments with isolated polysyllabic words indicate that the use of accented or unaccented diphones has a perceptual effect on long vowels only. In a third experiment, we evaluated the use of the different diphone types in short sentences with a fixed temporal struc ture. It appears that the use of unaccented instead of accented diphones for unaccented syllables does not result systematically in more natural-sounding speech
The perceived quality of synthetic speech strongly depends on its prosodic naturalness. Departing fr...
The model of speech production generally used in speech synthesis is that of a source modified by a ...
In this paper, we present a comparative study of natural and synthetic speech samples which vary in ...
A complete inventoly of accented and unaccented diphones, produced by a professional male speaker, w...
In this paper, three experiments are reported that were run in order to assess the quality of Dutch ...
This paper focuses on the creation of word-lists for making diphone recordings for speech synthesis...
This dissertation presents the importance of diphones’ duration and f0 information in generating mor...
Speech production measurements have revealed many regularities in the time domain . The literature h...
110 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1981.The purpose of this work is t...
With limited training data, infrequent triphone models for speech recognition will not be observed i...
In spoken dialogue systems, in which humans interact with computers over the telephone, it is essent...
A new analysis-synthesis algorithm has been developed for high quality diphone speech synthesis, bas...
Synthetic speech is a commonly used form of computer-generated speech. Synthetic speech is different...
In an effort to select a speech representation for our next generation concatenative text-to-speech ...
Synthetic speech is a commonly used form of computer-generated speech. Synthetic speech is different...
The perceived quality of synthetic speech strongly depends on its prosodic naturalness. Departing fr...
The model of speech production generally used in speech synthesis is that of a source modified by a ...
In this paper, we present a comparative study of natural and synthetic speech samples which vary in ...
A complete inventoly of accented and unaccented diphones, produced by a professional male speaker, w...
In this paper, three experiments are reported that were run in order to assess the quality of Dutch ...
This paper focuses on the creation of word-lists for making diphone recordings for speech synthesis...
This dissertation presents the importance of diphones’ duration and f0 information in generating mor...
Speech production measurements have revealed many regularities in the time domain . The literature h...
110 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1981.The purpose of this work is t...
With limited training data, infrequent triphone models for speech recognition will not be observed i...
In spoken dialogue systems, in which humans interact with computers over the telephone, it is essent...
A new analysis-synthesis algorithm has been developed for high quality diphone speech synthesis, bas...
Synthetic speech is a commonly used form of computer-generated speech. Synthetic speech is different...
In an effort to select a speech representation for our next generation concatenative text-to-speech ...
Synthetic speech is a commonly used form of computer-generated speech. Synthetic speech is different...
The perceived quality of synthetic speech strongly depends on its prosodic naturalness. Departing fr...
The model of speech production generally used in speech synthesis is that of a source modified by a ...
In this paper, we present a comparative study of natural and synthetic speech samples which vary in ...