This chapter describes how a very large corpus of conversational speech is being tested as a source of units for concatenative speech synthesis. It shows that the challenge no longer lies in phone-sized unit selection, but in categorising larger units for their affective and pragmatic effect. The work is exploratory by nature, but much progress has been made, and we now have the beginnings of an understanding of the types of grammar and the ontology of vocal productions that will be required for the interactive synthesis of conversational speech. The chapter describes the processes involved and explains some of the features selected for optimal expressive speech rendering.
This paper deals with the design of a speech corpus for a concatenation-based text-to-speech (TTS) s...
Traditional and commercial speech synthesizers are incapable of synthesizing speech with proper emot...
Stöber K, Portele T, Wagner P, Hess W. Synthesis by Word Concatenation. In: Proceedings of Interspe...
One approach to the generation of natural-sounding synthesized speech waveforms is to select and con...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Conventional synthetic voices can synthesise neutral read aloud speech well. But, to make synthetic ...
The goal of this work was to develop a speech synthesis system which concatenates variable-length un...
Spontaneous conversational speech has many characteristics that are currently not well modelled in u...
Unit selection speech synthesis has reached high levels of naturalness and intelligibility for neutr...
Speech is the means of communication in the vocal form, used to express one’s emotions, thoughts and...
When creating voices for concatenative speech synthesis, several hours of speech uttered by a profes...
The purpose of this research is to determine the best method for deciding on an optimal set of conca...
In the past few decades, technology has advanced and made human lifestyles more convenient. One exam...
Concatenating units of natural speech is one method of speech synthesis. Most such systems use an i...
There have been many efforts to improve the quality of speech synthesis systems in conversational AI...