Spontaneous conversational speech has many characteristics that are currently not well modelled in unit selection and HMM-based speech synthesis. But in order to build synthetic voices more suitable for interaction we need data that exhibits more conversational characteristics than the generally used read aloud sentences. In this paper we will show how carefully selected utterances from a spontaneous conversation was instrumental for building an HMM-based synthetic voices with more natural sounding conversational characteristics than a voice based on carefully read aloud sentences. We also investigated a style blending technique as a solution to the inherent problem of phonetic coverage in spontaneous speech data. But the lack of an appropr...
This article reports the results of two experiments in which factors such as duration, amplitude an...
While text-to-speech has long been centered on the production of an intelligible message of good qua...
Speech technology can facilitate human-machine interaction and create new communi-cation interfaces....
Conventional synthetic voices can synthesise neutral read aloud speech well. But, to make synthetic ...
Unit selection speech synthesis has reached high levels of naturalness and intelligibility for neutr...
This chapter describes how a very large corpus of conversational speech is being tested as a source ...
As speech synthesis techniques become more advanced, we are able to consider building high-quality v...
The ability to use the recorded audio of a subject's voice to produce an open-domain synthesis syste...
This review gives a general overview of techniques used in statistical parametric speech synthesis. ...
The synthesis of child speech presents challenges both in the collection of data and in the building...
At the time of beginning this thesis, statistical parametric speech synthesis (SPSS) using hidden M...
We analyse the contribution of higher-level elements of the linguistic specification of a data-drive...
Phenomena like filled pauses, laughter, breathing, hesitation, etc. play significant role in everyda...
In speaker-adaptive HMM-based speech synthesis, there are typically a few speakers for which the out...
A statistical parametric approach to speech synthesis based on hidden Markov models (HMMs) has grown...
This article reports the results of two experiments in which factors such as duration, amplitude an...
While text-to-speech has long been centered on the production of an intelligible message of good qua...
Speech technology can facilitate human-machine interaction and create new communi-cation interfaces....
Conventional synthetic voices can synthesise neutral read aloud speech well. But, to make synthetic ...
Unit selection speech synthesis has reached high levels of naturalness and intelligibility for neutr...
This chapter describes how a very large corpus of conversational speech is being tested as a source ...
As speech synthesis techniques become more advanced, we are able to consider building high-quality v...
The ability to use the recorded audio of a subject's voice to produce an open-domain synthesis syste...
This review gives a general overview of techniques used in statistical parametric speech synthesis. ...
The synthesis of child speech presents challenges both in the collection of data and in the building...
At the time of beginning this thesis, statistical parametric speech synthesis (SPSS) using hidden M...
We analyse the contribution of higher-level elements of the linguistic specification of a data-drive...
Phenomena like filled pauses, laughter, breathing, hesitation, etc. play significant role in everyda...
In speaker-adaptive HMM-based speech synthesis, there are typically a few speakers for which the out...
A statistical parametric approach to speech synthesis based on hidden Markov models (HMMs) has grown...
This article reports the results of two experiments in which factors such as duration, amplitude an...
While text-to-speech has long been centered on the production of an intelligible message of good qua...
Speech technology can facilitate human-machine interaction and create new communi-cation interfaces....