With the rise of neural networks, speech synthesis has become almost entirely data-driven. The goal of this thesis is to provide methods for automatic, unsupervised learning from data for expressive speech synthesis. Compared with "ordinary" synthesis systems, reliable expressive training data is harder to find, despite the large amount of material available on sources such as the Internet. The main difficulty lies in the highly speaker- and situation-dependent nature of expressiveness, which causes many acoustically substantial variations. A first consequence is that it is very difficult to define labels that reliably identify expressive speech in all its nuances. The typical definition of 6 basic emotions, or alike, is a simplifica...
Speech is the fundamental mode of human communication, and its synthesis has long been a core priori...
Most speech synthesis systems require a linguistic module to produce the features that drive the spe...
Freely available audiobooks are a rich resource of expressive speech recordings that can be used for...
Recently, text-to-speech (TTS) synthesis has gained immense success in the human-computer interactio...
Great improvement has been made in the field of expressive audiovisual Text-to...
The work of this thesis concerns the modeling of emotions for expressive audiovisual text-to-speech s...
In this paper we present a DNN-based speech synthesis system trained on an audiobook including senti...
This work presents a study on the suitability of prosodic and acoustic features, with a special focus...
This work aims at creating expressive voices from audiobooks using semantic selection. First, for ea...
Speech synthesis is the task of generating speech using computers. Due to the limitations of classic...
Speech Synthesis is the computer process of converting text to voice. This project consists in the s...
Advances in speech synthesis have led to redefinition of the key issues of person-machine communicat...
Expressive synthesis from text is a challenging problem. There are two issues. First, read text is o...
When designing human-machine interfaces it is important to consider not only the bare bones function...