In this work we design an approach for automatic feature selection and voice creation for expressive synthesis. Our approach is guided by two main goals: (1) increasing the flexibility of expressive voice creation and (2) overcoming the limitations of speaking styles in expressive synthesis. We define a novel set of features that combines traditionally used prosodic features with spectral features, and we propose the use of iVectors. With these features we perform unsupervised clustering of an audiobook excerpt and, from the resulting clusters, we create synthetic voices using the SAT technique. To evaluate clustering performance, we propose an objective technique for evaluating the unsupervised clustering results, based on perplexity reduction. This o...
PhD (Information Technology), North-West University, Vaal Triangle Campus, 2014. Text-to-speech synthe...
Text-to-speech synthesis (TTS) has progressed to such a stage that given a large, clean, phoneticall...
Speech is the most common way of communication among humans. People who cannot communicate through s...
This work presents a study on the suitability of prosodic and acoustic features, with a special focus...
Freely available audiobooks are a rich resource of expressive speech recordings that can be used for...
In this paper, we explore how to construct stylistic TTS databases from audio books, in whi...
Expressive synthesis from text is a challenging problem. There are two issues. First, read text is o...
Audiobooks are a powerful source of rich information for speech synthesis. Recent work has been foc...
This work aims at creating expressive voices from audiobooks using semantic selection. First, for ea...
In this thesis work, we address the expressivity of read speech with a particular type of data...
In this thesis, we study the expressivity of read speech with a particular type of data, which are...
By definition, spontaneous speech is unscripted and created on the fly by the speaker. It is dramati...
The objective of this thesis is the generation of a high quality expressive audio-book, using natura...
Voice quality plays a pivotal role in speech style variation. Therefore, control and analysis of vo...