Intonation plays a crucial role in making synthetic speech sound natural, yet intonation modeling remains largely an open problem. In my thesis, the interpolated F0 contour is parameterized dynamically by sign values, which encode the direction of pitch change, and by corresponding quantized magnitude values, which encode the amount of pitch change in that direction. The sign and magnitude values are used to train a dedicated neural network. The proposed methodology is evaluated against a state-of-the-art DNN-based TTS system. To this end, a segmental synthesizer was implemented to normalize the effect of the spectrum. The synthesizer uses the F0 and linguistic features to predict the spectrum, aperiodicity, and voicing ...
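The sign/magnitude parameterization described above can be illustrated with a minimal sketch. The abstract does not specify the interpolation method, the pitch scale, or the quantization scheme, so everything concrete here is an assumption made for illustration: linear interpolation through unvoiced frames (marked as 0.0), frame-to-frame change measured in semitones, a uniform quantization step `step`, a cap of `levels` magnitude classes, and a sign of 0 when the change quantizes to zero. The function names `interpolate_f0` and `encode_sign_magnitude` are hypothetical.

```python
import math


def interpolate_f0(f0):
    """Linearly interpolate F0 through unvoiced frames (assumed marked as 0.0)."""
    voiced = [i for i, v in enumerate(f0) if v > 0]
    if not voiced:
        raise ValueError("no voiced frames to interpolate from")
    out = list(f0)
    # Hold the first and last voiced values at the utterance edges.
    for i in range(voiced[0]):
        out[i] = f0[voiced[0]]
    for i in range(voiced[-1] + 1, len(f0)):
        out[i] = f0[voiced[-1]]
    # Fill each unvoiced gap between two neighbouring voiced frames.
    for a, b in zip(voiced, voiced[1:]):
        for i in range(a + 1, b):
            t = (i - a) / (b - a)
            out[i] = f0[a] + t * (f0[b] - f0[a])
    return out


def encode_sign_magnitude(f0, step=0.5, levels=8):
    """Encode frame-to-frame pitch change as (signs, magnitude levels).

    Assumptions (not taken from the thesis): change is measured in
    semitones, quantized uniformly with width `step`, and clipped to
    the highest of `levels` classes.
    """
    signs, mags = [], []
    for prev, cur in zip(f0, f0[1:]):
        delta = 12.0 * math.log2(cur / prev)  # pitch change in semitones
        level = min(int(round(abs(delta) / step)), levels - 1)
        # Sign is 0 when the change is too small to leave the zero bin.
        sign = 0 if level == 0 else (1 if delta > 0 else -1)
        signs.append(sign)
        mags.append(level)
    return signs, mags


if __name__ == "__main__":
    raw = [100.0, 0.0, 0.0, 130.0, 130.0, 120.0]  # Hz, 0.0 = unvoiced
    contour = interpolate_f0(raw)
    signs, mags = encode_sign_magnitude(contour)
    print(contour)
    print(signs)
    print(mags)
```

Under these assumptions, the two sequences returned by `encode_sign_magnitude` would serve as the training targets for the network; a real system would likely tune the step size and number of levels on held-out data.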
In this thesis a data-driven and linguistically interpretable intonation model for the automatic ana...
This paper presents a fully machine-driven approach for intonation description and its linguistic in...
This paper describes an implementation of the rise/fall/connection (RFC) model of intonation for us...
This thesis addresses the problem of generating a range of natural sounding pitch contours for spee...
This paper introduces the Tilt intonational model and describes how this model can be used to automa...
This paper proposes linguistic, production and prosodic constraints for modeling the intonat...
This paper describes a general system which maps from a phonological specification of an utterance J...
The tilt intonation model facilitates automatic analysis and synthesis of intonation. The analysis a...
In this paper we propose models for predicting the intonation for the sequence of syllables...
The absence of convincing intonation makes current parametric speech synthesis systems sound dull a...