www.infor.uva.es
Although high-quality unit selection speech synthesizers exist, they are based on a reading-style approach. However, new applications such as Speech-to-Speech Translation or Speech User Interfaces demand a talking style, which is more natural in these contexts. Disfluencies are a major characteristic of talking style, so it is useful to be able to generate disfluent speech. This paper presents a preliminary analysis of pitch and segmental duration in repetitions and filled pauses. Simple rules to predict these prosodic features are derived from this analysis and used for synthesis. Evaluation shows an increase in naturalness, while overall quality decreases.
Natural-sounding speech synthesis requires close control over the temporal structure of the speech f...
This article describes a perceptual evaluation of the prosodic structure of a spontaneously produced...
In this contribution we investigate the distribution of disfluencies and pause length in spoken Fren...
Betz S, Wagner P, Schlangen D. Micro-Structure of Disfluencies: Basics for Conversational Speech Syn...
Disfluent speech synthesis is necessary in some applications such as automatic film dubbing or spoke...
During public presentations or interviews, speakers commonly and unconsciously abuse interjections o...
This study examines prosodic parameters in two types of disfluencies, vowel lengthenings, and filled...
A key difference between spontaneous speech and controlled laboratory speech is the prevalence of di...
This paper explores the results of a previous experiment concerning listeners’ ratings of different...
This dissertation presents the importance of diphones’ duration and f0 information in generating mor...
This paper reports preliminary results from a study of disfluencies in European Portuguese, based on...
Paper presented at: Speech Prosody 2016; 2016 May 31-June 3; Boston (MA, USA). Speech synthesis has im...
The present investigation examined the effects of noise on prosodic and segmental timing in speech ...
This paper presents an exploratory work to automatically insert disfluencies i...
People pause between words and sentences when they speak. They pause to emphasize content, or to mak...