This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics
Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of t...
A uniform phase representation for the harmonic model in speech synthesis applications Gilles Degott...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
This paper presents the perceptual experiments that were carried out in order to validate the method...
This paper presents the perceptual experiments that were carried out in order to validate the method...
Copyright © 2014 Carlos Monzo et al. This is an open access article distributed under the Creative C...
Both of the prosody and spectral features are important for emotional speech synthesis. Besides pros...
This paper describes recent progress in our approach to generating expressive speech. A goal of text...
One of the biggest challenges in speech synthesis is the production of naturally sounding synthetic ...
We describe an approach to simulate different phonation types, following John Laver’s terminology, b...
Voice transformation is the process of transforming the characteristics of speech uttered by a sourc...
This document will review a sample of available voice modelling and transformation techniques, in vi...
Voice quality is recognized to play an important role for the rendering of emotions in verbal commun...
This work presents a study on the suitability of prosodic andacoustic features, with a special focus...
Voice quality is the perceived timbre of speech. Considering the interaction between voice quality a...
Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of t...
A uniform phase representation for the harmonic model in speech synthesis applications Gilles Degott...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...
This paper presents the perceptual experiments that were carried out in order to validate the method...
This paper presents the perceptual experiments that were carried out in order to validate the method...
Copyright © 2014 Carlos Monzo et al. This is an open access article distributed under the Creative C...
Both of the prosody and spectral features are important for emotional speech synthesis. Besides pros...
This paper describes recent progress in our approach to generating expressive speech. A goal of text...
One of the biggest challenges in speech synthesis is the production of naturally sounding synthetic ...
We describe an approach to simulate different phonation types, following John Laver’s terminology, b...
Voice transformation is the process of transforming the characteristics of speech uttered by a sourc...
This document will review a sample of available voice modelling and transformation techniques, in vi...
Voice quality is recognized to play an important role for the rendering of emotions in verbal commun...
This work presents a study on the suitability of prosodic andacoustic features, with a special focus...
Voice quality is the perceived timbre of speech. Considering the interaction between voice quality a...
Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of t...
A uniform phase representation for the harmonic model in speech synthesis applications Gilles Degott...
HMM-based speech synthesis offers a way to generate speech with different voice qualities. However, ...