This paper reports rules for morphing a voice to make it be perceived as containing various primitive features, for example, to make it sound more “bright” or “dark”. In a previous work we proposed a three-layered model, which contains emotional speech, primitive features, and acoustic features, for the perception of emotional speech. By experiments and acoustic analysis, we built the relationships between the three layers and reported that such relationships are significant. Then, a bottom-up method was adopted in order to verify the relationships. That is, we morphed (resynthesized) a speech voice by composing acoustic features in the bottommost layer to produce a voice in which listeners could perceive a single or multiple primitive feat...
With advances in the techniques and naturalness of speech synthesis, and the increasing commercial c...
This paper outlines an approach to modelling the dynamics of voice source parameters as observed in ...
We propose to use a comprehensive path model of vocal emotion communication, encompassing encoding, ...
This paper proposes a system to convert neutral speech to emotional with controlled intensity of emo...
This paper proposes an emotional speech synthesis system based on a three-layered model using a dime...
Modern speech synthesis systems with very high intelligibility are readily available in a number of ...
In order to investigate what acoustic features are important to emotional impressions and how those ...
This paper describes some of the results from the project entitled “New Parameterization for Emotion...
We can communicate using speech from which various infor-mation can be perceived. Emotion is an espe...
UnrestrictedEmotions play an important role in human life. They are essential for communication, for...
There has been considerable research into perceptible correlates of emotional state, but a very limi...
This paper proposes a multi-layer approach to modeling perception of expressive speech. Many earlier...
All speech produced by humans includes information about the speaker, including conveying the emotio...
Speech can express subjective meanings and intents that, in order to be fully understood, rely heavi...
In emotional speech studies, it is well known that loudness, pitch, position and length of pauses, e...
With advances in the techniques and naturalness of speech synthesis, and the increasing commercial c...
This paper outlines an approach to modelling the dynamics of voice source parameters as observed in ...
We propose to use a comprehensive path model of vocal emotion communication, encompassing encoding, ...
This paper proposes a system to convert neutral speech to emotional with controlled intensity of emo...
This paper proposes an emotional speech synthesis system based on a three-layered model using a dime...
Modern speech synthesis systems with very high intelligibility are readily available in a number of ...
In order to investigate what acoustic features are important to emotional impressions and how those ...
This paper describes some of the results from the project entitled “New Parameterization for Emotion...
We can communicate using speech from which various infor-mation can be perceived. Emotion is an espe...
UnrestrictedEmotions play an important role in human life. They are essential for communication, for...
There has been considerable research into perceptible correlates of emotional state, but a very limi...
This paper proposes a multi-layer approach to modeling perception of expressive speech. Many earlier...
All speech produced by humans includes information about the speaker, including conveying the emotio...
Speech can express subjective meanings and intents that, in order to be fully understood, rely heavi...
In emotional speech studies, it is well known that loudness, pitch, position and length of pauses, e...
With advances in the techniques and naturalness of speech synthesis, and the increasing commercial c...
This paper outlines an approach to modelling the dynamics of voice source parameters as observed in ...
We propose to use a comprehensive path model of vocal emotion communication, encompassing encoding, ...