The detection of prosodic characteristics is an important aspect of both speech synthesis and speech recognition. Correct placement of pitch accents aids in more natural sounding speech, while automatic detection of accents can contribute to better wordlevel recognition and better textual understanding. In this paper we investigate probabilistic, contextual, and phonological factors that influence pitch accent placement in natural, conversational speech in a sequence labeling setting. We introduce Conditional Random Fields (CRFs) to pitch accent prediction task in order to incorporate these factors efficiently in a sequence model. We demonstrate the usefulness and the incremental effect of these factors in a sequence model by performing exp...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
The detection of prosodic characteristics is an important aspect of both speech synthesis and speech...
Abstract — While the current TTS systems can deliver quite acceptable segmental quality of synthesiz...
Determining pitch accents in a sentence is a key task for a text-to-speech (TTS) system. We describe...
Determining pitch accents in a sentence is a key task for a text-to-speech (TTS) system. We describe...
Abstract Determining pitch accents in a sentence is a key task for a textto-speech (TTS) system. We ...
The variability and reduction that are characteristic of talking in natural interaction make it very...
The variability and reduction that are characteristic of talking in natural interaction make it very...
Previous work has shown that the energy components of frequency subbands with a variety of frequenci...
Previous work has shown that the energy components of frequency subbands with a variety of frequenci...
Previous work has shown that the energy components of frequency subbands with a variety of frequenci...
I describe a limited-resource approach to generating prosody that mediates text-based information th...
Tonal cues play an important role in distinguishing ambiguous words in Mandarin speech recognition. ...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
The detection of prosodic characteristics is an important aspect of both speech synthesis and speech...
Abstract — While the current TTS systems can deliver quite acceptable segmental quality of synthesiz...
Determining pitch accents in a sentence is a key task for a text-to-speech (TTS) system. We describe...
Determining pitch accents in a sentence is a key task for a text-to-speech (TTS) system. We describe...
Abstract Determining pitch accents in a sentence is a key task for a textto-speech (TTS) system. We ...
The variability and reduction that are characteristic of talking in natural interaction make it very...
The variability and reduction that are characteristic of talking in natural interaction make it very...
Previous work has shown that the energy components of frequency subbands with a variety of frequenci...
Previous work has shown that the energy components of frequency subbands with a variety of frequenci...
Previous work has shown that the energy components of frequency subbands with a variety of frequenci...
I describe a limited-resource approach to generating prosody that mediates text-based information th...
Tonal cues play an important role in distinguishing ambiguous words in Mandarin speech recognition. ...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...
International audiencePronunciation adaptation consists in predicting pronunciation variants of word...