Prosody is a kind of cues that are critical to human speech perception and comprehension, so it is plausible to integrate prosodic information into machine speech recognition. However, as a result of the supra-segmental nature, it is hard to integrate prosodic information with conventional acoustic features. Recently, RNNLMs have shown to be the state-of-the-art language model in many tasks. We thus attempt to integrate prosodic information into RNNLMs for improving speech recognition performance based on rescoring strategy. Firstly, three word-level prosodic features are extracted from speech and then passed to RNNLMs separately. Therefore RNNLMs predict the next word based on prosodic features and word history. Experiments conducted on Li...
Abstract—Does prosody help word recognition? This paper proposes a novel probabilistic framework in ...
Prosody is known to play an important role in human speech perception process. Therefore, there is a...
This paper presents a novel approach that improves the robustness of prosody dependent language mode...
Prosody is a kind of cues that are critical to human speech perception and comprehension, so it is p...
The goal of this thesis is to advance the use of recurrent neural network language models (RNNLMs) ...
Recurrent neural network language models (RNNLMs) are powerful language modeling techniques. Signifi...
Prosodic breaks prediction from text is a fundamental task to obtain naturalness in text to speech a...
The recurrent neural network language model (RNNLM) has been demonstrated to consistently reduce per...
Recurrent neural network language models (RNNLM) have become an increasingly popular choice for stat...
Recurrent neural network language models (RNNLM) have become an increasingly popular choice for stat...
Deep neural networks have become a veritable alternative to classic speaker recognition and clusteri...
Thesis (Ph.D.)--University of Washington, 2020Prosody comprises aspects of speech that communicate i...
Recurrent neural network language models (RNNLMs) have been successfully applied in a variety of lan...
124 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2004.Prosody (the melody and rhyth...
© 2015 Elsevier B.V. All rights reserved. Due to their advantages over conventional n-gram language ...
Abstract—Does prosody help word recognition? This paper proposes a novel probabilistic framework in ...
Prosody is known to play an important role in human speech perception process. Therefore, there is a...
This paper presents a novel approach that improves the robustness of prosody dependent language mode...
Prosody is a kind of cues that are critical to human speech perception and comprehension, so it is p...
The goal of this thesis is to advance the use of recurrent neural network language models (RNNLMs) ...
Recurrent neural network language models (RNNLMs) are powerful language modeling techniques. Signifi...
Prosodic breaks prediction from text is a fundamental task to obtain naturalness in text to speech a...
The recurrent neural network language model (RNNLM) has been demonstrated to consistently reduce per...
Recurrent neural network language models (RNNLM) have become an increasingly popular choice for stat...
Recurrent neural network language models (RNNLM) have become an increasingly popular choice for stat...
Deep neural networks have become a veritable alternative to classic speaker recognition and clusteri...
Thesis (Ph.D.)--University of Washington, 2020Prosody comprises aspects of speech that communicate i...
Recurrent neural network language models (RNNLMs) have been successfully applied in a variety of lan...
124 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2004.Prosody (the melody and rhyth...
© 2015 Elsevier B.V. All rights reserved. Due to their advantages over conventional n-gram language ...
Abstract—Does prosody help word recognition? This paper proposes a novel probabilistic framework in ...
Prosody is known to play an important role in human speech perception process. Therefore, there is a...
This paper presents a novel approach that improves the robustness of prosody dependent language mode...