Pronunciations for words are a critical component in an automated speech recognition system (ASR) as mis-recognitions may be caused by missing or inaccurate pronunciations. The need for high quality pronunciations has recently motivated data-driven techniques to gen-erate them [1]. We propose a data-driven and language-independent framework for verification of such pronunciations to further improve the lexicon quality in ASR. New candidate pronunciations are veri-fied by re-recognizing historical audio logs and examining the asso-ciated recognition costs. We build an additional pronunciation qual-ity feature from word and pronunciation frequencies in logs. A ma-chine learned classifier trained on these features achieves nearly 90% accuracy ...
In this paper, we develop a new method for compiling a pronunciation dictionary to model pronunciati...
While many studies have been focused on pronunciation modeling for improving word recognition, limit...
In the context of ASR systems it is of major importance to accurately model the allophonic variation...
The pronunciation dictionary, or lexicon, is an essential component in an automatic speech recogniti...
In many ways, the lexicon remains the Achilles heel of modern automatic speech recogniz-ers (ASRs). ...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
This article focuses on modeling pronunciation variation in two different ways: data-derived and kno...
We explore different ways of "spelling" a word in a speech recognizer's lexicon and h...
Automatic Speech Recognition (ASR) can be very useful in language learning tools in order to correct...
The authors address the question whether phonological features can be used effectively in an automat...
Data-Driven Pronunciation Generation for ASR Maria Obedkova In ASR systems, dictionaries are usually...
The large pronunciation variability of words in conversational speech is one of the major causes of ...
This paper focuses on modeling pronunciation variation in two di�erent ways: data-derived and knowle...
INTRODUCTION Pronunciations in spontaneous, conversational speech tend to be much more variable tha...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In this paper, we develop a new method for compiling a pronunciation dictionary to model pronunciati...
While many studies have been focused on pronunciation modeling for improving word recognition, limit...
In the context of ASR systems it is of major importance to accurately model the allophonic variation...
The pronunciation dictionary, or lexicon, is an essential component in an automatic speech recogniti...
In many ways, the lexicon remains the Achilles heel of modern automatic speech recogniz-ers (ASRs). ...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
This article focuses on modeling pronunciation variation in two different ways: data-derived and kno...
We explore different ways of "spelling" a word in a speech recognizer's lexicon and h...
Automatic Speech Recognition (ASR) can be very useful in language learning tools in order to correct...
The authors address the question whether phonological features can be used effectively in an automat...
Data-Driven Pronunciation Generation for ASR Maria Obedkova In ASR systems, dictionaries are usually...
The large pronunciation variability of words in conversational speech is one of the major causes of ...
This paper focuses on modeling pronunciation variation in two di�erent ways: data-derived and knowle...
INTRODUCTION Pronunciations in spontaneous, conversational speech tend to be much more variable tha...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In this paper, we develop a new method for compiling a pronunciation dictionary to model pronunciati...
While many studies have been focused on pronunciation modeling for improving word recognition, limit...
In the context of ASR systems it is of major importance to accurately model the allophonic variation...