In many ways, the lexicon remains the Achilles heel of modern automatic speech recogniz-ers (ASRs). Unlike stochastic acoustic and language models that learn the values of their parameters from training data, the baseform pronunciations of words in an ASR vocabulary are typically specified manually, and do not change, unless they are edited by an expert. Our work presents a novel generative framework that uses speech data to learn stochastic lexicons, thereby taking a step towards alleviating the need for manual intervention and au-tomnatically learning high-quality baseform pronunciations for words. We test our model on a variety of domains: an isolated-word telephone speech corpus, a weather query corpus and an academic lecture corpus. We...
In this paper, we tackle the problem of pronunciation inference and Out-of-Vocabulary (OOV) enrollme...
INTRODUCTION Pronunciations in spontaneous, conversational speech tend to be much more variable tha...
We present a framework for discovering acoustic units and generating an associated pronunciation lex...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Pronunciations for words are a critical component in an automated speech recognition system (ASR) as...
Standard automatic speech recognition (ASR) systems use phoneme-based pronunciation lexicon prepared...
The pronunciation dictionary, or lexicon, is an essential component in an automatic speech recogniti...
Abstract The creation of a pronunciation lexicon remains the most inefficient process in developing ...
We explore different ways of "spelling" a word in a speech recognizer's lexicon and h...
The creation of a pronunciation lexicon re-mains the most inefficient process in develop-ing an Auto...
Obtaining good pronunciations for named-entities poses a challenge for automated speech recognition ...
In the early '90s, the availability of the TIMIT read-speech phonetically transcribed corpus le...
This article focuses on modeling pronunciation variation in two different ways: data-derived and kno...
The large pronunciation variability of words in conversational speech is one of the major causes of ...
To achieve a robust system the variation seen for different speaking styles must be handled. An inve...
In this paper, we tackle the problem of pronunciation inference and Out-of-Vocabulary (OOV) enrollme...
INTRODUCTION Pronunciations in spontaneous, conversational speech tend to be much more variable tha...
We present a framework for discovering acoustic units and generating an associated pronunciation lex...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Pronunciations for words are a critical component in an automated speech recognition system (ASR) as...
Standard automatic speech recognition (ASR) systems use phoneme-based pronunciation lexicon prepared...
The pronunciation dictionary, or lexicon, is an essential component in an automatic speech recogniti...
Abstract The creation of a pronunciation lexicon remains the most inefficient process in developing ...
We explore different ways of "spelling" a word in a speech recognizer's lexicon and h...
The creation of a pronunciation lexicon re-mains the most inefficient process in develop-ing an Auto...
Obtaining good pronunciations for named-entities poses a challenge for automated speech recognition ...
In the early '90s, the availability of the TIMIT read-speech phonetically transcribed corpus le...
This article focuses on modeling pronunciation variation in two different ways: data-derived and kno...
The large pronunciation variability of words in conversational speech is one of the major causes of ...
To achieve a robust system the variation seen for different speaking styles must be handled. An inve...
In this paper, we tackle the problem of pronunciation inference and Out-of-Vocabulary (OOV) enrollme...
INTRODUCTION Pronunciations in spontaneous, conversational speech tend to be much more variable tha...
We present a framework for discovering acoustic units and generating an associated pronunciation lex...