Over the past several years, I have been conducting research on subword modeling in speech recognition. The research is most specifically aimed at the difficult task of identifying and characterizing unknown words, although the proposed framework also has utility in other recognition tasks such as phonological and prosodic modeling. The approach exploits the linguistic substructure ofwOPP by describing graphemic, phonemic, phonological, syllabic, and morphemic constraints through a set of context-free rules, and supporting the resulting parse treeswee a corpus- 13 trained probability model. A derived finite state transducer representation forms a natural means for integrating the trained model into a recognizer search. This paper descr...
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
There is now considerable evidence that fine-grained acoustic-phonetic detail in the speech signal h...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
In today's society, speech recognition systems have reached a mass audience, especially in the field...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Because in agglutinative languages the number of observed word forms is very high, subword units are...
Despite the proliferation of speech-enabled applications and devices, speech-driven human-machine in...
The goal of speech recognition is to find the most probable word given the acoustic evidence, i.e. a...
By definition, words that are not present in a recognition vocabulary are called out-of-vocabulary (...
Speech recognition is the task of decoding an acoustic speech signal into a written text. Large voca...
We describe a novel way to implement subword language models in speech recognition systems based on ...
We study two key issues in task-independent training, namely selection of a universal set of subword...
Speech recognition is the process of converting acoustic waveforms into text. This requires models t...
A new language model for speech recognition inspired by linguistic analysis is presented. The model ...
A realistic model of speech recognition and understanding should be heavily based both on linguistic...
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
There is now considerable evidence that fine-grained acoustic-phonetic detail in the speech signal h...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
In today's society, speech recognition systems have reached a mass audience, especially in the field...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Because in agglutinative languages the number of observed word forms is very high, subword units are...
Despite the proliferation of speech-enabled applications and devices, speech-driven human-machine in...
The goal of speech recognition is to find the most probable word given the acoustic evidence, i.e. a...
By definition, words that are not present in a recognition vocabulary are called out-of-vocabulary (...
Speech recognition is the task of decoding an acoustic speech signal into a written text. Large voca...
We describe a novel way to implement subword language models in speech recognition systems based on ...
We study two key issues in task-independent training, namely selection of a universal set of subword...
Speech recognition is the process of converting acoustic waveforms into text. This requires models t...
A new language model for speech recognition inspired by linguistic analysis is presented. The model ...
A realistic model of speech recognition and understanding should be heavily based both on linguistic...
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
There is now considerable evidence that fine-grained acoustic-phonetic detail in the speech signal h...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...