Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems usually use cepstral features as acoustic observation and phonemes as subword units. Speech signal exhibits wide range of variability such as, due to environmental variation, speaker variation. This leads to different kinds of mismatch, such as, mismatch between acoustic features and acoustic models or mismatch between acoustic features and pronunciation models (given the acoustic models). The main focus of this work is on integrating auxiliary knowledge sources into standard ASR systems so as to make the acoustic models more robust to the variabilities in the speech signal. We refer to the sources of knowledge that are able to provide additional information...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Spoken words convey several levels of information. At the primary level, the speech conveys words or...
One of the key challenges involved in building statistical automatic speech recog-nition (ASR) syste...
Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems usually use ceps...
Automatic speech recognition (ASR) is a very challenging problem due to the wide variety of the data...
Automatic speech recognition bases its models on the acoustic features derived from the speech signa...
The state-of-the-art automatic speech recognition (ASR) systems typically use phonemes as subword un...
Developing a phonetic lexicon for a language requires linguistic knowledge as well as human effort, ...
Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems use phonemes as ...
Tandem systems transform the cepstral features into posterior probabilities of subword units using a...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...
Phonological studies suggest that the typical subword units such as phones or phonemes used in autom...
There is growing interest in using graphemes as subword units, especially in the context of the rapi...
International audienceHeterogeneous knowledge sources that model speech only at certain time frames ...
In this paper we present a study of automatic speech recognition systems using context-dependent pho...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Spoken words convey several levels of information. At the primary level, the speech conveys words or...
One of the key challenges involved in building statistical automatic speech recog-nition (ASR) syste...
Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems usually use ceps...
Automatic speech recognition (ASR) is a very challenging problem due to the wide variety of the data...
Automatic speech recognition bases its models on the acoustic features derived from the speech signa...
The state-of-the-art automatic speech recognition (ASR) systems typically use phonemes as subword un...
Developing a phonetic lexicon for a language requires linguistic knowledge as well as human effort, ...
Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems use phonemes as ...
Tandem systems transform the cepstral features into posterior probabilities of subword units using a...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...
Phonological studies suggest that the typical subword units such as phones or phonemes used in autom...
There is growing interest in using graphemes as subword units, especially in the context of the rapi...
International audienceHeterogeneous knowledge sources that model speech only at certain time frames ...
In this paper we present a study of automatic speech recognition systems using context-dependent pho...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Spoken words convey several levels of information. At the primary level, the speech conveys words or...
One of the key challenges involved in building statistical automatic speech recog-nition (ASR) syste...