Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-density hidden Markov models (HMMs). Despite the good recognition performance that these systems achieve on average in large vocabulary applications, there is a large variability in performance across speakers. Performance degrades dramatically when the user is radically different from the training population. A popular technique that can improve the performance and robustness of a speech recognition system is adapting speech models to the speaker, and more generally to the channel and the task. In continuous mixture-density HMMs the number of component densities is typically very large, and it may not be feasible to acquire a sufficient amount o...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Summarization: The mismatch that frequently occurs between the training and testing conditions of an...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
Nowadays, almost all speaker-independent (SI) speech recognition systems use CDHMM with multivariate...
Summarization: An algorithm is proposed that achieves a good tradeoff between modeling resolution an...
Summarization: The recognition accuracy in recent large vocabulary Automatic Speech Recognition (ASR...
Summarization: The recognition accuracy in previous large vocabulary automatic speech recognition (A...
International audienceAcoustic modeling techniques, based on clustering of the training data, have b...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
Summarization: Speaker adaptation is recognized as an essential part of today’s large-vocabulary aut...
The challenge of speaker adaptation is to reliably fine-tune models of a general population to fit t...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
Gaussian mixture (GMM)-HMMs, though being the predominant modeling technique for speech recognition,...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Summarization: The mismatch that frequently occurs between the training and testing conditions of an...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
Nowadays, almost all speaker-independent (SI) speech recognition systems use CDHMM with multivariate...
Summarization: An algorithm is proposed that achieves a good tradeoff between modeling resolution an...
Summarization: The recognition accuracy in recent large vocabulary Automatic Speech Recognition (ASR...
Summarization: The recognition accuracy in previous large vocabulary automatic speech recognition (A...
International audienceAcoustic modeling techniques, based on clustering of the training data, have b...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
Summarization: Speaker adaptation is recognized as an essential part of today’s large-vocabulary aut...
The challenge of speaker adaptation is to reliably fine-tune models of a general population to fit t...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
Gaussian mixture (GMM)-HMMs, though being the predominant modeling technique for speech recognition,...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Summarization: The mismatch that frequently occurs between the training and testing conditions of an...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...