It has been a common practice in speech recognition and elsewhere to approximate the log likelihood of a Gaussian mixture model (GMM) with the maximum component log likelihood. While often a computational necessity, the max approximation comes at a price of inferior modeling when the Gaussian components significantly over-lap. This paper shows how the approximation error can be reduced by changing component priors. In our experiments the loss in word error rate due to max approximation, albeit small, is reduced by 50-100 % at no cost in computational efficiency. Furthermore, we expect acoustic models will become larger with time and increase compo-nent overlap and word error rate loss. This makes reducing the ap-proximation error more relev...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
International audienceLVCSR systems are usually based on continuous density HMMs, which are typicall...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
This article considers a new approximation to the log-likelihood surface in mixture models. This app...
Most automatic speech recognition (ASR) systems express probability densities over sequences of acou...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
LVCSR systems are usually based on continuous density HMMs, which are typically implemented using Ga...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
International audienceLVCSR systems are usually based on continuous density HMMs, which are typicall...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
This article considers a new approximation to the log-likelihood surface in mixture models. This app...
Most automatic speech recognition (ASR) systems express probability densities over sequences of acou...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
LVCSR systems are usually based on continuous density HMMs, which are typically implemented using Ga...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...