We describe a sub-vector clustering technique to reduce the memory size and computational cost of continuous density hidden Markov models (CHMMs). Acoustic models in modern large-vocabulary, continuous speech recognition systems are typically CHMMs. Systems with 100,000 Gaussian distributions of 40-60 dimensions are common, needing several tens of MB of memory. Computing HMM state likelihoods is several tens of times slower than real time. We show that by clustering and quantizing the Gaussian distributions a few dimensions at a time, both computation and memory costs can be reduced several fold without significant loss of recognition accuracy. On the 1994 Wall Street Journal 20K test set, this technique reduced the acoustic model size by a...
Abstract. In this paper, we introduce a fast estimate algorithm for dis-criminant training of semi-c...
International audienceA fast likelihood computation approach called dynamic Gaussian selection (DGS)...
We present experiments in using neural network based methods to initialize continuous observation de...
Most contemporary laboratory recognizers require too much memory to run, and are too slow for mass a...
It generally takes a long time and requires a large amount of speech data to train hidden Markov mod...
LVCSR systems are usually based on continuous density HMMs, which are typically implemented using Ga...
This paper studies algorithms for reducing the com-putational eort of the mixture density calculatio...
International audienceLVCSR systems are usually based on continuous density HMMs, which are typicall...
Speech recognition systems require too much memory to run and are too slow for mass application. In ...
With the advance in semiconductor memory and the availability of very large speech corpora (of hundr...
Large vocabulary speech recognition systems based on hidden Markov models (HMM) make use of many ten...
In this paper we address the issues in construction of discrete hidden Markov models (HMMs) in the f...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
Senones were introduced to share Hidden Markov model (HMM) parameters at a sub-phonetic level in [3]...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
Abstract. In this paper, we introduce a fast estimate algorithm for dis-criminant training of semi-c...
International audienceA fast likelihood computation approach called dynamic Gaussian selection (DGS)...
We present experiments in using neural network based methods to initialize continuous observation de...
Most contemporary laboratory recognizers require too much memory to run, and are too slow for mass a...
It generally takes a long time and requires a large amount of speech data to train hidden Markov mod...
LVCSR systems are usually based on continuous density HMMs, which are typically implemented using Ga...
This paper studies algorithms for reducing the com-putational eort of the mixture density calculatio...
International audienceLVCSR systems are usually based on continuous density HMMs, which are typicall...
Speech recognition systems require too much memory to run and are too slow for mass application. In ...
With the advance in semiconductor memory and the availability of very large speech corpora (of hundr...
Large vocabulary speech recognition systems based on hidden Markov models (HMM) make use of many ten...
In this paper we address the issues in construction of discrete hidden Markov models (HMMs) in the f...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
Senones were introduced to share Hidden Markov model (HMM) parameters at a sub-phonetic level in [3]...
Acoustic modeling using mixtures of multivariate Gaussians is the prevalent approach for many speech...
Abstract. In this paper, we introduce a fast estimate algorithm for dis-criminant training of semi-c...
International audienceA fast likelihood computation approach called dynamic Gaussian selection (DGS)...
We present experiments in using neural network based methods to initialize continuous observation de...