Joint uncertainty decoding (JUD) is an effective model-based noise compensation technique for conventional Gaussian mix-ture model (GMM) based speech recognition systems. In this paper, we apply JUD to subspace Gaussian mixture model (SGMM) based acoustic models. The total number of Gaus-sians in the SGMM acoustic model is usually much larger than for conventional GMMs, which limits the application of approaches which explicitly compensate each Gaussian, such as vector Taylor series (VTS). However, by clustering the Gaussian components into a number of regression classes, JUD-based noise compensation can be successfully applied to SGMM systems. We evaluate the JUD/SGMM technique us-ing the Aurora 4 corpus, and the experimental results indic...
We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states shar...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
Common noise compensation techniques use vector Tay-lor series (VTS) to approximate the mismatch fun...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
We consider the problem of Gaussian mixture model (GMM)-based classification of noisy data, where th...
International audienceWe consider Gaussian mixture model (GMM)-based classification from noisy featu...
Revised version including a bugfix in the computation of the Wiener uncertainty estimator and in the...
Abstract—We investigate cross-lingual acoustic modelling for low resource languages using the subspa...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
This paper investigates the impact of subspace based techniques for acoustic modeling in automatic s...
We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states shar...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
Common noise compensation techniques use vector Tay-lor series (VTS) to approximate the mismatch fun...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
We consider the problem of Gaussian mixture model (GMM)-based classification of noisy data, where th...
International audienceWe consider Gaussian mixture model (GMM)-based classification from noisy featu...
Revised version including a bugfix in the computation of the Wiener uncertainty estimator and in the...
Abstract—We investigate cross-lingual acoustic modelling for low resource languages using the subspa...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
This paper investigates the impact of subspace based techniques for acoustic modeling in automatic s...
We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states shar...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...