This paper investigates the impact of subspace based techniques for acoustic modeling in automatic speech recognition (ASR). There are many well known approaches to subspace based speaker adaptation which represent sources of variability as a projection within a low di-mensional subspace. A new approach to acoustic modeling in ASR, referred to as the subspace based Gaussian mixture model (SGMM), represents phonetic variability as a set of projections applied at the state level in a hidden Markov model (HMM) based acoustic model. The impact of the SGMM in modeling these intrinsic sources of vari-ability is evaluated for a continuous speech recognition (CSR) task. The SGMM is shown to provide an 18 % reduction in word error rate (WER) for spe...
In recent years, under the hidden Markov modeling (HMM) framework, the use of subspace Gaussian mixt...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
The subspace Gaussian mixture model (SGMM) has been recently proposed as an acoustic modeling techni...
We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states shar...
Spoken words convey several levels of information. At the primary level, the speech conveys words or...
Intrinsic variability of the speaker in spontaneous speech remains a challenge to state of the art A...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
In recent years, under the hidden Markov modeling (HMM) framework, the use of subspace Gaussian mixt...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
The subspace Gaussian mixture model (SGMM) has been recently proposed as an acoustic modeling techni...
We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states shar...
Spoken words convey several levels of information. At the primary level, the speech conveys words or...
Intrinsic variability of the speaker in spontaneous speech remains a challenge to state of the art A...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
In recent years, under the hidden Markov modeling (HMM) framework, the use of subspace Gaussian mixt...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
The performance of the speech recognition systems to translate voice to text is still an issue in la...