Speech recognition applications are known to require a significant amount of resources (training data, memory, computing power). However, the targeted context of this work- mobile phone embed-ded speech recognition system- only authorizes few KB of memory, few MIPS and usually small amount of training data. In order to fit the resource constraints, an approach based on a semi-continuous HMM system using a GMM-based state-independent acoustic modeling is proposed in this paper. A trans-formation is computed and applied to the global GMM in order to obtain each of the HMM state-dependent probability density func-tions. This strategy aims at storing only the transformation function parameters for each state and authorizes to decrease the amoun...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
International audienceSpeech recognition applications are known to require a significant amount of r...
Speech recognition applications are known to require a significant amount of resources. However, emb...
In this paper, we present several methods for mapping recognition engine requirements to mobile phon...
Speech recognition applications are known to require a significant amount of memory. However, the ta...
ASRU2005: IEEE Automatic Speech Recognition and Understanding Workshop, November 27, 2005, San Juan...
Abstract We present our study on semi-supervised Gaussian mixture model (GMM) hidden Markov model (H...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper introduces a method for regularization of HMM sys-tems that avoids parameter overfitting ...
For large-scale deployments of speaker verification systems models size can be an important issue fo...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
Reservoir Computing (RC) has recently been introduced as an interesting alternative for acoustic mod...
Good HMM-based speech recognition performance requires at most minimal inaccuracies to be introduced...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
International audienceSpeech recognition applications are known to require a significant amount of r...
Speech recognition applications are known to require a significant amount of resources. However, emb...
In this paper, we present several methods for mapping recognition engine requirements to mobile phon...
Speech recognition applications are known to require a significant amount of memory. However, the ta...
ASRU2005: IEEE Automatic Speech Recognition and Understanding Workshop, November 27, 2005, San Juan...
Abstract We present our study on semi-supervised Gaussian mixture model (GMM) hidden Markov model (H...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper introduces a method for regularization of HMM sys-tems that avoids parameter overfitting ...
For large-scale deployments of speaker verification systems models size can be an important issue fo...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
Reservoir Computing (RC) has recently been introduced as an interesting alternative for acoustic mod...
Good HMM-based speech recognition performance requires at most minimal inaccuracies to be introduced...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...