Speech recognition applications are known to require a significant amount of resources. However, embedded speech recognition only authorizes few KB of memory, few MIPS, and small amount of training data. In order to fit the resource constraints of embedded applications, an approach based on a semicontinuous HMM system using state-independent acoustic modelling is proposed. A transformation is computed and applied to the global model in order to obtain each HMM state-dependent probability density functions, authorizing to store only the transformation parameters. This approach is evaluated on two tasks: digit and voice-command recognition. A fast adaptation technique of acoustic models is also proposed. In order to significantly reduce compu...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
The performance of a speech recognizer is degraded drastically in reverberant environments. We propo...
Challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy office...
Speech recognition applications are known to require a significant amount of resources. However, em...
Speech recognition applications are known to require a significant amount of resources (training dat...
International audienceSpeech recognition applications are known to require a significant amount of r...
In this paper, we present several methods for mapping recognition engine requirements to mobile phon...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
In this paper a challenging scenario is addressed in which a hands-free speech recognizer operates i...
A challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy offi...
Speech recognition applications are known to require a significant amount of memory. However, the ta...
Good HMM-based speech recognition performance requires at most minimal inaccuracies to be introduced...
Senones were introduced to share Hidden Markov model (HMM) parameters at a sub-phonetic level in [3]...
ICASSP1997: IEEE International Conference on Acoustics, Speech, and Signal Processing, April 21-24...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
The performance of a speech recognizer is degraded drastically in reverberant environments. We propo...
Challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy office...
Speech recognition applications are known to require a significant amount of resources. However, em...
Speech recognition applications are known to require a significant amount of resources (training dat...
International audienceSpeech recognition applications are known to require a significant amount of r...
In this paper, we present several methods for mapping recognition engine requirements to mobile phon...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
In this paper a challenging scenario is addressed in which a hands-free speech recognizer operates i...
A challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy offi...
Speech recognition applications are known to require a significant amount of memory. However, the ta...
Good HMM-based speech recognition performance requires at most minimal inaccuracies to be introduced...
Senones were introduced to share Hidden Markov model (HMM) parameters at a sub-phonetic level in [3]...
ICASSP1997: IEEE International Conference on Acoustics, Speech, and Signal Processing, April 21-24...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
The performance of a speech recognizer is degraded drastically in reverberant environments. We propo...
Challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy office...