In this paper, we present several methods for mapping recognition engine requirements to mobile phone resource. The proposed techniques are evaluated on a digit recognition task using both French and English corpora. We investigate particularly three aspects of speech processing: acoustic parameterization, recognition algorithms and acoustic modeling. Several parameterization algorithms (LPCC, MFCC and PLP) are compared to the Linear Predictive Coding (LPC) included in the GSM norm. The two parameterization algorithms, MFCC and PLP, perform significantly better than the other ones. Moreover, feature vector size can be reduced from 13 to 6 PLP coefficients without a significant lost of performance. We compare Dynamic Time Warping (DTW) and H...
Conventional large vocabulary automatic speech recognition (ASR) systems require a mapping from word...
Abstract—Today’s speech recognition systems are based on hidden Markov models (HMMs) with Gaussian m...
The development of a speech recognition system requires at least three resources: a large labeled sp...
This paper is focused on cellular phone embedded speech recog-nition. We present several methods abl...
Speech recognition applications are known to require a significant amount of resources (training dat...
Speech recognition applications are known to require a significant amount of resources. However, em...
Summarization: We describe an approach for the estimation of acoustic phonetic models that will be u...
In general the aim of an automatic speech recognition system is to write down what is said. State of...
International audienceSpeech recognition applications are known to require a significant amount of r...
This paper describes the specification, design and development phases of two widely used telepho...
ICSLP1998: the 5th International Conference on Spoken Language Processing, November 30 - December 4...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
Speech recognition applications are known to require a significant amount of memory. However, the ta...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
The hidden Markov model (HMM) is commonly employed in automatic speech recognition (ASR). The HMM ha...
Conventional large vocabulary automatic speech recognition (ASR) systems require a mapping from word...
Abstract—Today’s speech recognition systems are based on hidden Markov models (HMMs) with Gaussian m...
The development of a speech recognition system requires at least three resources: a large labeled sp...
This paper is focused on cellular phone embedded speech recog-nition. We present several methods abl...
Speech recognition applications are known to require a significant amount of resources (training dat...
Speech recognition applications are known to require a significant amount of resources. However, em...
Summarization: We describe an approach for the estimation of acoustic phonetic models that will be u...
In general the aim of an automatic speech recognition system is to write down what is said. State of...
International audienceSpeech recognition applications are known to require a significant amount of r...
This paper describes the specification, design and development phases of two widely used telepho...
ICSLP1998: the 5th International Conference on Spoken Language Processing, November 30 - December 4...
This paper presents improvements in acoustic and lan-guage modeling for automatic speech recognition...
Speech recognition applications are known to require a significant amount of memory. However, the ta...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
The hidden Markov model (HMM) is commonly employed in automatic speech recognition (ASR). The HMM ha...
Conventional large vocabulary automatic speech recognition (ASR) systems require a mapping from word...
Abstract—Today’s speech recognition systems are based on hidden Markov models (HMMs) with Gaussian m...
The development of a speech recognition system requires at least three resources: a large labeled sp...