In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU time and memory lies not in the evaluation and storage of Gaussians them-selves but rather in evaluating the mixture likelihoods for each state output distribution. Using a simple entropy-based technique for pruning the mixture weight distributions, we can achieve a significant speedup in recogni-tion for a 5000-word vocabulary with a negli-gible increase in word error rate. This allows us to achieve real-time connected-word dicta-tion on an ARM-based mobile device.
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
In an automatic speech recognition system using a tied-mixture acoustic model, the main cost in CPU...
ICASSP2000: IEEE International Conference on Acoustics, Speech, and Signal Processing, June 5-9, ...
Semi-continuous acoustic models, where the output distri-butions for all Hidden Markov Model states ...
Improved acoustic modeling can significantly decrease the error rate in large-vocabulary speech reco...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
This paper investigates the use of aggregation as a means of improving the performance and robustnes...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
Summarization: An algorithm is proposed that achieves a good tradeoff between modeling resolution an...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
In this paper we present a highly optimized implementation of Gaussian mixture acoustic model evalua...
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
In an automatic speech recognition system using a tied-mixture acoustic model, the main cost in CPU...
ICASSP2000: IEEE International Conference on Acoustics, Speech, and Signal Processing, June 5-9, ...
Semi-continuous acoustic models, where the output distri-butions for all Hidden Markov Model states ...
Improved acoustic modeling can significantly decrease the error rate in large-vocabulary speech reco...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
This paper investigates the use of aggregation as a means of improving the performance and robustnes...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
Summarization: An algorithm is proposed that achieves a good tradeoff between modeling resolution an...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
In this paper we present a highly optimized implementation of Gaussian mixture acoustic model evalua...
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...