This paper investigates the use of aggregation as a means of improving the performance and robustness of mixture Gaussian models. This technique produces models that are more accurate and more robust to different test sets than traditional cross-validation using a development set. A theoretical justification for this technique is presented along with experimental results in phonetic classification, phonetic recognition, and word recognition tasks on the TIMIT and Resource Management corpora. In speech classification and recognition tasks error rate reductions of up to 12% were observed using this technique. A method for utilizing treestructured density functions for the purpose of pruning the aggregated models is also presented. 1. INTRODUC...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
International audienceAcoustic modeling techniques, based on clustering of the training data, have b...
Summarization: This work presents techniques for improved cross-language transfer of speech...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
An estimation of parameters of a multivariate Gaussian Mixture Model is usually based on a criterion...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper provides an overview of Gaussian Mixture Model (GMM) and its component of speech signal. ...
In an automatic speech recognition system using a tied-mixture acoustic model, the main cost in CPU...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
International audienceAcoustic modeling techniques, based on clustering of the training data, have b...
Summarization: This work presents techniques for improved cross-language transfer of speech...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood ...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
An estimation of parameters of a multivariate Gaussian Mixture Model is usually based on a criterion...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper provides an overview of Gaussian Mixture Model (GMM) and its component of speech signal. ...
In an automatic speech recognition system using a tied-mixture acoustic model, the main cost in CPU...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
International audienceAcoustic modeling techniques, based on clustering of the training data, have b...
Summarization: This work presents techniques for improved cross-language transfer of speech...