We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition systems, which adapts both the Gaussian means and the mixture weights. Gaussian means are expressed as a linear combination of eigenvoices estimated with principal component analysis. The non-negative Gaussian mixture weights are expressed as a linear combination of a set of latent vectors estimated with non-negative matrix factorization. Experiments on the Wall Street Journal database show that the combination of weight and mean adaptation consistently improves the performance compared to eigenvoice adaptation only. Improvements up to 5.8% relative word error rate reduction were observed with 40 eigenvoices and 40 latent weight vectors. Furthermor...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
Summarization: The recognition accuracy in recent large vocabulary Automatic Speech Recognition (ASR...
This paper concerns cross-lingual acoustic modeling in the case when there are limited target langua...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
A novel speaker adaptation algorithm based on Gaussian mixture weight adaptation is described. A sma...
In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight ada...
This paper describes a new method for fast speaker adaptation in large vocabulary recognition system...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
Nowadays, almost all speaker-independent (SI) speech recognition systems use CDHMM with multivariate...
This report proposes a novel approach for Gaussian Mixture Model (GMM) weights decomposition and ada...
Abstract—Recent studies show that Gaussian mixture model (GMM) weights carry less, yet complimentary...
Summarization: Speaker adaptation is recognized as an essential part of today’s large-vocabulary aut...
Recently, kernel eigenvoices were revisited using kernel representations of distributions for rapid ...
In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for sp...
Dehak N., Plchot O., Bahari M.H., Burget L. , Van hamme H., Dehak R., ''GMM weights adaptation based...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
Summarization: The recognition accuracy in recent large vocabulary Automatic Speech Recognition (ASR...
This paper concerns cross-lingual acoustic modeling in the case when there are limited target langua...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
A novel speaker adaptation algorithm based on Gaussian mixture weight adaptation is described. A sma...
In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight ada...
This paper describes a new method for fast speaker adaptation in large vocabulary recognition system...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
Nowadays, almost all speaker-independent (SI) speech recognition systems use CDHMM with multivariate...
This report proposes a novel approach for Gaussian Mixture Model (GMM) weights decomposition and ada...
Abstract—Recent studies show that Gaussian mixture model (GMM) weights carry less, yet complimentary...
Summarization: Speaker adaptation is recognized as an essential part of today’s large-vocabulary aut...
Recently, kernel eigenvoices were revisited using kernel representations of distributions for rapid ...
In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for sp...
Dehak N., Plchot O., Bahari M.H., Burget L. , Van hamme H., Dehak R., ''GMM weights adaptation based...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
Summarization: The recognition accuracy in recent large vocabulary Automatic Speech Recognition (ASR...
This paper concerns cross-lingual acoustic modeling in the case when there are limited target langua...