This report proposes a novel approach for Gaussian Mixture Model (GMM) weights decomposition and adaptation. This modeling suggests a new low-dimensional utterance representation method, which uses a simple factor analysis similar to that of the i-vector framework. The suggested approach is applied to the Robust Automatic Transcription of Speech (RATS) language identification evaluation corpus, where the speech recordings are from highly degraded communication channels. In our experiments, after modeling each utterance using the proposed approach, a Deep Belief Networks (DBN) is utilized to recognize the language of utterances.The assessment results show that the proposed method improves conventional maximum likelihood weight adaptation. I...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
Over the last decade, the factor analysis based modeling of a variable length speech utterance into ...
Abstract—This paper addresses the issue of speaker variability and session variability in text-indep...
Abstract—Recent studies show that Gaussian mixture model (GMM) weights carry less, yet complimentary...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
ii Speaker adaptation based on the Universal Background Model (UBM) has become a standard approach f...
Dehak N., Plchot O., Bahari M.H., Burget L. , Van hamme H., Dehak R., ''GMM weights adaptation based...
A novel speaker adaptation algorithm based on Gaussian mixture weight adaptation is described. A sma...
In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for sp...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
This paper presents Subspace Gaussian Mixture Model (SGMM) approach employed as a probabilistic gene...
This paper describes a speaker recognition system based on feature extraction utilizing the constrai...
In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight ada...
Supervectors represent speaker-specific Gaussian Mixture Models which are enrolled from a Universal ...
Language identification (LID) of speech data recorded over noisy communication channels is a challen...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
Over the last decade, the factor analysis based modeling of a variable length speech utterance into ...
Abstract—This paper addresses the issue of speaker variability and session variability in text-indep...
Abstract—Recent studies show that Gaussian mixture model (GMM) weights carry less, yet complimentary...
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition system...
ii Speaker adaptation based on the Universal Background Model (UBM) has become a standard approach f...
Dehak N., Plchot O., Bahari M.H., Burget L. , Van hamme H., Dehak R., ''GMM weights adaptation based...
A novel speaker adaptation algorithm based on Gaussian mixture weight adaptation is described. A sma...
In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for sp...
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech reco...
This paper presents Subspace Gaussian Mixture Model (SGMM) approach employed as a probabilistic gene...
This paper describes a speaker recognition system based on feature extraction utilizing the constrai...
In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight ada...
Supervectors represent speaker-specific Gaussian Mixture Models which are enrolled from a Universal ...
Language identification (LID) of speech data recorded over noisy communication channels is a challen...
Summarization: A trend in automatic speech recognition systems is the use of continuous mixture-dens...
Over the last decade, the factor analysis based modeling of a variable length speech utterance into ...
Abstract—This paper addresses the issue of speaker variability and session variability in text-indep...