Abstract—We investigate cross-lingual acoustic modelling for low resource languages using the subspace Gaussian mixture model (SGMM). We assume the presence of acoustic models trained on multiple source languages, and use the global subspace parameters from those models for improved modelling in a target language with limited amounts of transcribed speech. Experiments on the GlobalPhone corpus using Spanish, Por-tuguese, and Swedish as source languages and German as target language (with 1 hour and 5 hours of transcribed audio) show that multilingually trained SGMM shared parameters result in lower word error rates (WERs) than using those from a single source language. We also show that regularizing the estimation of the SGMM state vectors ...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
The subspace Gaussian mixture model (SGMM) has been exploited for cross-lingual speech recognition. ...
The subspace Gaussian mixture model (SGMM) has been recently proposed as an acoustic modeling techni...
Although research has previously been done on multilingual speech recognition, it has been found to ...
This paper concerns cross-lingual acoustic modeling in the case when there are limited target langua...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
Recent studies have shown that speech recognizers may benefit from data in languages other than the ...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
The subspace Gaussian mixture model (SGMM) has been exploited for cross-lingual speech recognition. ...
The subspace Gaussian mixture model (SGMM) has been recently proposed as an acoustic modeling techni...
Although research has previously been done on multilingual speech recognition, it has been found to ...
This paper concerns cross-lingual acoustic modeling in the case when there are limited target langua...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
In most of state-of-the-art speech recognition systems, Gaussian mixture models (GMMs) are used to ...
Recent studies have shown that speech recognizers may benefit from data in languages other than the ...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixtu...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...