In this paper, we explore how different acoustic modeling tech-niques can benefit from data in languages other than the target language. We propose an algorithm to perform decision tree state clustering for the recently proposed Kullback-Leibler di-vergence based hidden Markov models (KL-HMM) and com-pare it to subspace Gaussian mixture modeling (SGMM). KL-HMM can exploit multilingual information in the form of uni-versal phoneme posterior features and SGMM benefits from a universal background model that can be trained on multilingual data. Taking the Greek SpeechDat(II) data as an example, we show that KL-HMM performs best for small amounts of target language data. Index Terms: Speech recognition, multilingual acoustic mod-eling, under-res...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
Due to abundant resources not always being available for resource-limited languages, training an aco...
In this paper we revisit the recently proposed triphone mapping as an alternative to decision tree s...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Although research has previously been done on multilingual speech recognition, it has been found to ...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
Abstract—We investigate cross-lingual acoustic modelling for low resource languages using the subspa...
Abstract—One of the main challenge in non-native speech recognition is how to handle acoustic variab...
This paper introduces two approximations of the Kullback-Leibler divergence for hidden Markov models...
Recently, we established the equivalence of an ergodic HMM (EHMM) to a parallel sub-word recognition...
The development of automatic speech recognition systems requires significant quantities of annotated...
Recently, we established the equivalence of an ergodic HMM (EHMM) to a parallel sub-word recognition...
The subspace Gaussian mixture model (SGMM) has been exploited for cross-lingual speech recognition. ...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
Due to abundant resources not always being available for resource-limited languages, training an aco...
In this paper we revisit the recently proposed triphone mapping as an alternative to decision tree s...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Although research has previously been done on multilingual speech recognition, it has been found to ...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
Abstract—We investigate cross-lingual acoustic modelling for low resource languages using the subspa...
Abstract—One of the main challenge in non-native speech recognition is how to handle acoustic variab...
This paper introduces two approximations of the Kullback-Leibler divergence for hidden Markov models...
Recently, we established the equivalence of an ergodic HMM (EHMM) to a parallel sub-word recognition...
The development of automatic speech recognition systems requires significant quantities of annotated...
Recently, we established the equivalence of an ergodic HMM (EHMM) to a parallel sub-word recognition...
The subspace Gaussian mixture model (SGMM) has been exploited for cross-lingual speech recognition. ...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in tw...
Due to abundant resources not always being available for resource-limited languages, training an aco...
In this paper we revisit the recently proposed triphone mapping as an alternative to decision tree s...