Under-resourced speech recognizers may benefit from data in languages other than the target language. In this paper, we report how to boost the performance of an Afrikaans automatic speech recognition system by using already available Dutch data. We successfully exploit available multilingual resources through 1) posterior features, estimated by multilayer perceptrons (MLP) and 2) subspace Gaussian mixture models (SGMMs). Both the MLPs and the SGMMs can be trained on out-of-language data. We use three different acoustic modeling techniques, namely Tandem, Kullback-Leibler divergence based HMMs (KL-HMM) as well as SGMMs and show that the proposed multilingual systems yield 12% relative improvement compared to a conventional monolingual HMM/G...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...
Sahraeian R., Van Compernolle D., de Wet F., ''Using generalized maxout networks and phoneme mapping...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Recent studies have shown that speech recognizers may benefit from data in languages other than the ...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
Posterior based acoustic modeling techniques such as Kullback– Leibler divergence based HMM (KL-HMM)...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
The development of automatic speech recognition systems requires significant quantities of annotated...
One of the most important problems that needs tackling for wide deployment of Automatic Speech Recog...
Standard automatic speech recognition (ASR) systems use phonemes as subword units. Thus, one of the ...
In this paper, we explore how different acoustic modeling tech-niques can benefit from data in langu...
Automatic speech recognition (ASR) does not perform equally well on every speaker. There is bias aga...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...
Sahraeian R., Van Compernolle D., de Wet F., ''Using generalized maxout networks and phoneme mapping...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Recent studies have shown that speech recognizers may benefit from data in languages other than the ...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
Posterior based acoustic modeling techniques such as Kullback– Leibler divergence based HMM (KL-HMM)...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
The development of automatic speech recognition systems requires significant quantities of annotated...
One of the most important problems that needs tackling for wide deployment of Automatic Speech Recog...
Standard automatic speech recognition (ASR) systems use phonemes as subword units. Thus, one of the ...
In this paper, we explore how different acoustic modeling tech-niques can benefit from data in langu...
Automatic speech recognition (ASR) does not perform equally well on every speaker. There is bias aga...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...
Sahraeian R., Van Compernolle D., de Wet F., ''Using generalized maxout networks and phoneme mapping...