Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to benefit from more training data, and better lend themselves to adaptation to under-resourced languages. However, initialisation from monolingual context-dependent models leads to an explosion of context-dependent states. Connectionist Temporal Classification (CTC) is a potential solution to this as it performs well with monophone labels.\ We investigate multilingual CTC training in the context of adaptation and regularisation techniques that have been shown to be beneficial in more conventional contexts. The multilingual model is trained to model a universal International Phonetic Alphabet (IPA)-based phone set using the CTC loss function. L...
Multilingual automatic speech recognition (ASR) systems mostly benefit low resource languages but su...
International audiencePrevious work has shown that end-to-end neural-based speech recognition system...
This paper presents a study on multilingual deep neural net-work (DNN) based acoustic modeling and i...
Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to...
Phoneme-based multilingual connectionist temporal classification (CTC) model is easily extensible to...
In this article, we propose a simple yet effective approach to train an end-to-end speech recognitio...
<p>For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonem...
Adapting Automatic Speech Recognition (ASR) models to new domains results in a deterioration of perf...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
Different training and adaptation techniques for multilingual Automatic Speech Recognition (ASR) are...
International audienceThis paper investigates speaker adaptation techniques for bidirectional long ...
We investigate multilingual modeling in the context of a deep neural network (DNN) – hidden Markov ...
The recent development of neural network-based automatic speech recognition (ASR) systems has greatl...
Multilingual speech recognition systems mostly benefit low resource languages but suffer degradation...
Recent breakthroughs in automatic speech recognition (ASR) have resulted in a word error rate (WER) ...
Multilingual automatic speech recognition (ASR) systems mostly benefit low resource languages but su...
International audiencePrevious work has shown that end-to-end neural-based speech recognition system...
This paper presents a study on multilingual deep neural net-work (DNN) based acoustic modeling and i...
Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to...
Phoneme-based multilingual connectionist temporal classification (CTC) model is easily extensible to...
In this article, we propose a simple yet effective approach to train an end-to-end speech recognitio...
<p>For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonem...
Adapting Automatic Speech Recognition (ASR) models to new domains results in a deterioration of perf...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
Different training and adaptation techniques for multilingual Automatic Speech Recognition (ASR) are...
International audienceThis paper investigates speaker adaptation techniques for bidirectional long ...
We investigate multilingual modeling in the context of a deep neural network (DNN) – hidden Markov ...
The recent development of neural network-based automatic speech recognition (ASR) systems has greatl...
Multilingual speech recognition systems mostly benefit low resource languages but suffer degradation...
Recent breakthroughs in automatic speech recognition (ASR) have resulted in a word error rate (WER) ...
Multilingual automatic speech recognition (ASR) systems mostly benefit low resource languages but su...
International audiencePrevious work has shown that end-to-end neural-based speech recognition system...
This paper presents a study on multilingual deep neural net-work (DNN) based acoustic modeling and i...