In this study, we present improvements in N-best rescoring of code-switched speech achieved by n-gram augmentation as well as optimised pretraining of long short-term memory (LSTM) language models with larger corpora of out-of-domain monolingual text. Our investigation specifically considers the impact of the way in which multiple monolingual datasets are interleaved prior to being presented as input to a language model. In addition, we consider the application of large pretrained transformer-based architectures, and present the first investigation employing these models in English-Bantu code-switched speech recognition. Our experimental evaluation is performed on an under-resourced corpus of code-switched speech comprising four bilingual c...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Recently there is growing interest in using neural networks for language modeling. In contrast to th...
This paper investigates the potential of improving a hybrid automatic speech recognition model train...
AbstractWe introduce a new English-isiZulu code-switched speech corpus compiled from South African s...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Recently there has been interest in the approaches for training speech recognition systems for langu...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Thesis (PhD)--Stellenbosch University, 2018.ENGLISH ABSTRACT: Code-switching refers to natural, spon...
In this article, we propose a simple yet effective approach to train an end-to-end speech recognitio...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Abstract—Text corpus size is an important issue when building a language model (LM) in particular wh...
We investigate the impact of recent advances in speech recognition techniques for under-resourced l...
Almost none of the 2,000+ languages spoken in Africa have widely available automatic speech recognit...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Recently there is growing interest in using neural networks for language modeling. In contrast to th...
This paper investigates the potential of improving a hybrid automatic speech recognition model train...
AbstractWe introduce a new English-isiZulu code-switched speech corpus compiled from South African s...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Recently there has been interest in the approaches for training speech recognition systems for langu...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Thesis (PhD)--Stellenbosch University, 2018.ENGLISH ABSTRACT: Code-switching refers to natural, spon...
In this article, we propose a simple yet effective approach to train an end-to-end speech recognitio...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Abstract—Text corpus size is an important issue when building a language model (LM) in particular wh...
We investigate the impact of recent advances in speech recognition techniques for under-resourced l...
Almost none of the 2,000+ languages spoken in Africa have widely available automatic speech recognit...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
Under-resourced speech recognizers may benefit from data in languages other than the target language...
Recently there is growing interest in using neural networks for language modeling. In contrast to th...