The development of a speech recognition system requires at least three resources: a large labeled speech corpus to build the acoustic model, a pronunciation lexicon to map words to phone sequences, and a large text corpus to build the language model. For many languages such as dialects or minority languages, these resources are limited or even unavailable - we label these languages as under-resourced. In this thesis, the focus is to develop reliable acoustic models for under-resourced languages. The following three works have been proposed. In the first work, reliable acoustic models are built by transferring acoustic information from well-resourced languages (source) to under-resourced languages (target). Specifically, the phone models of...
Multilingual speech recognition has drawn significant attention as an effective way to compensate da...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
While recent automatic speech recognition systems achieve remarkable performance when large amounts ...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
This thesis investigates methods for Acoustic Modeling in Automatic Speech Recog- nition, assuming l...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Automatic speech recognition systems have so far been developed only for very few languages out of t...
Developing high-performance speech processing systems for low-resource languages is very challenging...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
In this paper we describe an approach that both creates crosslingual acoustic monophone model sets f...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
Multilingual speech recognition has drawn significant attention as an effective way to compensate da...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
Due to abundant resources not always being available for resource-limited languages, training an aco...
Multilingual speech recognition has drawn significant attention as an effective way to compensate da...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
While recent automatic speech recognition systems achieve remarkable performance when large amounts ...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
This thesis investigates methods for Acoustic Modeling in Automatic Speech Recog- nition, assuming l...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Automatic speech recognition systems have so far been developed only for very few languages out of t...
Developing high-performance speech processing systems for low-resource languages is very challenging...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
In this paper we describe an approach that both creates crosslingual acoustic monophone model sets f...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
Multilingual speech recognition has drawn significant attention as an effective way to compensate da...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
Due to abundant resources not always being available for resource-limited languages, training an aco...
Multilingual speech recognition has drawn significant attention as an effective way to compensate da...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
While recent automatic speech recognition systems achieve remarkable performance when large amounts ...