How can we effectively develop speech technology for languages where no transcribed data is available? Many existing approaches use no annotated resources at all, yet it makes sense to leverage information from large annotated corpora in other languages, for example in the form of multilingual bottleneck features (BNFs) obtained from a supervised speech recognition system. In this work, we evaluate the benefits of BNFs for subword modeling (feature extraction) in six unseen languages on a word discrimination task. First we establish a strong unsupervised baseline by combining two existing methods: vocal tract length normalisation (VTLN) and the correspondence autoencoder (cAE). We then show that BNFs trained on a single language already bea...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
For a language with no transcribed speech available (the zero-resource scenario), conventional acous...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
We investigate the extraction of bottle-neck features (BNFs) for multiple languages without access t...
Copyright © 2014 ISCA. Developing high-performance speech processing systems for low-resource langua...
We describe a novel way to implement subword language models in speech recognition systems based on ...
In today's society, speech recognition systems have reached a mass audience, especially in the field...
<p>We investigate the extraction of bottle-neck features (BNFs) for multiple languages without acces...
Automatic speech recognition systems have so far been developed only for very few languages out of t...
This paper investigates the application of hierarchical MRASTA bottleneck (BN) features for under-re...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
This study addresses unsupervised subword modeling, i.e.,learning feature representations that can d...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
For a language with no transcribed speech available (the zero-resource scenario), conventional acous...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
We investigate the extraction of bottle-neck features (BNFs) for multiple languages without access t...
Copyright © 2014 ISCA. Developing high-performance speech processing systems for low-resource langua...
We describe a novel way to implement subword language models in speech recognition systems based on ...
In today's society, speech recognition systems have reached a mass audience, especially in the field...
<p>We investigate the extraction of bottle-neck features (BNFs) for multiple languages without acces...
Automatic speech recognition systems have so far been developed only for very few languages out of t...
This paper investigates the application of hierarchical MRASTA bottleneck (BN) features for under-re...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
This study addresses unsupervised subword modeling, i.e.,learning feature representations that can d...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
For a language with no transcribed speech available (the zero-resource scenario), conventional acous...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...