This paper investigates the application of hierarchical MRASTA bottleneck (BN) features for under-resourced languages within the IARPA Babel project. Through multilingual training of Multi-layer Perceptron (MLP) BN features on five languages (Cantonese, Pashto, Tagalog, Turkish, and Vietnamese), we could end up in a single feature stream which is more beneficial to all languages than the unilingual features. In the case of balanced corpus sizes, the multilingual BN features improve the automatic speech recogni-tion (ASR) performance by 3-5 % and the keyword search (KWS) by 3-10 % relative for both limited (LLP) and full language packs (FLP). Borrowing orders of magnitude more data from non-target FLPs, the recognition error rate is reduced ...
Spoken content in languages of emerging importance needs to be searchable to provide access to the u...
This paper quantifies the value of pronunciation lexicons in large vocabulary continuous speech reco...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
In this paper we present our latest investigation on multilingual bottle-neck (BN) features and its ...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...
The development of high-performance speech processing systems for low-resource languages is a challe...
Copyright © 2014 ISCA. Developing high-performance speech processing systems for low-resource langua...
Recently there has been increased interest in Automatic Speech Recognition (ASR) and Key Word Spotti...
Copyright © 2014 ISCA. In recent years there has been significant interest in Automatic Speech Recog...
In recent years there has been significant interest in Automatic Speech Recognition (ASR) and Key Wo...
The IARPA Babel program ran from March 2012 to November 2016. The aim of the program was to develop ...
How can we effectively develop speech technology for languages where no transcribed data is availabl...
International audienceThis paper reports on investigations using two techniques for language model t...
This paper presents recent progress in developing speech-to-text (STT) and keyword spotting (KWS) sy...
<p>In this paper we present our latest investigation on initialization schemes for Multilayer Percep...
Spoken content in languages of emerging importance needs to be searchable to provide access to the u...
This paper quantifies the value of pronunciation lexicons in large vocabulary continuous speech reco...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
In this paper we present our latest investigation on multilingual bottle-neck (BN) features and its ...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...
The development of high-performance speech processing systems for low-resource languages is a challe...
Copyright © 2014 ISCA. Developing high-performance speech processing systems for low-resource langua...
Recently there has been increased interest in Automatic Speech Recognition (ASR) and Key Word Spotti...
Copyright © 2014 ISCA. In recent years there has been significant interest in Automatic Speech Recog...
In recent years there has been significant interest in Automatic Speech Recognition (ASR) and Key Wo...
The IARPA Babel program ran from March 2012 to November 2016. The aim of the program was to develop ...
How can we effectively develop speech technology for languages where no transcribed data is availabl...
International audienceThis paper reports on investigations using two techniques for language model t...
This paper presents recent progress in developing speech-to-text (STT) and keyword spotting (KWS) sy...
<p>In this paper we present our latest investigation on initialization schemes for Multilayer Percep...
Spoken content in languages of emerging importance needs to be searchable to provide access to the u...
This paper quantifies the value of pronunciation lexicons in large vocabulary continuous speech reco...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...