For low resource languages, collecting sufficient training data to build acoustic and language models is time consuming and often expensive. But large amounts of text data, such as online newspapers, web forums or online encyclopedias, usually exist for languages that have a large population of native speakers. This text data can be easily collected from the web and then used to both expand the recognizer's vocabulary and improve the language model. One challenge, however, is normalizing and filtering the web data for a specific task. In this paper, we investigate the use of online text resources to improve the performance of speech recognition specifically for the task of keyword spotting. For the five languages provided in the base period...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
International audiencehe research presented in the paper addresses conversational telephone speechre...
<p>For low resource languages, collecting sufficient training data to build acoustic and language mo...
We describe the use of text data scraped from the web to augment language models for Automatic Speec...
Recently there has been increased interest in Automatic Speech Recognition (ASR) and Key Word Spotti...
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, ...
© Springer International Publishing AG 2017. The IARPA Babel program ran from March 2012 to November...
Conversational text is a highly varied, and many abbreviations and short forms exist in different la...
Training language model made from conversational speech is difficult due to large variation of the w...
This paper presents recent progress in developing speech-to-text (STT) and keyword spotting (KWS) sy...
International audienceIn this paper we aim to enhance keyword search for conversational telephone sp...
Recently there has been increased interest in Automatic Speech Recognition (ASR) and Key Word Spotti...
International audienceThis paper reports on investigations using two techniques for language model t...
In this dissertation, three research directions were explored to alleviate two major issues, i.e., t...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
International audiencehe research presented in the paper addresses conversational telephone speechre...
<p>For low resource languages, collecting sufficient training data to build acoustic and language mo...
We describe the use of text data scraped from the web to augment language models for Automatic Speec...
Recently there has been increased interest in Automatic Speech Recognition (ASR) and Key Word Spotti...
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, ...
© Springer International Publishing AG 2017. The IARPA Babel program ran from March 2012 to November...
Conversational text is a highly varied, and many abbreviations and short forms exist in different la...
Training language model made from conversational speech is difficult due to large variation of the w...
This paper presents recent progress in developing speech-to-text (STT) and keyword spotting (KWS) sy...
International audienceIn this paper we aim to enhance keyword search for conversational telephone sp...
Recently there has been increased interest in Automatic Speech Recognition (ASR) and Key Word Spotti...
International audienceThis paper reports on investigations using two techniques for language model t...
In this dissertation, three research directions were explored to alleviate two major issues, i.e., t...
This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Rec...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
International audiencehe research presented in the paper addresses conversational telephone speechre...