In this work we investigate methods to extend the lexicon of a broadcast news (BN) speech recognition system in order to minimize the out-of-vocabulary (OOV) word rate. In particular, the OOV word class within the BM trigram language model is linked to a new unigram LM that is dynamically adapted to cope with language changes over time. LM extensions are evaluated according to the achieved OOV word rate, perplexity, and word-error rate. The last criterion implicitly takes into account the quality of the phonetic transcription used for the new words. In the here proposed experiments, phonetic transcriptions of new words are generated automatically by an in-house developed phonetic transcribe
This paper investigates the effectiveness of online temporal language model adaptation when applied ...
This paper describes extensions and improvements to IBM’s large vocabulary continuous speech recogni...
In this paper, we describe a method to enhance the readability of the textual output in a large voca...
The daily and real-time transcription of Broadcast News (BN) is a challenging task both in acoustic ...
One of the most prevailing problems of large-vocabulary speech recognition systems is the large numb...
Although the vocabularies of ASR systems are designed to achieve high coverage for the expected doma...
This paper investigates the problem of updating over time the statistical language model (LM) of an ...
High out-of-vocabulary (OOV) rates are one of the most prevail-ing problems for languages with a rap...
High out-of-vocabulary (OOV) rates are one of the most prevail-ing problems for languages with a rap...
This paper describes first results of our DARPA-sponsored efforts toward recognizing and browsing fo...
This paper describes the IBM approach to Broadcast News Transcription. Typical problems in the Broa...
A regular automatic speech recognizer works with a so-called recognition lexicon. This lexicon conta...
This paper describes our efforts in extending a large vocabulary speech recognition system to handle...
This paper investigates methods for coping with out-of-vocabulary words in a large vocabulary speech...
This thesis deals with the problem of Out-Of-Vocabulary words in speech recognition. The standard re...
This paper investigates the effectiveness of online temporal language model adaptation when applied ...
This paper describes extensions and improvements to IBM’s large vocabulary continuous speech recogni...
In this paper, we describe a method to enhance the readability of the textual output in a large voca...
The daily and real-time transcription of Broadcast News (BN) is a challenging task both in acoustic ...
One of the most prevailing problems of large-vocabulary speech recognition systems is the large numb...
Although the vocabularies of ASR systems are designed to achieve high coverage for the expected doma...
This paper investigates the problem of updating over time the statistical language model (LM) of an ...
High out-of-vocabulary (OOV) rates are one of the most prevail-ing problems for languages with a rap...
High out-of-vocabulary (OOV) rates are one of the most prevail-ing problems for languages with a rap...
This paper describes first results of our DARPA-sponsored efforts toward recognizing and browsing fo...
This paper describes the IBM approach to Broadcast News Transcription. Typical problems in the Broa...
A regular automatic speech recognizer works with a so-called recognition lexicon. This lexicon conta...
This paper describes our efforts in extending a large vocabulary speech recognition system to handle...
This paper investigates methods for coping with out-of-vocabulary words in a large vocabulary speech...
This thesis deals with the problem of Out-Of-Vocabulary words in speech recognition. The standard re...
This paper investigates the effectiveness of online temporal language model adaptation when applied ...
This paper describes extensions and improvements to IBM’s large vocabulary continuous speech recogni...
In this paper, we describe a method to enhance the readability of the textual output in a large voca...