This paper reports on daily vocabulary and language model adaptation for a European Portuguese broadcast news transcription system. The proposed adaptation framework takes into account characteristics of European Portuguese, such as its high level of inflection and complex verbal system. A multi-pass speech recognition framework is proposed that uses contemporary written texts available daily on the Web. It uses morpho-syntactic knowledge (part-of-speech information) about an in-domain training corpus for daily selection of an optimal vocabulary. Using an information retrieval engine and the ASR hypotheses as query material, relevant documents are extracted from a large, dynamic dataset to generate a story-b...
In this work we investigate methods to extend the lexicon of a broadcast news (BN) speech recognitio...
The paper describes recent progress in the development of Slovak language models for transcription ...
In this paper, an approach for unsupervised dynamic adaptation of the language model used in an auto...
Up-to-date language modeling is recognized to be a critical aspect of maintaining the leve...
The daily and real-time transcription of Broadcast News (BN) is a challenging task both in acoustic ...
Although the vocabularies of ASR systems are designed to achieve high coverage for the expected doma...
The main goal of this work is the adaptation of a broadcast news transcription system to a...
Various information sources naturally contain new words that appear on a daily basis and which are ...
This paper describes an accent identification system for Portuguese that explores different types of...
One of the most prevailing problems of large-vocabulary speech recognition systems is the large numb...
This paper describes first results of our DARPA-sponsored efforts toward recognizing and browsing fo...
This paper investigates the problem of updating over time the statistical language model (LM) of an ...
This paper describes a Broadcast News (BN) database called Transcrigal-DB. The news shows are mainly...
Speech provides a natural way for human–computer interaction. In particular, speech synthesis syste...