International audienceSpoken language speech recognition systems need better understanding of natural spoken language phenomenon than their dictation counterparts. Current language models are mostly based on written text and/or very tedious Wizard of Oz or real dialog experiments1. In this paper we propose to use Internet documents as a very rich source of information for spoken language modeling. Through detailed experiments we show how using Internet we could automatically prepare language models adapted to a given task. For a given recognition system using this approach the word accuracy is up to 15% better than a system using language models trained on written text
We describe the use of text data scraped from the web to augment language models for Automatic Speec...
International audienceThis paper presents CLIPS laboratory activities in speech recognition related ...
This paper addresses a critical problem in deploying a spoken dialog system (SDS). One of the main b...
Language models used in current automatic speech recognition systems are trained on general-purpose ...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
In a text-based discovery and analytical environment, high quality textual representation is needed ...
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, ...
In this paper, a new methodology for speech corpora definition from internet documents is described,...
WOCCI 2008: The 1st Workshop on Child, Computer, and Interaction, October 23, 2008, Chania, Crete,...
It is convenient to use the Internet to create a corpus. Because if the written texts of a certain l...
Training language model made from conversational speech is difficult due to large variation of the w...
International audienceIn this paper, a new methodology for speech corpora definition from internet d...
Speech recognition is the process of converting acoustic waveforms into text. This requires models t...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
In a previous paper we proposed Web-based language models relying on the possibility theory. These m...
We describe the use of text data scraped from the web to augment language models for Automatic Speec...
International audienceThis paper presents CLIPS laboratory activities in speech recognition related ...
This paper addresses a critical problem in deploying a spoken dialog system (SDS). One of the main b...
Language models used in current automatic speech recognition systems are trained on general-purpose ...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
In a text-based discovery and analytical environment, high quality textual representation is needed ...
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, ...
In this paper, a new methodology for speech corpora definition from internet documents is described,...
WOCCI 2008: The 1st Workshop on Child, Computer, and Interaction, October 23, 2008, Chania, Crete,...
It is convenient to use the Internet to create a corpus. Because if the written texts of a certain l...
Training language model made from conversational speech is difficult due to large variation of the w...
International audienceIn this paper, a new methodology for speech corpora definition from internet d...
Speech recognition is the process of converting acoustic waveforms into text. This requires models t...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
In a previous paper we proposed Web-based language models relying on the possibility theory. These m...
We describe the use of text data scraped from the web to augment language models for Automatic Speec...
International audienceThis paper presents CLIPS laboratory activities in speech recognition related ...
This paper addresses a critical problem in deploying a spoken dialog system (SDS). One of the main b...