The incorporation of grammatical information into speech recognition systems is often used to increase performance in morphologically rich languages. However, this introduces demands for sufficiently large training corpora and proper methods of using the additional information. In this paper, we present a method for building factored language models that use data obtained by morphosyntactic tagging. The models use only relevant factors that help to increase performance and ignore data from other factors, thus also reducing the need for large morphosyntactically tagged training corpora. Which data is relevant is determined at run-time, based on the current text segment being estimated, i.e., the context. We show that using a context-dependen...
This paper evaluates six commonly available parts-of-speech tagging tools over corpora other than th...
dialog This paper describes a way ofusing intonation and dialog context to improve the performance o...
This paper describes a method for selecting text data from a corpus with the aim of training auxil...
Various information sources naturally contains new words that appear in a daily basis and which are ...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
We present a new approach towards using contextual information to enhance speech recognition and und...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
The goal of this thesis is to explore various strategies for incorporating contextual information in...
Statistical language models (SLMs) for speech recognition have the advantage of robustness, and gram...
International audienceTexts generated by automatic speech recognition (ASR) systems have some specif...
Speech is at the core of human communication. Speaking and listing comes so natural to us that we do...
This thesis focuses on the development of effective and efficient language models (LMs) for speech r...
Context-dependent models for language units are essential in high-accuracy speech recognition. Howev...
Language models (LMs) are often constructed by building com-ponent models on multiple text sources t...
Speech recognition is the task of decoding an acoustic speech signal into a written text. Large voca...
This paper evaluates six commonly available parts-of-speech tagging tools over corpora other than th...
dialog This paper describes a way ofusing intonation and dialog context to improve the performance o...
This paper describes a method for selecting text data from a corpus with the aim of training auxil...
Various information sources naturally contains new words that appear in a daily basis and which are ...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
We present a new approach towards using contextual information to enhance speech recognition and und...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
The goal of this thesis is to explore various strategies for incorporating contextual information in...
Statistical language models (SLMs) for speech recognition have the advantage of robustness, and gram...
International audienceTexts generated by automatic speech recognition (ASR) systems have some specif...
Speech is at the core of human communication. Speaking and listing comes so natural to us that we do...
This thesis focuses on the development of effective and efficient language models (LMs) for speech r...
Context-dependent models for language units are essential in high-accuracy speech recognition. Howev...
Language models (LMs) are often constructed by building com-ponent models on multiple text sources t...
Speech recognition is the task of decoding an acoustic speech signal into a written text. Large voca...
This paper evaluates six commonly available parts-of-speech tagging tools over corpora other than th...
dialog This paper describes a way ofusing intonation and dialog context to improve the performance o...
This paper describes a method for selecting text data from a corpus with the aim of training auxil...