Many of the kinds of language model used in speech understanding suffer from imperfect modeling of intra-sentential contextual influences. I argue that this problem can be addressed by clustering the sentences in a training corpus automatically into subcorpora on the criterion of entropy reduction, and calculating separate language model parameters for each cluster. This kind of clustering offers a way to represent important contextual effects and can therefore significantly improve the performance of a model. It also offers a reasonably automatic means to gather evidence on whether a more complex, context-sensitive model using the same general kind of linguistic information is likely to reward the effort that would be required to develop i...
Many recent studies on large-scale language models have reported successful in-context zero- and few...
The ability to discover groupings in continuous stimuli on the basis of distributional information i...
The incorporation of grammatical information into speech recognition systems is often used to increa...
Croft (2001) argues that distributional analysis of word classes is doomed to failure because there ...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
Prior work has shown that generalization of data in an Example Based Machine Translation (EBMT) syst...
Colloque avec actes et comité de lecture.This article describes a comparative study of language mode...
Abstract. In this paper, we present a language model based on clusters obtained by applying regular ...
Pelemans J., Van hamme H., Wambacq P., ''Translation-based word clustering for language models'', Bo...
This study investigates the joint influences of three factors on the discovery of new word-like unit...
Cluster analysis related to computational linguistics seldom concerned with Pragmatics level. Featur...
ABSTRACT-This paper proposes a new method for learning a context-sensitive conditional probability c...
We describe and experimentally evaluate a method for automatically clustering words according to the...
Abstract: "A parallel distributed processing model is described that learns to comprehend single cla...
In this paper, we review recent progress in the field of machine learning and examine its implicatio...
Many recent studies on large-scale language models have reported successful in-context zero- and few...
The ability to discover groupings in continuous stimuli on the basis of distributional information i...
The incorporation of grammatical information into speech recognition systems is often used to increa...
Croft (2001) argues that distributional analysis of word classes is doomed to failure because there ...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
Prior work has shown that generalization of data in an Example Based Machine Translation (EBMT) syst...
Colloque avec actes et comité de lecture.This article describes a comparative study of language mode...
Abstract. In this paper, we present a language model based on clusters obtained by applying regular ...
Pelemans J., Van hamme H., Wambacq P., ''Translation-based word clustering for language models'', Bo...
This study investigates the joint influences of three factors on the discovery of new word-like unit...
Cluster analysis related to computational linguistics seldom concerned with Pragmatics level. Featur...
ABSTRACT-This paper proposes a new method for learning a context-sensitive conditional probability c...
We describe and experimentally evaluate a method for automatically clustering words according to the...
Abstract: "A parallel distributed processing model is described that learns to comprehend single cla...
In this paper, we review recent progress in the field of machine learning and examine its implicatio...
Many recent studies on large-scale language models have reported successful in-context zero- and few...
The ability to discover groupings in continuous stimuli on the basis of distributional information i...
The incorporation of grammatical information into speech recognition systems is often used to increa...