International audienceIn this work, we introduce the concept of Multiclass for language modeling and we compare it to the Polyclass model. The originality of the Multiclass is its capability to parse a string of classes/tags into variable length independent sequences. A few experimental tests were carried out on a class corpus extracted from the French « Le Monde » word corpus labeled automatically. This corpus contains a set of 43 million of words. In our experiments, Multiclass outperform first-order Polyclass but are slightly outperformed by second-order Polyclass
The universal and typological status of the notion of word class — closely related to part-of-speech...
AbstractWe investigate languages consisting of words following one of the given finitely many patter...
International audienceThe aim of this empirical and statistical study is to describe the classifiers...
International audienceIn this work, we introduce the concept of Multiclass for language modeling and...
International audienceIn contrast to conventional n-gram approaches, which are the most used languag...
International audienceIn this paper we report the results of four experiments conducted to extract l...
In this paper, we propose a new language model based on depen-dent word sequences organized in a mul...
148 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2001.This thesis presents theoreti...
Colloque avec actes et comité de lecture. internationale.International audienceIn this paper, we pro...
Previous attempts to automatically determine multi-words as the basic unit for language modeling hav...
The multigram model assumes that language can be described as the output of a memoryless source that...
In natural language, several sequences of words are very frequent. A classical language model, like ...
International audienceIn this paper, we describe a new language model based on dependent word sequen...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
This chapter is set in the context of Corpus Pattern Analysis (CPA), a technique developed by Patric...
The universal and typological status of the notion of word class — closely related to part-of-speech...
AbstractWe investigate languages consisting of words following one of the given finitely many patter...
International audienceThe aim of this empirical and statistical study is to describe the classifiers...
International audienceIn this work, we introduce the concept of Multiclass for language modeling and...
International audienceIn contrast to conventional n-gram approaches, which are the most used languag...
International audienceIn this paper we report the results of four experiments conducted to extract l...
In this paper, we propose a new language model based on depen-dent word sequences organized in a mul...
148 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2001.This thesis presents theoreti...
Colloque avec actes et comité de lecture. internationale.International audienceIn this paper, we pro...
Previous attempts to automatically determine multi-words as the basic unit for language modeling hav...
The multigram model assumes that language can be described as the output of a memoryless source that...
In natural language, several sequences of words are very frequent. A classical language model, like ...
International audienceIn this paper, we describe a new language model based on dependent word sequen...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
This chapter is set in the context of Corpus Pattern Analysis (CPA), a technique developed by Patric...
The universal and typological status of the notion of word class — closely related to part-of-speech...
AbstractWe investigate languages consisting of words following one of the given finitely many patter...
International audienceThe aim of this empirical and statistical study is to describe the classifiers...