International audienceIn this paper, we introduce a set of resources that we have derived from the EST RÉPUBLICAIN CORPUS, a large, freely-available collection of regional newspaper articles in French, totaling 150 million words. Our resources are the result of a full NLP treatment of the EST RÉPUBLICAIN CORPUS: handling of multi-word expressions, lemmatization, part-of-speech tagging, and syntactic parsing. Processing of the corpus is carried out using statistical machine-learning approaches - joint model of data driven lemmatization and part- of-speech tagging, PCFG-LA and dependency based models for parsing - that have been shown to achieve state-of-the-art performance when evaluated on the French Treebank. Our derived resources are made...
International audienceThis article presents ANCOR_Centre, a French coreference corpus, available und...
short paper (4 pages)International audienceWe present a semi-supervised method to improve statistica...
International audienceThis work investigates a possibility of combining two different types of corpo...
International audienceIn this paper, we introduce a set of resources that we have derived from the E...
This paper presents the current status of the French treebank developed at Paris 7 (Abeille ́ et al....
Very few gold standard annotated corpora are currently available for French. We present an ongoing p...
International audienceWe present the motivations and objectives of French Passage project that ambit...
9 pagesInternational audienceWe compare the performance of three statistical parsing architectures o...
This paper shows that training a lexicalized parser on a lemmatized morphologically-rich treebank su...
posterInternational audienceThis article introduces results about probabilistic parsing enhanced wit...
International audienceWe present and discuss experiments in statistical parsing of French, where ter...
International audienceAccording to the cost of speech transcription, it is very important to pool da...
International audienceOld French parsing : Which language properties have the greatest influence on ...
We present a method for enlarge a lexicon (with frequencies information), that is useful for parsing...
International audienceThis paper investigates the impact on French dependency parsing of lexical gen...
International audienceThis article presents ANCOR_Centre, a French coreference corpus, available und...
short paper (4 pages)International audienceWe present a semi-supervised method to improve statistica...
International audienceThis work investigates a possibility of combining two different types of corpo...
International audienceIn this paper, we introduce a set of resources that we have derived from the E...
This paper presents the current status of the French treebank developed at Paris 7 (Abeille ́ et al....
Very few gold standard annotated corpora are currently available for French. We present an ongoing p...
International audienceWe present the motivations and objectives of French Passage project that ambit...
9 pagesInternational audienceWe compare the performance of three statistical parsing architectures o...
This paper shows that training a lexicalized parser on a lemmatized morphologically-rich treebank su...
posterInternational audienceThis article introduces results about probabilistic parsing enhanced wit...
International audienceWe present and discuss experiments in statistical parsing of French, where ter...
International audienceAccording to the cost of speech transcription, it is very important to pool da...
International audienceOld French parsing : Which language properties have the greatest influence on ...
We present a method for enlarge a lexicon (with frequencies information), that is useful for parsing...
International audienceThis paper investigates the impact on French dependency parsing of lexical gen...
International audienceThis article presents ANCOR_Centre, a French coreference corpus, available und...
short paper (4 pages)International audienceWe present a semi-supervised method to improve statistica...
International audienceThis work investigates a possibility of combining two different types of corpo...