We present the Modified French Treebank (MFT), a completely revamped French Treebank, derived from the Paris 7 Treebank (P7T), which is cleaner, more coherent, has several transformed structures, and introduces new linguistic analyses. To determine the effect of these changes, we investigate how theMFT fares in statistical parsing. Probabilistic parsers trained on the MFT training set (currently 3800 trees) already perform better than their counterparts trained on five times the P7T data (18,548 trees), providing an extreme example of the importance of data quality over quantity in statistical parsing. Moreover, regression analysis on the learning curve of parsers trained on the MFT lead to the prediction that parsers trained on the full...
International audienceRecently, several statistical parsers have been trained and evaluated on the d...
International audienceIn this paper we present the PASSAGE project which aims at building automatica...
Analyse probabiliste est l'un des domaines de recherche les plus attractives en langage naturel En t...
We present the Modified French Treebank (MFT), a completely revamped French Treebank, derived from t...
We present the Modified French Treebank (MFT), a completely revamped French Treebank, derived from t...
This paper presents the current status of the French treebank developed at Paris 7 (Abeille ́ et al....
International audienceThis paper reports preliminary results on grammatical induction for French. We...
This paper shows that training a lexicalized parser on a lemmatized morphologically-rich treebank su...
International audienceWe first describe the automatic conversion of the French Treebank (Abeillé and...
We evaluate statistical parsing of French using two probabilistic models derived from the Tree Adjoi...
Motivated by the expense in time and other resources to produce hand-crafted grammars, there has bee...
posterInternational audienceThis article introduces results about probabilistic parsing enhanced wit...
Motivated by the expense in time and other resources to produce hand-crafted grammars, there has bee...
International audienceWe describe and evaluate the semi-automatic addition of a deep syntactic layer...
International audienceThis article evaluates the integration of data extracted from a French syntact...
International audienceRecently, several statistical parsers have been trained and evaluated on the d...
International audienceIn this paper we present the PASSAGE project which aims at building automatica...
Analyse probabiliste est l'un des domaines de recherche les plus attractives en langage naturel En t...
We present the Modified French Treebank (MFT), a completely revamped French Treebank, derived from t...
We present the Modified French Treebank (MFT), a completely revamped French Treebank, derived from t...
This paper presents the current status of the French treebank developed at Paris 7 (Abeille ́ et al....
International audienceThis paper reports preliminary results on grammatical induction for French. We...
This paper shows that training a lexicalized parser on a lemmatized morphologically-rich treebank su...
International audienceWe first describe the automatic conversion of the French Treebank (Abeillé and...
We evaluate statistical parsing of French using two probabilistic models derived from the Tree Adjoi...
Motivated by the expense in time and other resources to produce hand-crafted grammars, there has bee...
posterInternational audienceThis article introduces results about probabilistic parsing enhanced wit...
Motivated by the expense in time and other resources to produce hand-crafted grammars, there has bee...
International audienceWe describe and evaluate the semi-automatic addition of a deep syntactic layer...
International audienceThis article evaluates the integration of data extracted from a French syntact...
International audienceRecently, several statistical parsers have been trained and evaluated on the d...
International audienceIn this paper we present the PASSAGE project which aims at building automatica...
Analyse probabiliste est l'un des domaines de recherche les plus attractives en langage naturel En t...