This corpus contains a collection of texts in Picard which were manually annotated with parts-of-speech, lemmas, translations into French and location entities. The corpus was produced in the context of the RESTAURE project, funded by the French ANR. The current version of the corpus contains 25 documents. The annotation process is detailed in the following article: http://hal.archives-ouvertes.fr/hal-01704806 The untokenised and unannotated versions of the documents are found in the “extraits_reference_bruts” folder when available. The annotated versions of the documents are found in the the “extraits_reference_annotes” folder (original CSV file) and “picud” folder (CoNLL-U format). Additional information is also given; (inflected) tr...
C-PROM is a French segmented and annotated corpus, for prominence study. 24 recordings within 7 spea...
This paper presents C-PROM, an annotated corpus for French prominence studies. The corpus, including...
This is the corpus description of a set of data containing a collection of texts in several dialects...
This corpus contains a collection of texts in Picard which were manually annotated with parts-of-spe...
These guidelines were produced in the context of the RESTAURE project, funded by the French ANR. The...
This corpus contains a collection of texts in the Alsatian dialects which were manually annotated wi...
This corpus contains a collection of texts in Occitan which were manually annotated with parts-of-sp...
International audienceThis article describes the creation of corpora with part-of-speech annotations...
International audiencePicartext is a textual database, built up since about 10 years in Picardy Univ...
International audienceIn contrast to French, the vast majority of regional languages of France can b...
This paper describes the ANNODIS ressource, a corpus of written French enriched with several markups...
These guidelines were produced in the context of the RESTAURE project, funded by the French ANR. The...
This paper presents the current status of the French treebank developed at Paris 7 (Abeille ́ et al....
C-PROM is a French segmented and annotated corpus, for prominence study. 24 recordings within 7 spea...
This paper presents C-PROM, an annotated corpus for French prominence studies. The corpus, including...
This is the corpus description of a set of data containing a collection of texts in several dialects...
This corpus contains a collection of texts in Picard which were manually annotated with parts-of-spe...
These guidelines were produced in the context of the RESTAURE project, funded by the French ANR. The...
This corpus contains a collection of texts in the Alsatian dialects which were manually annotated wi...
This corpus contains a collection of texts in Occitan which were manually annotated with parts-of-sp...
International audienceThis article describes the creation of corpora with part-of-speech annotations...
International audiencePicartext is a textual database, built up since about 10 years in Picardy Univ...
International audienceIn contrast to French, the vast majority of regional languages of France can b...
This paper describes the ANNODIS ressource, a corpus of written French enriched with several markups...
These guidelines were produced in the context of the RESTAURE project, funded by the French ANR. The...
This paper presents the current status of the French treebank developed at Paris 7 (Abeille ́ et al....
C-PROM is a French segmented and annotated corpus, for prominence study. 24 recordings within 7 spea...
This paper presents C-PROM, an annotated corpus for French prominence studies. The corpus, including...
This is the corpus description of a set of data containing a collection of texts in several dialects...