This corpus contains a collection of texts in the Alsatian dialects which were manually annotated with parts of speech, lemmas, translations into French and location entities. The corpus was produced in the context of the RESTAURE project, funded by the French ANR. The current version of the corpus contains 21 documents and 12,570 tokens. The annotation process is detailed in the following article: http://hal.archives-ouvertes.fr/hal-01704806

The untokenised and unannotated versions of the documents are found in the “txt” folder. The annotated versions of the documents are found in the “annotated” folder. They are provided in a TSV format with the following columns:

id: token index in the document
form: word form
translation: transla...
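As an illustration of the TSV layout described above, the following is a minimal Python sketch of how an annotated file might be read. It rests on several assumptions that the description does not confirm: that the files have no header row, that the columns appear in the order listed, and that the function name `read_annotated_tsv` and the `extra` key are hypothetical helpers introduced here. Only the `id`, `form` and `translation` columns are named in the visible part of the description, so any further fields are kept unlabelled. The sample rows are invented placeholders, not corpus data.

```python
import csv
import io

# Only these three column names are visible in the corpus description;
# the remainder of the column list is cut off, so it is not guessed here.
KNOWN_COLUMNS = ["id", "form", "translation"]


def read_annotated_tsv(text, columns=KNOWN_COLUMNS):
    """Parse tab-separated token rows into a list of dictionaries.

    Assumes one token per line and no header row (an assumption, not
    something stated in the corpus description).
    """
    rows = []
    # QUOTE_NONE keeps quotation marks in tokens as ordinary characters.
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for fields in reader:
        if not fields:  # skip blank lines, e.g. between sentences
            continue
        row = dict(zip(columns, fields))
        # Fields beyond the known columns are preserved without a name.
        row["extra"] = fields[len(columns):]
        rows.append(row)
    return rows


# Hypothetical placeholder rows, invented for illustration only.
sample = "1\tform_a\ttranslation_a\tother_a\n\n2\tform_b\ttranslation_b\tother_b\n"
tokens = read_annotated_tsv(sample)
for token in tokens:
    print(token["id"], token["form"], token["translation"], token["extra"])
```

To read a real file from the “annotated” folder, the file contents could be passed in as a string after opening it with `encoding="utf-8"`; if the files turn out to start with a header row, that first row would need to be skipped or used as the column list instead.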
National audience. We present here the results of an experiment on part-of-speech annotation of a corp...
This dataset contains a lexicon of place names in the Alsatian dialects. These place names were coll...
International audience. The paper presents a project of the Laboratoire Parole et Langage which aims a...
These guidelines were produced in the context of the RESTAURE project, funded by the French ANR. The...
This corpus contains a collection of texts in Occitan which were manually annotated with parts-of...
This corpus contains a collection of texts in Picard which were manually annotated with parts-of-spe...
International audience. This article describes the creation of corpora with part-of-speech annotations...
International audience. In contrast to French, the vast majority of regional languages of France can b...
This paper describes the ANNODIS resource, a corpus of written French enriched with several markups...
This is the corpus description of a set of data containing a collection of texts in several dialects...
Website of the digitised archives of the Département de Dialectologie alsacienne et mosellane of the Univer...
These guidelines were produced in the context of the RESTAURE project, funded by the French ANR. The...