8 pages, 2 figures, 4 tables. International audience. Language models for historical states of language are becoming increasingly important to allow the optimal digitisation and analysis of old textual sources. Because these historical states are at the same time more complex to process and more scarce in the corpora available, specific efforts are necessary to train natural language processing (NLP) tools adapted to the data. In this paper, we present our efforts to develop NLP tools for Early Modern French (historical French from the 16th to the 18th centuries). We present the FreEMmax corpus of Early Modern French and D'AlemBERT, a RoBERTa-based language model trained on FreEMmax. We evaluate the usefulness of D'AlemBERT by fine-tuning it on...
To explore the role of extended phraseology in the structuring of literary textual genres in mediev...
International audience. Linguistic change in 17th c. France: new scriptometric approaches. The end of t...
Grammar models conceived for parsing purposes are often poorer than models that are linguistically m...
International audience. The successes of contextual word embeddings learned by training large-scale la...
International audience. Spelling normalisation is a useful step in the study and analysis of historica...
With the development of big corpora of various periods, it becomes crucial to standardise linguistic ...
We investigate the creation of a 17th c. French literary corpus. We present the main options regardi...
International audience. In order to automatically extend a treebank of Old French (9th-13th c.) with...
This brief thirty-year history of Lexicons of Early Modern English, an online database of glossaries...
In recent years, neural methods for Natural Language Processing (NLP) have consistently and repeated...
International audience. Old French parsing: Which language properties have the greatest influence on ...
International audience. The "Preclassical" French language period extends throughout the sixteenth cen...