Spelling normalisation is a useful step in the study and analysis of historical language texts, whether the analysis is done manually by experts or automatically with downstream natural language processing (NLP) tools. Not only does it help to homogenise the variable spelling that often exists in historical texts, but it also facilitates the use of off-the-shelf contemporary NLP tools when contemporary spelling conventions are used for normalisation. We present FREEMnorm, a new benchmark for the normalisation of Early Modern French (from the 17th century) into contemporary French, and provide a thorough comparison of three different normalisation methods: ABA, an alignment-based approach, and MT approaches (both statistica...
There is no consensus on the state-of-the-art approach to historical text normalization. Many techni...
Corpora of Early Modern English have been collected and released for research for a number of years....
Historical text constitutes a rich source of information for historians and other researchers in hum...
The study of old states of language is facing a double problem: on the one ha...
Linguistic change in 17th c. France: new scriptometric approaches. The end of t...
This paper presents work on manual and semi-automatic normalization of historical language data. We ...
If NMT has proven to be the most efficient solution for normalising pre-orthog...
To be able to profit from natural language processing (NLP) tools for analysing historical text, an ...
8 pages, 2 figures, 4 tables. Language models for historical states of language ...
To be able to use existing natural language processing tools for analysing historical text, an impor...
Both statistical and rule-based methods for named entity recognition are quite...