This paper presents work on manual and semi-automatic normalization of historical language data. We first address the guide-lines that we use for mapping historical to modern word forms. The guidelines dis-tinguish between normalization (preferring forms close to the original) and moderniza-tion (preferring forms close to modern lan-guage). Average inter-annotator agreement is 88.38 % on a set of data from Early New High German. We then present Norma, a semi-automatic normalization tool. It in-tegrates different modules (lexicon lookup, rewrite rules) for normalizing words in a
To be able to use existing natural language processing tools for analysing historical text, an impor...
Historische Dokumente werden zunehmend in digitalisierter Form verfügbar gemacht. Häufig sind sie je...
To be able to use existing natural language processing tools for analysing historical text, an impor...
This paper presents work on manual and semi-automatic normalization of historical language data. We ...
Historical text constitutes a rich source of information for historians and other researchers in hum...
Natural language processing for historical text imposes a variety of challenges, such as to deal wit...
Language technology tools can be very use- ful for making information concealed in historical docume...
Language technology tools can be very use-ful for making information concealed in historical documen...
Corpora of Early Modern English have been collected and released for research for a number of years....
Corpora of Early Modern English have been collected and released for research for a number of years....
To be able to profit from natural language processing (NLP) tools for analysing historical text, an ...
International audienceSpelling normalisation is a useful step in the study and analysis of historica...
Historical texts are an important resource for researchers in the humanities. However, standard NLP ...
<p>This article deals with the regularization of non-standard spellings of the verbal forms extracte...
To study and automatically process Swiss German, it is necessary to resolve the issue of variation i...
To be able to use existing natural language processing tools for analysing historical text, an impor...
Historische Dokumente werden zunehmend in digitalisierter Form verfügbar gemacht. Häufig sind sie je...
To be able to use existing natural language processing tools for analysing historical text, an impor...
This paper presents work on manual and semi-automatic normalization of historical language data. We ...
Historical text constitutes a rich source of information for historians and other researchers in hum...
Natural language processing for historical text imposes a variety of challenges, such as to deal wit...
Language technology tools can be very use- ful for making information concealed in historical docume...
Language technology tools can be very use-ful for making information concealed in historical documen...
Corpora of Early Modern English have been collected and released for research for a number of years....
Corpora of Early Modern English have been collected and released for research for a number of years....
To be able to profit from natural language processing (NLP) tools for analysing historical text, an ...
International audienceSpelling normalisation is a useful step in the study and analysis of historica...
Historical texts are an important resource for researchers in the humanities. However, standard NLP ...
<p>This article deals with the regularization of non-standard spellings of the verbal forms extracte...
To study and automatically process Swiss German, it is necessary to resolve the issue of variation i...
To be able to use existing natural language processing tools for analysing historical text, an impor...
Historische Dokumente werden zunehmend in digitalisierter Form verfügbar gemacht. Häufig sind sie je...
To be able to use existing natural language processing tools for analysing historical text, an impor...