Historical text constitutes a rich source of information for historians and other researchers in the humanities. However, many texts are not available in an electronic format, and even when they are, there is a lack of NLP tools designed to handle historical text. In my thesis, I aim to provide a generic workflow for automatic linguistic analysis and information extraction from historical text, with spelling normalisation as a core component of the pipeline. In the spelling normalisation step, the historical input text is automatically normalised to a more modern spelling, enabling the use of existing taggers and parsers trained on modern language data in the succeeding linguistic analysis step. In the final information extraction step, certain li...
Corpora of Early Modern English have been collected and released for research for a number of years....
To be able to use existing natural language processing tools for analysing historical text, an impor...
In this article, we describe the respective approaches we have taken when addressing issues of spell...
Historical text constitutes a rich source of information for historians and other researchers in hum...
Language technology tools can be very useful for making information concealed in historical docume...
Language technology tools can be very useful for making information concealed in historical documen...
Natural language processing for historical text imposes a variety of challenges, such as to deal wit...
Advances in computational linguistics can provide new opportunities for historical linguistics, but ...
This paper presents work on manual and semi-automatic normalization of historical language data. We ...
Even though NLP tools are widely used for contemporary text today, there is a lack of tools that can...
To be able to profit from natural language processing (NLP) tools for analysing historical text, an ...
Historical texts are an important resource for researchers in the humanities. However, standard NLP ...