The development of (semi-)automatic tools such as the VARD (Baron and Rayson, 2008) has afforded compilers of historical corpora the opportunity to normalise variant spellings relatively quickly, albeit only after a dedicated period of manual training using relevant corpus samples (see, e.g., Lehto et al. 2010). In the case of VARD2, this period of manual training involves the user: (i) reading a given text via the VARD interface; (ii) distinguishing variants within the text, either via the tool’s recommended list of (ranked) candidate replacements or personally, by highlighting variant forms manually; (iii) choosing the most appropriate normalised form for each variant found, guided, where relevant, by the VARD’s known variant list ...
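The kind of workflow outlined above (flag a variant, offer ranked candidate replacements, prefer forms already on a known variant list) can be illustrated with a minimal sketch. This is not VARD's actual algorithm or API: the lexicon, the known variant list, the function names, and the use of plain string similarity from Python's `difflib` are all assumptions made purely for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical toy data, invented for this sketch (not taken from VARD
# or from any real corpus).
MODERN_LEXICON = {"would", "have", "love", "the", "king", "been", "very"}
KNOWN_VARIANTS = {"wolde": "would", "haue": "have"}


def is_variant(token, lexicon=MODERN_LEXICON):
    """Flag a token as a possible spelling variant if it is not in the lexicon."""
    return token.lower() not in lexicon


def rank_candidates(token, lexicon=MODERN_LEXICON, known=KNOWN_VARIANTS, top_n=3):
    """Return up to top_n candidate normalisations, best first.

    Candidates are ranked by string similarity; a form recorded in the
    known variant list is always moved to the top of the ranking.
    """
    word = token.lower()
    scored = [(SequenceMatcher(None, word, cand).ratio(), cand) for cand in lexicon]
    # Sort by descending similarity, breaking ties alphabetically.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    ranked = [cand for _, cand in scored]
    if word in known:
        preferred = known[word]
        ranked = [preferred] + [cand for cand in ranked if cand != preferred]
    return ranked[:top_n]


def suggest(text):
    """Map each flagged variant in a whitespace-tokenised text to its candidates."""
    return {tok: rank_candidates(tok) for tok in text.split() if is_variant(tok)}


if __name__ == "__main__":
    for variant, candidates in suggest("the king wolde haue loue").items():
        print(variant, "->", candidates)
```

In a real training session the user would accept or reject each suggestion, and accepted pairs would be added to the known variant list; here that step would amount to updating the hypothetical `KNOWN_VARIANTS` dictionary.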
We describe, evaluate, and improve the automatic annotation of diachronic corpora at the levels of ...
This paper presents work on manual and semi-automatic normalization of historical language data. We ...
This article deals with the regularization of non-standard spellings of the verbal forms extracte...
Corpora of Early Modern English have been collected and released for research for a number of years....
When applying corpus linguistic techniques to historical corpora, the corpus researcher should be ca...
Large quantities of spelling variation in corpora, such as that found in Early Modern English, can c...
To be able to profit from natural language processing (NLP) tools for analysing historical text, an ...
Analysis of English historical texts poses a number of obstacles for standard corpus analysis and an...
To be able to use existing natural language processing tools for analysing historical text, an impor...
We describe, evaluate, and improve the automatic annotation of diachronic corpora at the levels of w...
Early English Books Online contains facsimiles of virtually every English work printed between 1473 ...
Natural language processing for historical text imposes a variety of challenges, such as to deal wit...