In this paper, a method for measuring synchronic corpus (dis-)similarity put forward by Kilgarriff (2001) is adapted and extended to identify trends and correlated changes in diachronic text data, using the Corpus of Historical American English (Davies 2010a) and the Google Ngram Corpora (Michel et al. 2010a). This paper shows that this fully data-driven method, which extracts word types that have undergone the most pronounced change in frequency in a given period of time, is computationally very cheap and that it allows interpretations of diachronic trends that are both intuitively plausible and motivated from the perspective of information theory. Furthermore, it demonstrates that the method is able to identify correlated linguistic chang...
This thesis consists of the following three papers that all have been published in international pee...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
In this paper, an exploratory data-driven method is presented that extracts word-types from diachron...
When exploring diachronic corpora, it is often beneficial for linguists to pinpoint not only the fir...
© Springer Nature Switzerland AG 2020. The article proposes a method for detecting semantic change u...
The thesis presents a method for diachronic comparison of synchronic corpora that reflect language o...
The thesis presents a method for diachronic comparison of synchronic corpora that reflect language o...
We present a data-driven approach to detect periods of linguistic change and the lexical and grammat...
Using the Google Ngram Corpora for six different languages (including two varieties of English), a l...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
Using the Google Ngram Corpora for six different languages (including two varieties of English), a l...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
Presented at the University of Kansas, Institute for Digital Research in the Humanities, January 26,...
This thesis consists of the following three papers that all have been published in international pee...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
In this paper, an exploratory data-driven method is presented that extracts word-types from diachron...
When exploring diachronic corpora, it is often beneficial for linguists to pinpoint not only the fir...
© Springer Nature Switzerland AG 2020. The article proposes a method for detecting semantic change u...
The thesis presents a method for diachronic comparison of synchronic corpora that reflect language o...
The thesis presents a method for diachronic comparison of synchronic corpora that reflect language o...
We present a data-driven approach to detect periods of linguistic change and the lexical and grammat...
Using the Google Ngram Corpora for six different languages (including two varieties of English), a l...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
Using the Google Ngram Corpora for six different languages (including two varieties of English), a l...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
Presented at the University of Kansas, Institute for Digital Research in the Humanities, January 26,...
This thesis consists of the following three papers that all have been published in international pee...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...