This article describes how a monitor corpus can be created from existing corpora where the texts are annotated with basic bibliographical information. The focus is on the core vocabulary and how to document diachronic change. The implementation uses simple, readily available techniques from corpus linguistics. A norm is defined by isolating a small vocabulary unlikely to change from text to text, consisting of the most frequent words. These words are the building blocks of language – function words and some highly frequent verbs and nouns, words essential to producing grammatical sentences. The relative frequencies of these words from one text collection to another will show minor deviations. This slight deviation is used to specify the nor...
The study of language variation has achieved a significant growth in the past half-century, but ther...
This thesis consists of the following three papers that all have been published in international pee...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
This paper gives an account of recent efforts within corpus-based lexicography in Norway. I explore ...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
In this paper we present our study how to use the Meta Dictionary of the Norwegian Language Collecti...
ABSTRACT At the Department of Linguistics and Scandinavian Studies (ILN) and the University of Oslo,...
The paper deals with the main problems of word corpus studies. It includes a general survey of Dani...
This article investigates the diachronic development of language mixing within noun phrases in the h...
This paper contains a description of the Corpus of American Norwegian Speech, a new tool for heritag...
In this paper, a method for measuring synchronic corpus (dis-)similarity put forward by Kilgarriff (...
Language documentation, including the development and use of corpora, is frequently linked to revita...
Presented at the University of Kansas, Institute for Digital Research in the Humanities, January 26,...
In this paper, we describe the Nordic Dialect Corpus, which has recently been completed. The corpus ...
Nowadays a corpus is typically a large collection of text excerpts, representing a range of register...
The study of language variation has achieved a significant growth in the past half-century, but ther...
This thesis consists of the following three papers that all have been published in international pee...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...
This paper gives an account of recent efforts within corpus-based lexicography in Norway. I explore ...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
In this paper we present our study how to use the Meta Dictionary of the Norwegian Language Collecti...
ABSTRACT At the Department of Linguistics and Scandinavian Studies (ILN) and the University of Oslo,...
The paper deals with the main problems of word corpus studies. It includes a general survey of Dani...
This article investigates the diachronic development of language mixing within noun phrases in the h...
This paper contains a description of the Corpus of American Norwegian Speech, a new tool for heritag...
In this paper, a method for measuring synchronic corpus (dis-)similarity put forward by Kilgarriff (...
Language documentation, including the development and use of corpora, is frequently linked to revita...
Presented at the University of Kansas, Institute for Digital Research in the Humanities, January 26,...
In this paper, we describe the Nordic Dialect Corpus, which has recently been completed. The corpus ...
Nowadays a corpus is typically a large collection of text excerpts, representing a range of register...
The study of language variation has achieved a significant growth in the past half-century, but ther...
This thesis consists of the following three papers that all have been published in international pee...
With the increasing availability of diachronic corpora, machine-aided identification of linguistic i...