In this paper, we discuss how different types of automatic annotation of digitised newspaper articles can be integrated into the iterative questioning of the source material and the creation of research corpora out of a collection of unstructured texts (kept in a structured collection). We annotate a sizeable collection of Swiss press articles (183,270), extracted via the impresso interface1 using topic modelling (MALLET)2 as well as a naïve Bayes classifier (script by Milan van Lange). The methodological discussion we propose is to explore how text mining can help identify historical discourses that are difficult to query with keywords because of their inherent ambiguity and how to grasp them in a large corpus. We argue that the automated ...
The poster presents the idea behind and first steps in the recently started project Distant Spectato...
A trend towards automation of scientific research has recently resulted in what has been termed “dat...
This paper presents the initial efforts towards the creation of a new corpus on the history domain. ...
How can computer-assisted methods help us to solve problems that are fundamental to historical resea...
These are the slides from the 2021 Workshop ‘Historical Newspaper Content Mining: findings from the ...
impresso. Media Monitoring of the Past is an interdisciplinary research project in which a team of c...
The availability of large digital archives of historical newspaper content has transformed the histo...
Following decades of massive digitization, an unprecedented amount of historical document facsimile...
Comparative historical research on the the intensity, diversity and fluidity of public discourses ha...
Newspapers have been a rich source of information for historians for the past hundred years or so. I...
The labour-intensive nature of manual content analysis and the problematic accessibility of source m...
Comparative historical research on the the intensity, diversity and fluidity of public discourses ha...
The labour-intensive nature of manual content analysis and the problematic accessibility of source m...
Abstract. Comparative historical research on the the intensity, diversity and flu-idity of public di...
The application of digital technologies to historical newspapers have changed the research landscape...
The poster presents the idea behind and first steps in the recently started project Distant Spectato...
A trend towards automation of scientific research has recently resulted in what has been termed “dat...
This paper presents the initial efforts towards the creation of a new corpus on the history domain. ...
How can computer-assisted methods help us to solve problems that are fundamental to historical resea...
These are the slides from the 2021 Workshop ‘Historical Newspaper Content Mining: findings from the ...
impresso. Media Monitoring of the Past is an interdisciplinary research project in which a team of c...
The availability of large digital archives of historical newspaper content has transformed the histo...
Following decades of massive digitization, an unprecedented amount of historical document facsimile...
Comparative historical research on the the intensity, diversity and fluidity of public discourses ha...
Newspapers have been a rich source of information for historians for the past hundred years or so. I...
The labour-intensive nature of manual content analysis and the problematic accessibility of source m...
Comparative historical research on the the intensity, diversity and fluidity of public discourses ha...
The labour-intensive nature of manual content analysis and the problematic accessibility of source m...
Abstract. Comparative historical research on the the intensity, diversity and flu-idity of public di...
The application of digital technologies to historical newspapers have changed the research landscape...
The poster presents the idea behind and first steps in the recently started project Distant Spectato...
A trend towards automation of scientific research has recently resulted in what has been termed “dat...
This paper presents the initial efforts towards the creation of a new corpus on the history domain. ...