Historical cabinet protocols are a useful resource which enable historians to identify the opinions expressed by politicians on different subjects and at different points of time. While cabinet protocols are often available in digitized form, so far the only method to access their information content is by keyword-based search, which often returns sub-optimal results. We present a method for enriching German cabinet protocols with information about the originators of statements. This requires automatic speaker attribution. In order to avoid costly manual annotation of training data, we design a rule-based system which exploits morpho-syntactic cues. Unlike many other approaches, our method can also deal with cases in which the speaker is no...
This work presents a text mining context and its use for a deep analysis of the messages delivered b...
This paper presents a salience-based technique for the annotation of directly quoted speech from fic...
A database of segmented audio files from the German Bundestag The database contains data from 9 Ger...
In the last decade, high-level features for speaker recognition have become a research focus, as the...
In order to help journalists investigate inside large audiovisual archives, as maintained by news br...
In historical sciences, the term oral history refers to conducting and analyzing interviews with con...
This text archive focuses on German political speeches held by top officials mostly from 1990 onward...
We describe a method for identifying the speakers of quoted speech in natural-language textual stori...
This paper describes methods that exploit stenographic transcripts of the German parliament to impro...
This paper reports the first authorship attribution results based on the automatic computational met...
We introduce the Merkel Podcast Corpus, an audio-visual-text corpus in German collected from 16 year...
We present a search system for grammatically analyzed corpora of Finnish parliamentary records and i...
International audienceIn this paper, we consider the extraction of speaker identity (first name and ...
International audienceIn this paper, we consider the extraction of speaker identity (first name and ...
This paper investigates the identification of populist rhetoric in text and presents a novel cross-l...
This work presents a text mining context and its use for a deep analysis of the messages delivered b...
This paper presents a salience-based technique for the annotation of directly quoted speech from fic...
A database of segmented audio files from the German Bundestag The database contains data from 9 Ger...
In the last decade, high-level features for speaker recognition have become a research focus, as the...
In order to help journalists investigate inside large audiovisual archives, as maintained by news br...
In historical sciences, the term oral history refers to conducting and analyzing interviews with con...
This text archive focuses on German political speeches held by top officials mostly from 1990 onward...
We describe a method for identifying the speakers of quoted speech in natural-language textual stori...
This paper describes methods that exploit stenographic transcripts of the German parliament to impro...
This paper reports the first authorship attribution results based on the automatic computational met...
We introduce the Merkel Podcast Corpus, an audio-visual-text corpus in German collected from 16 year...
We present a search system for grammatically analyzed corpora of Finnish parliamentary records and i...
International audienceIn this paper, we consider the extraction of speaker identity (first name and ...
International audienceIn this paper, we consider the extraction of speaker identity (first name and ...
This paper investigates the identification of populist rhetoric in text and presents a novel cross-l...
This work presents a text mining context and its use for a deep analysis of the messages delivered b...
This paper presents a salience-based technique for the annotation of directly quoted speech from fic...
A database of segmented audio files from the German Bundestag The database contains data from 9 Ger...