The GermaParl Corpus has been prepared in the PolMine Project (http://polmine.github.io) and comprises all protocols of plenary sessions in the German Bundestag (1996 - 2016). This version of the corpus is based on plain text documents issued by the German Bundestag. For a period between 2008 and 2010, txt files are not available. To fill the gap, pdf documents were processed. As part of the corpus preparation pipeline, the data has been linguistically annotated (using the TreeTagger) and imported into the Corpus Workbench (CWB). See the GermaParl documentation website (http://polmine.github.io/GermaParl) for further information
The ArchiMob corpus represents German varieties spoken on the territory of Switzerland. It is the fi...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
The GermaParl Corpus has been prepared in the PolMine Project (http://polmine.github.io) and compris...
The GermaParl Corpus of Parliamentary Protocols covers 72 years of debates in the German Bundestags,...
The GermaParlSample Corpus is a small subset of the GermaParl corpus that has been prepared in the P...
The ParisParl Corpus of Parliamentary Debates, prepared in the PolMine Project, comprises all protoc...
MigParl is an indexed and linguistically annotated corpus of speeches on migration and integration a...
A corpus called DutchParl is created which aims to contain all digitally available parliamentary doc...
This text archive focuses on German political speeches held by top officials mostly from 1990 onward...
International audienceThe present German political speeches corpus follows from a initial release wh...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
The ParCzech 4.0 corpus consists of stenographic protocols that record the Chamber of Deputies' meet...
Parliamentary proceedings (PP) are a rich source of data used by e.g. scholars in historiography, so...
We release the data of the Encyclopedic Module of the Polifonia Textual Corpus (Wikipedia pages), co...
The ArchiMob corpus represents German varieties spoken on the territory of Switzerland. It is the fi...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
The GermaParl Corpus has been prepared in the PolMine Project (http://polmine.github.io) and compris...
The GermaParl Corpus of Parliamentary Protocols covers 72 years of debates in the German Bundestags,...
The GermaParlSample Corpus is a small subset of the GermaParl corpus that has been prepared in the P...
The ParisParl Corpus of Parliamentary Debates, prepared in the PolMine Project, comprises all protoc...
MigParl is an indexed and linguistically annotated corpus of speeches on migration and integration a...
A corpus called DutchParl is created which aims to contain all digitally available parliamentary doc...
This text archive focuses on German political speeches held by top officials mostly from 1990 onward...
International audienceThe present German political speeches corpus follows from a initial release wh...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
The ParCzech 4.0 corpus consists of stenographic protocols that record the Chamber of Deputies' meet...
Parliamentary proceedings (PP) are a rich source of data used by e.g. scholars in historiography, so...
We release the data of the Encyclopedic Module of the Polifonia Textual Corpus (Wikipedia pages), co...
The ArchiMob corpus represents German varieties spoken on the territory of Switzerland. It is the fi...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...