The repository contains a cleaned and pre-processed corpus of parliamentary debates from the Croatian Parliament (Sabor). The corpus is accompanied by metadata on elected representatives and their political parties. It covers the period 2003-2020 (five complete terms) and contains over 500 thousand speeches. If you use the dataset, please cite: Mochtak, Michal, Josip Glaurdić, and Christophe Lesschaeve (2022): CROCorp: Corpus of Parliamentary Debates in Croatia (v1.1.1), https://doi.org/10.5281/zenodo.6521372. Version history: v1.1.1 (latest version) - added the concept DOI to the codebooks (the DOI was generated only after the repository was published); v1.1.0 - improved coding of the dummy variable "moderator" (using a less error-prone algorithm for detecting th...
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starti...
The SlovParl corpus contains minutes of the Chamber of Associated Labour of the Assembly of the Repu...
ParlaMint 4.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 2...
The repository contains a cleaned and pre-processed corpus of parliamentary debates from the Croatia...
The repository contains a cleaned and pre-processed corpus of parliamentary debates from the Croatia...
The repository contains a cleaned and pre-processed corpus of parliamentary debates from the Nationa...
The repository contains a cleaned and pre-processed corpus of parliamentary debates from the Parliam...
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
This is a repository for the corpus of transcripts of parliamentary debates in the National Council ...
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starti...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
The SlovParl corpus contains minutes of the Assembly of the Republic of Slovenia for the legislative...
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starti...
The SlovParl corpus contains minutes of the Chamber of Associated Labour of the Assembly of the Repu...
ParlaMint 4.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 2...