This is the ParlamentParla speech corpus for Catalan prepared by Col·lectivaT. The audio segments were extracted from recordings the Catalan Parliament (Parlament de Catalunya) plenary sessions, which took place between 2007/07/11 - 2018/07/17. We aligned the transcriptions with the recordings and extracted the corpus. The content belongs to the Catalan Parliament and the data is released conforming their terms of use. Preparation of this corpus was partly supported by the Department of Culture of the Catalan autonomous government, and the v2.0 was supported by the Barcelona Supercomputing Center, within the framework of the project AINA of the Departament de Polítiques Digitals. As of v2.0 the corpus is separated into 211 hours of clean ...
This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 Europe...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
Aquest document conté el text POL2, una "sessió parlamentària" que forma part del Corpus Oral de Reg...
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly...
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starti...
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly...
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starti...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
The Catalan Textual Corpus is a 1760-million-token web corpus of Catalan built from several sources:...
ParlaMint 4.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 2...
Nos_ParlaSpeech-GL is an ASR corpus of more than 1,600 hours of automatically aligned speech and tex...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
The Catalan Government Crawling Corpus is a 39-million-token web corpus of Catalan built from the we...
The ParisParl Corpus of Parliamentary Debates, prepared in the PolMine Project, comprises all protoc...
This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 Europe...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
Aquest document conté el text POL2, una "sessió parlamentària" que forma part del Corpus Oral de Reg...
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly...
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starti...
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly...
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starti...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
The Catalan Textual Corpus is a 1760-million-token web corpus of Catalan built from several sources:...
ParlaMint 4.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 2...
Nos_ParlaSpeech-GL is an ASR corpus of more than 1,600 hours of automatically aligned speech and tex...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
The Catalan Government Crawling Corpus is a 39-million-token web corpus of Catalan built from the we...
The ParisParl Corpus of Parliamentary Debates, prepared in the PolMine Project, comprises all protoc...
This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 Europe...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
Aquest document conté el text POL2, una "sessió parlamentària" que forma part del Corpus Oral de Reg...