written; domain-specific (newspaper); synchronic; bilingual; parallel; unidirectional; XML; S-alignmen
The slenWaC corpus version 1.0 consists of parallel Slovene-English texts crawled from the .si top-l...
A comprehensive corpus of news articles on the topic of language, published in major daily newspaper...
Czech-Slovak parallel corpus consisting of several freely available corpora (Acquis [1], Europarl [2...
written; domain-specific (newspaper); synchronic; bilingual; parallel; unidirectional; XML; S-alignm...
written; domain-specific (fiction); diachronic (the French side); bilingual; parallel; ca 263,000 to...
The hrenWaC corpus version 2.0 consists of parallel Croatian-English texts crawled from the .hr top-...
written; domain-specific (newspaper); diachronic; bilingual; comparable; ca 3,500,000 tokens (393 Kw...
The srenWaC corpus consists of sentence-aligned parallel Serbian-English texts crawled from the .rs ...
Sentence-aligned parallel corpus built by automatically crawling 25 websites from the tourism domain
Corpus of manually aligned Czech-English parallel sentences. It comprises 2500 parallel sentences fr...
The Croatian-English parallel corpus MaCoCu-hr-en 1.0 was built by crawling the ".hr" internet top-l...
The Slovene-English parallel corpus MaCoCu-sl-en 1.0 was built by crawling the ".si" internet top-le...
English-Slovak parallel corpus consisting of several freely available corpora. The corpus is given in bo...
We are presenting a new, unique and freely available parallel corpus available in all 20 official Eu...
This corpus is part of Deliverable 5.5 of the European Commission project QTLeap FP7-ICT-2013.4.1-6...