These Catalan word embeddings in FastText were generated from the largest Catalan corpus compiled to date. The corpus contains more than 10 GB of curated, high-quality text. If this material is useful, please cite it. Copyright (c) 2021 Text Mining Unit - Barcelona Supercomputing Center. Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL) and the Generalitat de Catalunya, Departament de Polítiques Digitals i Administració Pública.
The Catalan Government Crawling Corpus is a 39-million-token web corpus of Catalan built from the we...
[Plan TL/medicine/word embeddings] Word embeddings generated from Spanish corpora that include: (a) ...
Embeddings with the Catalan Textual Corpus The embeddings have been trained with a Catalan textual ...
These Catalan sub-word embeddings in FastText using BPE have been generated from the largest corpus ...
Spanish Biomedical Sub-word Embeddings in FastText These embeddings have been generated from the la...
The Catalan Textual Corpus is a 1760-million-token web corpus of Catalan built from several sources:...
Spanish Biomedical Word Embeddings in FastText These word embeddings have been generated from the l...
These Spanish word embeddings in FastText have been generated from the largest corpus ever made in S...
Spanish Clinical Word Embeddings in FastText These embeddings have been generated from the largest ...
These Spanish word embeddings in FastText have been generated from the largest corpus ever made in S...
Spanish Clinical Sub-word Embeddings in FastText These embeddings have been generated from the larg...
Spanish Biomedical Sub-word Embeddings in FastText These embeddings have been generated from the la...
The Catalan Newswire Corpus is a 163-million-token corpus of Catalan newswire text built from three ...
Spanish Legal Word and Sub-word Embeddings in FastText These embeddings have been generated from th...
We present a large Spanish-Catalan parallel corpus extracted from ten years of the paper edition of ...