We propose a simple yet effective approach to learning bilingual word embeddings (BWEs) from non-parallel document-aligned data (based on the omnipresent skip-gram model), and its application to bilingual lexicon induction (BLI). We demonstrate the utility of the induced BWEs in the BLI task by reporting on benchmarking BLI datasets for three language pairs: (1) We show that our BWE-based BLI models significantly outperform the MuPTM-based and context-counting models in this setting, and obtain the best reported BLI results for all three tested language pairs; (2) We also show that our BWE-based BLI models outperform other BLI models based on recently proposed BWEs that require parallel data for bilingual training.status: publishe
This chapter introduces a strategy for the automatic extraction of multilingual collocation equivale...
International audienceIn this paper, we present a simple protocol to evaluate word aligners on bilin...
We introduce bilingual word embeddings: semantic embeddings associated across two languages in the c...
We propose a new model for learning bilingual word representations from non-parallel document-aligne...
We propose a new model for learning bilingual word representations from non-parallel document-aligne...
Building bilingual lexica from non-parallel data is a long-standing natural language processing rese...
We propose a new model for learning bilingual word representations from non-parallel document-aligne...
© 2017 Association for Computational Linguistics. We study the problem of bilingual lexicon inductio...
We study the problem of bilingual lexicon induction (BLI) in a setting where some translation resour...
Bilingual Word Embeddings (BWEs) are one of the cornerstones of cross-lingual transfer of NLP models...
Bilingual language models (Bi-LMs) refer to language models that are modeled using both source and t...
Recent work in learning bilingual repre-sentations tend to tailor towards achiev-ing good performanc...
Thesis (Master's)--University of Washington, 2018In an emergency, machine translation systems can be...
Thesis (Master's)--University of Washington, 2018In an emergency, machine translation systems can be...
Recent research has discovered that a shared bilingual word embedding space can be induced by projec...
This chapter introduces a strategy for the automatic extraction of multilingual collocation equivale...
International audienceIn this paper, we present a simple protocol to evaluate word aligners on bilin...
We introduce bilingual word embeddings: semantic embeddings associated across two languages in the c...
We propose a new model for learning bilingual word representations from non-parallel document-aligne...
We propose a new model for learning bilingual word representations from non-parallel document-aligne...
Building bilingual lexica from non-parallel data is a long-standing natural language processing rese...
We propose a new model for learning bilingual word representations from non-parallel document-aligne...
© 2017 Association for Computational Linguistics. We study the problem of bilingual lexicon inductio...
We study the problem of bilingual lexicon induction (BLI) in a setting where some translation resour...
Bilingual Word Embeddings (BWEs) are one of the cornerstones of cross-lingual transfer of NLP models...
Bilingual language models (Bi-LMs) refer to language models that are modeled using both source and t...
Recent work in learning bilingual repre-sentations tend to tailor towards achiev-ing good performanc...
Thesis (Master's)--University of Washington, 2018In an emergency, machine translation systems can be...
Thesis (Master's)--University of Washington, 2018In an emergency, machine translation systems can be...
Recent research has discovered that a shared bilingual word embedding space can be induced by projec...
This chapter introduces a strategy for the automatic extraction of multilingual collocation equivale...
International audienceIn this paper, we present a simple protocol to evaluate word aligners on bilin...
We introduce bilingual word embeddings: semantic embeddings associated across two languages in the c...