International audienceRecent studies on the analysis of the multilingual representations focus on identifying whether there is an emergence of languageindependent representations, or whether a multilingual model partitions its weights among different languages. While most of such work has been conducted in a "black-box" manner, this paper aims to analyze individual components of a multilingual neural translation (NMT) model. In particular, we look at the encoder self-attention and encoder-decoder attention heads (in a many-to-one NMT model) that are more specific to the translation of a certain language pair than others by (1) employing metrics that quantify some aspects of the attention weights such as "variance" or "confidence", and (2) s...
Though early successes of Statistical Machine Translation (SMT) systems are attributed in part to th...
We explore the suitability of self-attention models for character-level neural machine translation. ...
The dominant neural machine translation (NMT) models that based on the encoder-decoder architecture ...
In this thesis, I explore neural machine translation (NMT) models via targeted investigation of vari...
In this paper, we explore a multilingual translation model with a cross-lingually shared layer that ...
Neural machine translation has considerably improved the quality of automatic translations by learni...
Recently, neural machine translation (NMT) has been extended to multilinguality, that is to handle m...
In this paper, we propose an architecture for machine translation (MT) capable of obtaining multilin...
Transformer model (Vaswani et al. 2017) has been widely used in machine translation tasks and obtain...
Neural machine translation has been lately established as the new state of the art in machine transl...
An attentional mechanism has lately been used to improve neural machine transla-tion (NMT) by select...
Multi-head self-attention is a key component of the Transformer, a state-of-the-art architecture for...
This paper presents an extension of neural machine translation (NMT) model to incorporate additional...
Multilingual Neural Machine Translation (MNMT) trains a single NMT model that supports translation b...
Since many languages originated from a common ancestral language and influence each other, there wou...
Though early successes of Statistical Machine Translation (SMT) systems are attributed in part to th...
We explore the suitability of self-attention models for character-level neural machine translation. ...
The dominant neural machine translation (NMT) models that based on the encoder-decoder architecture ...
In this thesis, I explore neural machine translation (NMT) models via targeted investigation of vari...
In this paper, we explore a multilingual translation model with a cross-lingually shared layer that ...
Neural machine translation has considerably improved the quality of automatic translations by learni...
Recently, neural machine translation (NMT) has been extended to multilinguality, that is to handle m...
In this paper, we propose an architecture for machine translation (MT) capable of obtaining multilin...
Transformer model (Vaswani et al. 2017) has been widely used in machine translation tasks and obtain...
Neural machine translation has been lately established as the new state of the art in machine transl...
An attentional mechanism has lately been used to improve neural machine transla-tion (NMT) by select...
Multi-head self-attention is a key component of the Transformer, a state-of-the-art architecture for...
This paper presents an extension of neural machine translation (NMT) model to incorporate additional...
Multilingual Neural Machine Translation (MNMT) trains a single NMT model that supports translation b...
Since many languages originated from a common ancestral language and influence each other, there wou...
Though early successes of Statistical Machine Translation (SMT) systems are attributed in part to th...
We explore the suitability of self-attention models for character-level neural machine translation. ...
The dominant neural machine translation (NMT) models that based on the encoder-decoder architecture ...