Low-frequency word prediction remains a challenge in modern neural machine translation (NMT) systems. Recent adaptive training methods promote the output of infrequent words by emphasizing their weights in the overall training objectives. Despite the improved recall of low-frequency words, their prediction precision is unexpectedly hindered by the adaptive objectives. Inspired by the observation that low-frequency words form a more compact embedding space, we tackle this challenge from a representation learning perspective. Specifically, we propose a frequency-aware token-level contrastive learning method, in which the hidden state of each decoding step is pushed away from the counterparts of other target words, in a soft contrastive way ba...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation ...
Although Neural Machine Translation (NMT) models have advanced state-of-the-art performance in machi...
Neural Machine Translation has achieved state-of-the-art performance for several language pairs usin...
Neural machine translation (NMT) has achieved notable success in recent times, however it is also wi...
Neural language models do not scale well when the vocabulary is large. Noise contrastive estimation ...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural Machine Translation (NMT) is a new approach to machine translation that has made great progre...
Neural machine translation (NMT) has become the de facto standard in the machine translation communi...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
This paper presents an extension of neural machine translation (NMT) model to incorporate additional...
This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation ...
Neural language models do not scale well when the vocabulary is large. Noise-contrastive estimation ...
Neural machine translation (NMT) conducts end-to-end translation with a source language encoder and ...
Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NM...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation ...
Although Neural Machine Translation (NMT) models have advanced state-of-the-art performance in machi...
Neural Machine Translation has achieved state-of-the-art performance for several language pairs usin...
Neural machine translation (NMT) has achieved notable success in recent times, however it is also wi...
Neural language models do not scale well when the vocabulary is large. Noise contrastive estimation ...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural Machine Translation (NMT) is a new approach to machine translation that has made great progre...
Neural machine translation (NMT) has become the de facto standard in the machine translation communi...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
This paper presents an extension of neural machine translation (NMT) model to incorporate additional...
This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation ...
Neural language models do not scale well when the vocabulary is large. Noise-contrastive estimation ...
Neural machine translation (NMT) conducts end-to-end translation with a source language encoder and ...
Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NM...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation ...
Although Neural Machine Translation (NMT) models have advanced state-of-the-art performance in machi...