In recent years, neural machine translation (NMT) has become the dominant approach in automated translation. However, like many other deep learning approaches, NMT suffers from overfitting when the amount of training data is limited. This is a serious issue for low-resource language pairs and many specialized translation domains that are inherently limited in the amount of available supervised data. For this reason, in this paper we propose regressing word (ReWE) and sentence (ReSE) embeddings at training time as a way to regularize NMT models and improve their generalization. During training, our models are trained to jointly predict categorical (words in the vocabulary) and continuous (word and sentence embeddings) outputs. An extensive s...
Recent advances in Neural Machine Translation (NMT) systems have achieved impressive results on lang...
Neural Machine Translation (NMT) models have been proved strong when translating clean texts, but th...
Differently from the traditional statistical MT that decomposes the translation task into distinct s...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Embedding matrices are key components in neural natural language processing (NLP) models that are re...
Without real bilingual corpus available, unsupervised Neural Machine Translation (NMT) typically req...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
As a new neural machine translation approach, NonAutoregressive machine Translation (NAT) has attrac...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
UNMT tackles translation on monolingual corpora in two required languages. Since there is no explici...
Neural Machine Translation (NMT) models tend to achieve best performance when larger sets of paral...
Neural machine translation (NMT) has achieved notable success in recent times, however it is also wi...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
The requirement for neural machine translation (NMT) models to use fixed-size input and output vocab...
Semi-supervised learning algorithms in neural machine translation (NMT) have significantly improved ...
Recent advances in Neural Machine Translation (NMT) systems have achieved impressive results on lang...
Neural Machine Translation (NMT) models have been proved strong when translating clean texts, but th...
Differently from the traditional statistical MT that decomposes the translation task into distinct s...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Embedding matrices are key components in neural natural language processing (NLP) models that are re...
Without real bilingual corpus available, unsupervised Neural Machine Translation (NMT) typically req...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
As a new neural machine translation approach, NonAutoregressive machine Translation (NAT) has attrac...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
UNMT tackles translation on monolingual corpora in two required languages. Since there is no explici...
Neural Machine Translation (NMT) models tend to achieve best performance when larger sets of paral...
Neural machine translation (NMT) has achieved notable success in recent times, however it is also wi...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
The requirement for neural machine translation (NMT) models to use fixed-size input and output vocab...
Semi-supervised learning algorithms in neural machine translation (NMT) have significantly improved ...
Recent advances in Neural Machine Translation (NMT) systems have achieved impressive results on lang...
Neural Machine Translation (NMT) models have been proved strong when translating clean texts, but th...
Differently from the traditional statistical MT that decomposes the translation task into distinct s...