The neural network joint model (NNJM), which augments the neural network lan-guage model (NNLM) with an m-word source context window, has achieved large gains in machine translation accuracy, but also has problems with high normalization cost when using large vocabularies. Train-ing the NNJM with noise-contrastive es-timation (NCE), instead of standard maxi-mum likelihood estimation (MLE), can re-duce computation cost. In this paper, we propose an alternative to NCE, the bina-rized NNJM (BNNJM), which learns a bi-nary classifier that takes both the context and target words as input, and can be ef-ficiently trained using MLE. We compare the BNNJM and NNJM trained by NCE on various translation tasks.
Most statistical machine translation (SMT) systems are modeled using a log-linear framework. Althoug...
Relying on large-scale parallel corpora, neural machine translation has achieved great success in ce...
Though early successes of Statistical Machine Translation (SMT) systems are attributed in part to th...
Recent work has shown success in us-ing neural network language models (NNLMs) as features in MT sys...
Recent work has shown success in us-ing neural network language models (NNLMs) as features in MT sys...
We explore the application of neural language models to machine translation. We develop a new model ...
We explore the application of neural language models to machine translation. We develop a new model ...
The quality of translations produced by statistical machine translation (SMT) systems crucially depe...
Neural Machine Translation (NMT) is a new approach to machine translation that has made great progre...
Neural machine translation (NMT) conducts end-to-end translation with a source language encoder and ...
Monolingual data have been demonstrated to be helpful in improving translation quality of both stati...
[EN] Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing...
Neural language models do not scale well when the vocabulary is large. Noise-contrastive estimation ...
Machine translation presents its root in the domain of textual processing that focuses on the usage ...
With economic globalization and the rapid development of the Internet, the connections between diffe...
Most statistical machine translation (SMT) systems are modeled using a log-linear framework. Althoug...
Relying on large-scale parallel corpora, neural machine translation has achieved great success in ce...
Though early successes of Statistical Machine Translation (SMT) systems are attributed in part to th...
Recent work has shown success in us-ing neural network language models (NNLMs) as features in MT sys...
Recent work has shown success in us-ing neural network language models (NNLMs) as features in MT sys...
We explore the application of neural language models to machine translation. We develop a new model ...
We explore the application of neural language models to machine translation. We develop a new model ...
The quality of translations produced by statistical machine translation (SMT) systems crucially depe...
Neural Machine Translation (NMT) is a new approach to machine translation that has made great progre...
Neural machine translation (NMT) conducts end-to-end translation with a source language encoder and ...
Monolingual data have been demonstrated to be helpful in improving translation quality of both stati...
[EN] Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing...
Neural language models do not scale well when the vocabulary is large. Noise-contrastive estimation ...
Machine translation presents its root in the domain of textual processing that focuses on the usage ...
With economic globalization and the rapid development of the Internet, the connections between diffe...
Most statistical machine translation (SMT) systems are modeled using a log-linear framework. Althoug...
Relying on large-scale parallel corpora, neural machine translation has achieved great success in ce...
Though early successes of Statistical Machine Translation (SMT) systems are attributed in part to th...