Neural machine translation (NMT) has been shown to outperform statistical machine translation. However, NMT models typically require a large number of parameters and are expensive to train and deploy. Moreover, its large model size makes parallel training inefficient due to costly network communication. Likewise, distributing and locally running the model for a client-based NMT model such as a web browser or mobile device remains challenging. This thesis investigates ways to approximately train an NMT system by compressing either the gradients or the parameters for faster communication or reduced memory consumption. We propose a gradient compression technique that exchanges only the top 1% of the most significant gradient values whi...
[EN] Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing...
It is today acknowledged that neural network language models outperform backoff language models in a...
Pre-trained models have revolutionized the natural language processing field by leveraging large-sca...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
Neural Machine Translation is the primary algorithm used in industry to perform machine translation....
Neural machine translation (NMT) systems have greatly improved the quality available from machine tr...
Neural machine translation (NMT) systems have greatly improved the quality available from machine tr...
Humans benefit from communication but suffer from language barriers. Machine translation (MT) aims t...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural Machine Translation (NMT) systems have received much recent attention due to their human-leve...
The goal of neural machine translation (NMT) is to build an end-to-end system that automatically tr...
Recent progress in neural machine translation is directed towards larger neural networks trained on ...
With economic globalization and the rapid development of the Internet, the connections between diffe...
Neural machine translation (NMT) has been a mainstream method for the machine translation (MT) task....
Large model size and high computational complexity prevent the neural machine translation (NMT) mode...
[EN] Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing...
It is today acknowledged that neural network language models outperform backoff language models in a...
Pre-trained models have revolutionized the natural language processing field by leveraging large-sca...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
Neural Machine Translation is the primary algorithm used in industry to perform machine translation....
Neural machine translation (NMT) systems have greatly improved the quality available from machine tr...
Neural machine translation (NMT) systems have greatly improved the quality available from machine tr...
Humans benefit from communication but suffer from language barriers. Machine translation (MT) aims t...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural Machine Translation (NMT) systems have received much recent attention due to their human-leve...
The goal of neural machine translation (NMT) is to build an end-to-end system that automatically tr...
Recent progress in neural machine translation is directed towards larger neural networks trained on ...
With economic globalization and the rapid development of the Internet, the connections between diffe...
Neural machine translation (NMT) has been a mainstream method for the machine translation (MT) task....
Large model size and high computational complexity prevent the neural machine translation (NMT) mode...
[EN] Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing...
It is today acknowledged that neural network language models outperform backoff language models in a...
Pre-trained models have revolutionized the natural language processing field by leveraging large-sca...