We investigate a new approach for SMT system training within the streaming model of computation. We develop and test incrementally retrainable models which, given an incoming stream of new data, can efficiently incorporate the stream data online. A naive approach using a stream would use an unbounded amount of space. Instead, our online SMT system can incorporate information from unbounded incoming streams and maintain constant space and time. Crucially, we are able to match (or even exceed) translation performance of comparable systems which are batch retrained and use unbounded space. Our approach is particularly suited for situations when there is arbitrarily large amounts of new training material and we wish to incorporate it ef...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...
We present a novel online learning approach for statistical machine translation tailored to the comp...
Statistical Machine Translation (SMT) is by far the most dominant paradigm of Machine Translation. ...
We investigate a new approach for SMT system training within the streaming model of computation. We ...
Randomised techniques allow very big language models to be represented suc-cinctly. However, being b...
In this paper, we introduce our work of building a Streaming Multilingual Speech Model (SM2), which ...
Simultaneous machine translation systems rely on a policy to schedule read and write operations in o...
Statistical machine translation relies heavily on available parallel corpora, but SMT may not have t...
Parallel corpus is an indispensable resource for translation model training in statistical machine t...
We use feature decay algorithms (FDA) for fast deployment of accurate statistical machine translatio...
2014-07-28The goal of machine translation is to translate from one natural language into another usi...
Statistical machine translation, the task of translating text from one natural language into another...
Statistical machine translation (SMT) should benefit from linguistic information to improve perform...
© Cambridge University Press, 2015.Statistical machine translation (SMT) is gaining interest given t...
Translation needs have greatly increased during the last years. In many situations, text to be tran...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...
We present a novel online learning approach for statistical machine translation tailored to the comp...
Statistical Machine Translation (SMT) is by far the most dominant paradigm of Machine Translation. ...
We investigate a new approach for SMT system training within the streaming model of computation. We ...
Randomised techniques allow very big language models to be represented suc-cinctly. However, being b...
In this paper, we introduce our work of building a Streaming Multilingual Speech Model (SM2), which ...
Simultaneous machine translation systems rely on a policy to schedule read and write operations in o...
Statistical machine translation relies heavily on available parallel corpora, but SMT may not have t...
Parallel corpus is an indispensable resource for translation model training in statistical machine t...
We use feature decay algorithms (FDA) for fast deployment of accurate statistical machine translatio...
2014-07-28The goal of machine translation is to translate from one natural language into another usi...
Statistical machine translation, the task of translating text from one natural language into another...
Statistical machine translation (SMT) should benefit from linguistic information to improve perform...
© Cambridge University Press, 2015.Statistical machine translation (SMT) is gaining interest given t...
Translation needs have greatly increased during the last years. In many situations, text to be tran...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...
We present a novel online learning approach for statistical machine translation tailored to the comp...
Statistical Machine Translation (SMT) is by far the most dominant paradigm of Machine Translation. ...