In recent years, corpus based approaches to machine translation have become predominant, with Statistical Machine Translation (SMT) being the most actively progressing area. Success of these approaches depends on the availability of parallel corpora. In this paper we propose Active Crowd Translation (ACT), a new paradigm where active learning and crowd-sourcing come together to enable automatic translation for low-resource language pairs. Active learning aims at reducing cost of label acquisition by prioritizing the most informative data for annotation, while crowd-sourcing reduces cost by using the power of the crowds to make do for the lack of expensive language experts. We experiment and compare our active learning strategies with strong...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
We explore how to improve machine translation systems by adding more translation data in situations ...
Corpus based approaches to automatic translation such as Example Based and Statistical Machine Trans...
The limited availability of in-domain training data is a major issue in the training of application-...
The limited availability of in-domain training data is a major issue in the training of application-...
Statistical Machine Translation (SMT) models learn how to translate by examining a bilingual paralle...
The logistics of collecting resources for Machine Translation (MT) has always been a cause of concer...
Building machine translation (MT) for many minority languages in the world is a serious challenge. ...
We present a widely applicable methodology to bring machine translation (MT) to under-resourced lang...
In data-driven Machine Translation approaches, like Example-Based Machine Translation (EBMT) (Brown...
This paper investigates active learning to improve statistical machine translation (SMT) for low-res...
Statistical machine translation (SMT) mod-els need large bilingual corpora for train-ing, which are ...
This work studies the capability of Crowd sourcing related to translation and software localization ...
Crowdsourcing makes it possible to create translations at much lower cost than hiring professional t...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
We explore how to improve machine translation systems by adding more translation data in situations ...
Corpus based approaches to automatic translation such as Example Based and Statistical Machine Trans...
The limited availability of in-domain training data is a major issue in the training of application-...
The limited availability of in-domain training data is a major issue in the training of application-...
Statistical Machine Translation (SMT) models learn how to translate by examining a bilingual paralle...
The logistics of collecting resources for Machine Translation (MT) has always been a cause of concer...
Building machine translation (MT) for many minority languages in the world is a serious challenge. ...
We present a widely applicable methodology to bring machine translation (MT) to under-resourced lang...
In data-driven Machine Translation approaches, like Example-Based Machine Translation (EBMT) (Brown...
This paper investigates active learning to improve statistical machine translation (SMT) for low-res...
Statistical machine translation (SMT) mod-els need large bilingual corpora for train-ing, which are ...
This work studies the capability of Crowd sourcing related to translation and software localization ...
Crowdsourcing makes it possible to create translations at much lower cost than hiring professional t...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
We explore how to improve machine translation systems by adding more translation data in situations ...