We present the first ever results show-ing that tuning a machine translation sys-tem against a semantic frame based ob-jective function, MEANT, produces more robustly adequate translations than tun-ing against BLEU or TER as measured across commonly used metrics and human subjective evaluation. Moreover, for in-formal web forum data, human evalua-tors preferredMEANT-tuned systems over BLEU- or TER-tuned systems by a sig-nificantly wider margin than that for for-mal newswire—even though automatic se-mantic parsing might be expected to fare worse on informal language. We argue that by preserving themeaning of the trans-lations as captured by semantic frames right in the training process, an MT sys-tem is constrained to make more accu-rate cho...
Minimum error rate training (MERT) in-volves choosing parameter values for a machine translation (MT...
Machine Translation (MT) systems tend to underperform when faced with long, linguistically complex s...
Translations generated by current statistical systems often have a large variance, in terms of their...
Discriminative training, a.k.a. tuning, is an important part of Statistical Machine Translation. Thi...
Automatic evaluation of machine translation (MT) is based on the idea that the quality of the MT out...
Translation systems are generally trained to optimize BLEU, but many alternative metrics are availab...
Many machine translation (MT) evaluation metrics have been shown to correlate better with human judg...
Automatic evaluation of machine translation (MT) is based on the idea that the quality of the MT out...
Automatic evaluation of machine translation (MT) is based on the idea that the quality of the MT out...
Automatic metrics are fundamental for the development and evaluation of machine translation systems....
Evaluation of machine translation (MT) output is a challenging task. In most cases, there is no sing...
Evaluation of machine translation (MT) output is a challenging task. In most cases, there is no sing...
We present a comparison of automatic metrics against human evaluations of translation quality in sev...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2014.This thesis investigates d...
The problem of evaluating machine translation (MT) systems is more challenging than it may first app...
Minimum error rate training (MERT) in-volves choosing parameter values for a machine translation (MT...
Machine Translation (MT) systems tend to underperform when faced with long, linguistically complex s...
Translations generated by current statistical systems often have a large variance, in terms of their...
Discriminative training, a.k.a. tuning, is an important part of Statistical Machine Translation. Thi...
Automatic evaluation of machine translation (MT) is based on the idea that the quality of the MT out...
Translation systems are generally trained to optimize BLEU, but many alternative metrics are availab...
Many machine translation (MT) evaluation metrics have been shown to correlate better with human judg...
Automatic evaluation of machine translation (MT) is based on the idea that the quality of the MT out...
Automatic evaluation of machine translation (MT) is based on the idea that the quality of the MT out...
Automatic metrics are fundamental for the development and evaluation of machine translation systems....
Evaluation of machine translation (MT) output is a challenging task. In most cases, there is no sing...
Evaluation of machine translation (MT) output is a challenging task. In most cases, there is no sing...
We present a comparison of automatic metrics against human evaluations of translation quality in sev...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2014.This thesis investigates d...
The problem of evaluating machine translation (MT) systems is more challenging than it may first app...
Minimum error rate training (MERT) in-volves choosing parameter values for a machine translation (MT...
Machine Translation (MT) systems tend to underperform when faced with long, linguistically complex s...
Translations generated by current statistical systems often have a large variance, in terms of their...