Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic capabilities through the analysis of input-output behavior. Unfortunately, existing work on behavioral testing in Machine Translation (MT) is currently restricted to largely handcrafted tests covering a limited range of capabilities and languages. To address this limitation, we propose to use Large Language Models (LLMs) to generate a diverse set of source sentences tailored to test the behavior of MT models in a range of situations. We can then verify whether the MT model exhibits the expected behavior through matching candidate sets that are also generated using LLMs. Our approach aims to make behavioral testing of MT systems practical while ...
The term translationese has been used to describe features of translated text, and in this paper, we...
The problem of evaluating machine translation (MT) systems is more challenging than it may first app...
Neural Machine Translation (NMT) systems have received much recent attention due to their human-leve...
Large language models are becoming increasingly practical for translating code across programming la...
Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasing...
In order to assess the suitability of a text for machine translation (MT), the factors in play are m...
Translating text that diverges from the training domain is a key challenge for machine translation. ...
The effect of translationese has been studied in the field of machine translation (MT), mostly with ...
Large language models (LLMs) have demonstrated significant capability to generalize across a large n...
Automatic metrics play a crucial role in machine translation. Despite the widespread use of n-gram-b...
Discriminative training, a.k.a. tuning, is an important part of Statistical Machine Translation. Thi...
This paper shows the applicability of lan-guage testing techniques to machine trans-lation (MT) eval...
Abstract. Statistical Machine Translation (SMT) is currently used in real-time and commercial settin...
Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks wit...
In this paper, we propose a new method for effective error analysis of machine translation (MT) syst...
The term translationese has been used to describe features of translated text, and in this paper, we...
The problem of evaluating machine translation (MT) systems is more challenging than it may first app...
Neural Machine Translation (NMT) systems have received much recent attention due to their human-leve...
Large language models are becoming increasingly practical for translating code across programming la...
Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasing...
In order to assess the suitability of a text for machine translation (MT), the factors in play are m...
Translating text that diverges from the training domain is a key challenge for machine translation. ...
The effect of translationese has been studied in the field of machine translation (MT), mostly with ...
Large language models (LLMs) have demonstrated significant capability to generalize across a large n...
Automatic metrics play a crucial role in machine translation. Despite the widespread use of n-gram-b...
Discriminative training, a.k.a. tuning, is an important part of Statistical Machine Translation. Thi...
This paper shows the applicability of lan-guage testing techniques to machine trans-lation (MT) eval...
Abstract. Statistical Machine Translation (SMT) is currently used in real-time and commercial settin...
Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks wit...
In this paper, we propose a new method for effective error analysis of machine translation (MT) syst...
The term translationese has been used to describe features of translated text, and in this paper, we...
The problem of evaluating machine translation (MT) systems is more challenging than it may first app...
Neural Machine Translation (NMT) systems have received much recent attention due to their human-leve...