Pretraining-based (PT-based) automatic evaluation metrics (e.g., BERTScore and BARTScore) have been widely used in several sentence generation tasks (e.g., machine translation and text summarization) due to their better correlation with human judgments over traditional overlap-based methods. Although PT-based methods have become the de facto standard for training grammatical error correction (GEC) systems, GEC evaluation still does not benefit from pretrained knowledge. This paper takes the first step towards understanding and improving GEC evaluation with pretraining. We first find that arbitrarily applying PT-based metrics to GEC evaluation brings unsatisfactory correlation results because of the excessive attention to inessential systems...
A major challenge in the field of Text Generation is evaluation: Human evaluations are cost-intensiv...
Natural Language Generation (NLG) refers to the operation of expressing the calculation results of a...
Automatic machine translation evaluation was crucial for the rapid development of machine translatio...
Grammar is one of the most important properties of natural language. It is a set of structural (i.e....
The problem of evaluating machine translation (MT) systems is more challenging than it may first app...
ChatGPT has demonstrated impressive performance in various downstream tasks. However, in the Chinese...
We address the problem of class imbalance in supervised grammatical error detection (GED) for non-na...
Evaluating generated text received new attention with the introduction of model-based metrics in rec...
The paper presents experiments on using a Grammatical Error Correction (GEC) model to assess the cor...
Machine learning models that perform grammar error correction (GEC) suffer from insufficient trainin...
ABSTRACT Many evaluation issues for grammatical error detection have previously been overlooked, mak...
The detection and correction of grammatical errors still represent very hard problems for modern err...
Grammatical Error Correction (GEC) and Grammatical Error Correction (GED) are two important tasks in...
Grammatical error correction, like other machine learning tasks, greatly benefits from large quant...
Automatic evaluation of language generation systems is a well-studied problem in Natural Language Pr...
A major challenge in the field of Text Generation is evaluation: Human evaluations are cost-intensiv...
Natural Language Generation (NLG) refers to the operation of expressing the calculation results of a...
Automatic machine translation evaluation was crucial for the rapid development of machine translatio...
Grammar is one of the most important properties of natural language. It is a set of structural (i.e....
The problem of evaluating machine translation (MT) systems is more challenging than it may first app...
ChatGPT has demonstrated impressive performance in various downstream tasks. However, in the Chinese...
We address the problem of class imbalance in supervised grammatical error detection (GED) for non-na...
Evaluating generated text received new attention with the introduction of model-based metrics in rec...
The paper presents experiments on using a Grammatical Error Correction (GEC) model to assess the cor...
Machine learning models that perform grammar error correction (GEC) suffer from insufficient trainin...
ABSTRACT Many evaluation issues for grammatical error detection have previously been overlooked, mak...
The detection and correction of grammatical errors still represent very hard problems for modern err...
Grammatical Error Correction (GEC) and Grammatical Error Correction (GED) are two important tasks in...
Grammatical error correction, like other machine learning tasks, greatly benefits from large quant...
Automatic evaluation of language generation systems is a well-studied problem in Natural Language Pr...
A major challenge in the field of Text Generation is evaluation: Human evaluations are cost-intensiv...
Natural Language Generation (NLG) refers to the operation of expressing the calculation results of a...
Automatic machine translation evaluation was crucial for the rapid development of machine translatio...