In the field of automatic text simplification, assessing whether or not the meaning of the original text has been preserved during simplification is of paramount importance. Metrics relying on n-gram overlap assessment may struggle to deal with simplifications which replace complex phrases with their simpler paraphrases. Current evaluation metrics for meaning preservation based on large language models (LLMs), such as BertScore in machine translation or QuestEval in summarization, have been proposed. However, none has a strong correlation with human judgment of meaning preservation. Moreover, such metrics have not been assessed in the context of text simplification research. In this study, we present a meta-evaluation of several metrics we ...
Machine translation translates a text from one language to another, while text simplification conver...
Assessing the semantic similarity between two texts is a central task in many applications, includin...
The rapid development of such natural language processing tasks as style transfer, paraphrase, and m...
In the field of automatic text simplification, assessing whether or not the meaning of the original ...
This study explores the possibility of re-placing the costly and time-consuming human evaluation of ...
Automatic evaluation remains an open research question in Natural Language Generation. In the contex...
This paper presents the results of the shared task of the Workshop on Quality Assessment for Text Si...
International audienceThe evaluation of text simplification (TS) systems remains an open challenge. ...
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribu...
Evaluating the quality of generated text is difficult, since traditional NLG evaluation metrics, foc...
Readability assessment can play a role in the evaluation of a simplification algorithm as well as in...
In order to simplify a sentence, human editors perform multiple rewriting transformations: they spli...
The quality of the output generated by automatic Text Simplification (TS) systems is traditionally a...
The rapid development of such natural language processing tasks as style transfer, paraphrase, and m...
Metrics for graph-based meaning representations (e.g., Abstract Meaning Representation, AMR) can hel...
Machine translation translates a text from one language to another, while text simplification conver...
Assessing the semantic similarity between two texts is a central task in many applications, includin...
The rapid development of such natural language processing tasks as style transfer, paraphrase, and m...
In the field of automatic text simplification, assessing whether or not the meaning of the original ...
This study explores the possibility of re-placing the costly and time-consuming human evaluation of ...
Automatic evaluation remains an open research question in Natural Language Generation. In the contex...
This paper presents the results of the shared task of the Workshop on Quality Assessment for Text Si...
International audienceThe evaluation of text simplification (TS) systems remains an open challenge. ...
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribu...
Evaluating the quality of generated text is difficult, since traditional NLG evaluation metrics, foc...
Readability assessment can play a role in the evaluation of a simplification algorithm as well as in...
In order to simplify a sentence, human editors perform multiple rewriting transformations: they spli...
The quality of the output generated by automatic Text Simplification (TS) systems is traditionally a...
The rapid development of such natural language processing tasks as style transfer, paraphrase, and m...
Metrics for graph-based meaning representations (e.g., Abstract Meaning Representation, AMR) can hel...
Machine translation translates a text from one language to another, while text simplification conver...
Assessing the semantic similarity between two texts is a central task in many applications, includin...
The rapid development of such natural language processing tasks as style transfer, paraphrase, and m...