Zarrieß S, Loth S, Schlangen D. Reading Times Predict the Quality of Generated Text Above and Beyond Human Ratings. In: Belz A, Gatt A, Portet F, Purver M, eds. Proceedings of the 15th European Workshop on Natural Language Generation. BRIGHTON, East Sussex, United Kingdom: The Association for Computational Linguistics; 2015: 38-47.Typically, human evaluation of NLG output is based on user ratings. We collected ratings and reading time data in a simple, low-cost experimental paradigm for text generation. Participants were presented corpus texts, automatically linearised texts, and texts containing predicted referring expressions and automatic linearisation. We demonstrate that the reading time metrics outperform the ratings in classifying t...
We describe SkillSum, a Natural Language Generation (NLG) system that generates a personalised feedb...
What are the effects of word-by-word predictability on sentence processing times during the natural ...
Evaluation of the narrative text generated by machines has traditionally been a challenge, particula...
Automatic methods and metrics that assess various quality criteria of automatically generated texts ...
Evaluating the output of NLG systems is notoriously difficult, and performing assessments of text qu...
We consider the evaluation problem in Natural Language Generation (NLG) and present results for eval...
Currently, there is little agreement as to how Natural Language Generation (NLG) systems should be e...
International audienceIt has been shown that complexity metrics, computed by a syntactic parser, is ...
It has been shown that complexity metrics, computed by a syntactic parser, is a predictor of human r...
There is growing interest in using automatically computed corpus-based evaluation metrics to evaluat...
Automatic evaluation for open-ended natural language generation tasks remains a challenge. We propos...
The research field of Natural Language Generation offers practitioners a wide range of techniques fo...
Automatic evaluation metrics are fast and cost-effective measurements of the quality of a Machine Tr...
Revisiting Readability: A Unified Framework for Predicting Text Quality We combine lexical, syntacti...
How well can we predict reading times and thus cognitive processing load? This study first assesses ...
We describe SkillSum, a Natural Language Generation (NLG) system that generates a personalised feedb...
What are the effects of word-by-word predictability on sentence processing times during the natural ...
Evaluation of the narrative text generated by machines has traditionally been a challenge, particula...
Automatic methods and metrics that assess various quality criteria of automatically generated texts ...
Evaluating the output of NLG systems is notoriously difficult, and performing assessments of text qu...
We consider the evaluation problem in Natural Language Generation (NLG) and present results for eval...
Currently, there is little agreement as to how Natural Language Generation (NLG) systems should be e...
International audienceIt has been shown that complexity metrics, computed by a syntactic parser, is ...
It has been shown that complexity metrics, computed by a syntactic parser, is a predictor of human r...
There is growing interest in using automatically computed corpus-based evaluation metrics to evaluat...
Automatic evaluation for open-ended natural language generation tasks remains a challenge. We propos...
The research field of Natural Language Generation offers practitioners a wide range of techniques fo...
Automatic evaluation metrics are fast and cost-effective measurements of the quality of a Machine Tr...
Revisiting Readability: A Unified Framework for Predicting Text Quality We combine lexical, syntacti...
How well can we predict reading times and thus cognitive processing load? This study first assesses ...
We describe SkillSum, a Natural Language Generation (NLG) system that generates a personalised feedb...
What are the effects of word-by-word predictability on sentence processing times during the natural ...
Evaluation of the narrative text generated by machines has traditionally been a challenge, particula...