We present LQVSumm, a corpus of about 2000 automatically created extractive multi-document summaries from the TAC 2011 shared task on Guided Summarization, which we annotated with several types of linguistic quality violations. Examples for such violations include pronouns that lack antecedents or ungrammatical clauses. We give details on the annotation scheme and show that inter-annotator agreement is good given the open-ended nature of the task. The annotated summaries have previously been scored for Readability on a numeric scale by human annotators in the context of the TAC challenge; we show that the number of instances of violations of linguistic quality of a summary correlates with these intuitively assigned numeric scores. On a syst...
Natural Language Processing (NLP) methods demand elaborate strategies for the creation of corpora th...
In this paper, we use the information redundancy in multilingual input to correct errors in machine ...
Automatic summarization has advanced greatly in the past few decades. However, there remains a huge ...
To date, few attempts have been made to develop and validate methods for automatic evaluation of lin...
We present a novel system combination of machine translation and text summariza-tion which provides ...
Recently, the credibility of information on the Web has become an important issue. In addition to te...
While large language models (LLMs) already achieve strong performance on standard generic summarizat...
To date, few attempts have been made to develop and validate methods for automatic evaluation of lin...
We present a large-scale meta evaluation of eight evaluation measures for both single-document and...
Multi-document summaries produced via sentence extraction often suffer from a number of cohesion pro...
The propensity of abstractive summarization systems to make factual errors has been the subject of s...
In this paper we address two issues. The first one analyzes whether the performance of a text summar...
Abstract Recent advances in large language models (LLMs) have demonstrated remarkable successes in z...
Abstract — Automatic text summarization is based on numerical, linguistical and empirical methods wh...
The multilingual summarization pilot task at TAC’11 opened a lot of problems we are facing when we t...
Natural Language Processing (NLP) methods demand elaborate strategies for the creation of corpora th...
In this paper, we use the information redundancy in multilingual input to correct errors in machine ...
Automatic summarization has advanced greatly in the past few decades. However, there remains a huge ...
To date, few attempts have been made to develop and validate methods for automatic evaluation of lin...
We present a novel system combination of machine translation and text summariza-tion which provides ...
Recently, the credibility of information on the Web has become an important issue. In addition to te...
While large language models (LLMs) already achieve strong performance on standard generic summarizat...
To date, few attempts have been made to develop and validate methods for automatic evaluation of lin...
We present a large-scale meta evaluation of eight evaluation measures for both single-document and...
Multi-document summaries produced via sentence extraction often suffer from a number of cohesion pro...
The propensity of abstractive summarization systems to make factual errors has been the subject of s...
In this paper we address two issues. The first one analyzes whether the performance of a text summar...
Abstract Recent advances in large language models (LLMs) have demonstrated remarkable successes in z...
Abstract — Automatic text summarization is based on numerical, linguistical and empirical methods wh...
The multilingual summarization pilot task at TAC’11 opened a lot of problems we are facing when we t...
Natural Language Processing (NLP) methods demand elaborate strategies for the creation of corpora th...
In this paper, we use the information redundancy in multilingual input to correct errors in machine ...
Automatic summarization has advanced greatly in the past few decades. However, there remains a huge ...