Computer-produced summaries have traditionally been evaluated by comparing them with human-produced summaries using the F-measure. However, the F-measure is not appropriate when alternative sentences are possible in a human-produced extract. In this paper, we examine some evaluation methods devised to overcome the problem, including utility-based evaluation. By giving scores for moderately important sentences that does not appear in the human-produced extract, utility-based evaluation can resolve the problem. However, the method requires much effort from humans to provide data for evaluation. In this paper, we first propose a pseudo-utility-based evaluation that uses human-produced extracts at different compression ratios. To evaluate the e...
This paper describes three novel techniques to\ud automatically evaluate sentence extract summaries....
Two methods are used for evaluation of summarization systems: an evaluation of generated summaries a...
We propose in this paper an automatic evaluation procedure based on a metric which could provide sum...
The ability to effectively evaluate a learned model is a critical component of machine learning rese...
The term summary, of a statement or of an account, comprises the chief points or the sum and substan...
In this paper, we compare some automatic and manual methods for summary evaluation. One of the essen...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
We present a series of experiments to demonstrate the validity of Relative Utility (RU) as a measure...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
Two methods are used for evaluation of summarization systems: an evaluation of generated summaries a...
This paper describes three novel techniques to\ud automatically evaluate sentence extract summaries....
Two methods are used for evaluation of summarization systems: an evaluation of generated summaries a...
We propose in this paper an automatic evaluation procedure based on a metric which could provide sum...
The ability to effectively evaluate a learned model is a critical component of machine learning rese...
The term summary, of a statement or of an account, comprises the chief points or the sum and substan...
In this paper, we compare some automatic and manual methods for summary evaluation. One of the essen...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
We present a series of experiments to demonstrate the validity of Relative Utility (RU) as a measure...
International audienceThe increasing volume of textual information on any topic requires its compres...
International audienceThe increasing volume of textual information on any topic requires its compres...
Two methods are used for evaluation of summarization systems: an evaluation of generated summaries a...
This paper describes three novel techniques to\ud automatically evaluate sentence extract summaries....
Two methods are used for evaluation of summarization systems: an evaluation of generated summaries a...
We propose in this paper an automatic evaluation procedure based on a metric which could provide sum...