This paper presents an unsupervised extractive approach to summarize scientific long documents based on the Information Bottleneck principle. Inspired by previous work which uses the Information Bottleneck principle for sentence compression, we extend it to document level summarization with two separate steps. In the first step, we use signal(s) as queries to retrieve the key content from the source document. Then, a pre-trained language model conducts further sentence search and edit to return the final extracted summaries. Importantly, our work can be flexibly extended to a multi-view framework by different signals. Automatic evaluation on three scientific document datasets verifies the effectiveness of the proposed framework. The further...
The amount of data on the Internet has increased exponentially over the past decade. Therefore, we n...
Document summarization has been an intriguing task of Computational linguistics. A number of definit...
This paper discusses a text extraction approach to multi-document summarization that builds on singl...
This thesis focuses on methods for condensing large documents into highly concise sum-maries, achiev...
This paper investigates on sentence extraction based single Document summarization. It saves time in...
Researchers and scientists increasingly find themselves in the position of having to quickly underst...
Despite the exponential growth in scientific textual content, research publications are still the pr...
In this thesis, we propose a novel neural single-document extractive summarization model for long d...
Text summarization is a technique for shortening down or exacting a long text or document. It become...
We show that by making use of information common to document sets belonging to a common category, we...
We propose and develop a simple and efficient algorithm for generating extractive multi-document sum...
International audienceFeature Maximization is a feature selection method that deals efficiently with...
The task of automatic document summarization aims at generating short summaries for originally long ...
Along with the increasing number of scientific publications, many scientific communities must read t...
This dissertation provides a new method for sentence embedding and document summarization. The topic...
The amount of data on the Internet has increased exponentially over the past decade. Therefore, we n...
Document summarization has been an intriguing task of Computational linguistics. A number of definit...
This paper discusses a text extraction approach to multi-document summarization that builds on singl...
This thesis focuses on methods for condensing large documents into highly concise sum-maries, achiev...
This paper investigates on sentence extraction based single Document summarization. It saves time in...
Researchers and scientists increasingly find themselves in the position of having to quickly underst...
Despite the exponential growth in scientific textual content, research publications are still the pr...
In this thesis, we propose a novel neural single-document extractive summarization model for long d...
Text summarization is a technique for shortening down or exacting a long text or document. It become...
We show that by making use of information common to document sets belonging to a common category, we...
We propose and develop a simple and efficient algorithm for generating extractive multi-document sum...
International audienceFeature Maximization is a feature selection method that deals efficiently with...
The task of automatic document summarization aims at generating short summaries for originally long ...
Along with the increasing number of scientific publications, many scientific communities must read t...
This dissertation provides a new method for sentence embedding and document summarization. The topic...
The amount of data on the Internet has increased exponentially over the past decade. Therefore, we n...
Document summarization has been an intriguing task of Computational linguistics. A number of definit...
This paper discusses a text extraction approach to multi-document summarization that builds on singl...