Large state-of-the-art corpora for training neural networks to create abstractive summaries are mostly limited to the news genre, as it is expensive to acquire human-written summaries for other types of text at a large scale. In this paper, we present a novel automatic corpus construction approach to tackle this issue as well as three new large open-licensed summarization corpora based on our approach that can be used for training abstractive summarization models. Our constructed corpora contain fictional narratives, descriptive texts, and summaries about movies, television, and book series from different domains. All sources use a creative commons (CC) license, hence we can provide the corpora for download. In addition, we also provide a r...
As the information on the internet continues to expand exponentially, machine learning, is becoming ...
In this paper, we present a Text Summarisation tool, compendium, capable of generating the most comm...
Recently, neural network-based approaches have pushed the performance of both extractive and abstrac...
Text summarization models are approaching human levels of fidelity. Existing benchmarking corpora pr...
Abstract \ud \ud Due to the vast amount of information we are faced with, summarization has become a...
Automatic summarization has so far focused on datasets of ten to twenty rather short documents of mo...
Live blogs are an increasingly popular news format to cover breaking news and live events in online ...
With the Internet becoming widespread, countless articles and multimedia content have been filled in...
Natural Language Processing (NLP) methods demand elaborate strategies for the creation of corpora th...
Recent deep learning and sequence-to-sequence learning technology have produced impressive results o...
This work proposes a trainable system for summarizing news and obtaining an approximate argumentati...
We present a new system for custom summarizations of large text corpora at interactive speed. The ta...
We explore how machine learning can be employed to learn rulesets for the traditional modules of con...
Automatic summarization of texts is now crucial for several information retrieval tasks owing to the...
Most people need textual or visual interfaces in order to make sense of Semantic Web data. In this t...
As the information on the internet continues to expand exponentially, machine learning, is becoming ...
In this paper, we present a Text Summarisation tool, compendium, capable of generating the most comm...
Recently, neural network-based approaches have pushed the performance of both extractive and abstrac...
Text summarization models are approaching human levels of fidelity. Existing benchmarking corpora pr...
Abstract \ud \ud Due to the vast amount of information we are faced with, summarization has become a...
Automatic summarization has so far focused on datasets of ten to twenty rather short documents of mo...
Live blogs are an increasingly popular news format to cover breaking news and live events in online ...
With the Internet becoming widespread, countless articles and multimedia content have been filled in...
Natural Language Processing (NLP) methods demand elaborate strategies for the creation of corpora th...
Recent deep learning and sequence-to-sequence learning technology have produced impressive results o...
This work proposes a trainable system for summarizing news and obtaining an approximate argumentati...
We present a new system for custom summarizations of large text corpora at interactive speed. The ta...
We explore how machine learning can be employed to learn rulesets for the traditional modules of con...
Automatic summarization of texts is now crucial for several information retrieval tasks owing to the...
Most people need textual or visual interfaces in order to make sense of Semantic Web data. In this t...
As the information on the internet continues to expand exponentially, machine learning, is becoming ...
In this paper, we present a Text Summarisation tool, compendium, capable of generating the most comm...
Recently, neural network-based approaches have pushed the performance of both extractive and abstrac...