This dataset is part of a larger project on using headlines to predict the social media popularity of news articles. The first stage of the project was to develop methods of automatically extracting news values scores, which offer a new perspective on analysing headlines. The dataset consists of two headlines corpora -- The Guardian and New York Times -- collected in 2014 using news outlet APIs and used to develop the news values extraction methods. Each corpus includes a unique headline identifier (to enable recreating the corpus by querying the relevant API) and news values scores for each headline
People rely on news to know what is happening around the world and inform their daily lives. In toda...
The headline of a news article is designed to succinctly summarize its content, providing the reader...
The dataset consists of a list of news articles headlines retrieved from tweets published by @BBCBre...
Headlines play a crucial role in attracting audiences’ attention to online artefacts (e.g. news arti...
This dataset is part of a larger project on using headlines to predict the social media popularity o...
This data set contains automated sentiment and emotionality annotations of 23 million headlines from...
This dataset is part of the Monash, UEA & UCR time series regression repository. http://tseregressio...
This paper explores the problem of media content data analysis with the focus on the phenomenon of v...
In recent years, several datasets have been released that include images and text, giving impulse ...
This dataset is part of the Monash, UEA & UCR time series regression repository. http://tseregressio...
The analysis of query logs from blog search engines show that news-related queries occupy a signific...
This repository contains the enrichments for the dataset The New York Times Annotated Corpus develop...
Headline or short summary generation is an important problem in Text Summarization and has several p...
Event extraction from news articles is a commonly required prerequisite for various tasks, such as a...
The way we formulate headlines matters -- this is the central tenet of this thesis. Headlines pl...
People rely on news to know what is happening around the world and inform their daily lives. In toda...
The headline of a news article is designed to succinctly summarize its content, providing the reader...
The dataset consists of a list of news articles headlines retrieved from tweets published by @BBCBre...
Headlines play a crucial role in attracting audiences’ attention to online artefacts (e.g. news arti...
This dataset is part of a larger project on using headlines to predict the social media popularity o...
This data set contains automated sentiment and emotionality annotations of 23 million headlines from...
This dataset is part of the Monash, UEA & UCR time series regression repository. http://tseregressio...
This paper explores the problem of media content data analysis with the focus on the phenomenon of v...
In recent years, several datasets have been released that include images and text, giving impulse ...
This dataset is part of the Monash, UEA & UCR time series regression repository. http://tseregressio...
The analysis of query logs from blog search engines show that news-related queries occupy a signific...
This repository contains the enrichments for the dataset The New York Times Annotated Corpus develop...
Headline or short summary generation is an important problem in Text Summarization and has several p...
Event extraction from news articles is a commonly required prerequisite for various tasks, such as a...
The way we formulate headlines matters -- this is the central tenet of this thesis. Headlines pl...
People rely on news to know what is happening around the world and inform their daily lives. In toda...
The headline of a news article is designed to succinctly summarize its content, providing the reader...
The dataset consists of a list of news articles headlines retrieved from tweets published by @BBCBre...