This dataset contains 871 articles from Wikipedia (retrieved on 8th August 2016), selected from the list of featured articles in the 'Media', 'Literature and Theater', 'Music biographies', 'Media biographies', 'History biographies' and 'Video gaming' categories. For each article, the structure of the document, i.e. its sections and subsections, is extracted. The dataset also contains a proposed clustering of the event names to increase the comparability of Wikipedia articles.
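The description above does not include the extraction code itself; the following is a minimal Python sketch of how the section and subsection structure could be recovered from an article's raw wikitext, assuming the standard '== Heading ==' markup. The function name and sample text are illustrative only and are not part of the dataset.

import re

# Wikitext heading syntax: '== Title ==' is a section (level 2),
# '=== Title ===' a subsection (level 3), and so on.
HEADING_RE = re.compile(r"^(={2,6})\s*(.*?)\s*\1\s*$", re.MULTILINE)

def extract_structure(wikitext):
    """Return (level, title) pairs in document order.

    Level 2 corresponds to sections, level 3 to subsections.
    """
    return [(len(m.group(1)), m.group(2)) for m in HEADING_RE.finditer(wikitext)]

if __name__ == "__main__":
    sample = (
        "== Gameplay ==\n"
        "Intro paragraph.\n"
        "=== Plot ===\n"
        "Details.\n"
        "== Reception ==\n"
    )
    print(extract_structure(sample))
    # [(2, 'Gameplay'), (3, 'Plot'), (2, 'Reception')]

Comparing articles by the resulting sequences of (level, title) pairs is one plausible use of the extracted structure across the six categories.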
A video games NLP dataset extracted from Wikipedia. For all articles, the figures and tables have b...
Most traditional text clustering methods are based on “bag of words” (BOW) representation based on ...
Wikipedia is the largest existing knowledge repository that is growing on a genuine crowdsourcing su...
This dataset contains 871 articles from Wikipedia (retrieved on 8th August 2016), selected from the ...
Movies-related articles extracted from Wikipedia. For all articles, the figures and tables have bee...
Reflecting the rapid growth of science, technology, and culture, it has become common practice to co...
A subset of articles extracted from the French Wikipedia XML dump. Data published here include 5 dif...
Wikipedia is the largest existing knowledge repository that is growing on a genuine crowdsourcing su...
This thesis deals with automatic type extraction in English Wikipedia articles and their attributes....
Wikipedia is a rich source of information across many knowledge domains. Yet, ...
Three corpora in different domains extracted from Wikipedia. For all datasets, the figures and tabl...
The process whereby inferences are made from textual data is broadly referred to as text mining. In ...
For each existing Wikipedia language edition, the dataset contains a classification of the articles ...
As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...