Dataset Metrics Total size of data uncompressed: 59,515,177,346 bytes Number of objects (submissions): 19,456,493 Reddit API Documentation: https://www.reddit.com/dev/api/ Overview This dataset contains all available submissions from Reddit during the month of May, 2019 (using UTC time boundaries). The data has been split to accommodate the file upload limitations for dataverse. Each file is a collection of json objects (ndjson). Each file was then compressed using zstandard compression (https://facebook.github.io/zstd). The files should be ordered by the id of the submission (represented by the id field). The time that each object was ingested is recorded in the retrieved_on field (in epoch seconds). Methodology Monthly Red...
The Coronavirus Open Citations Dataset curated by OpenCitations currently contains (as of 16 May 202...
Sets of statistics derived from posts to two sets of online forums, each serving an ideological comm...
Version 159 of the dataset. NOTES: Data for 3/15 - 3/18 was not extracted due to unexpected and unan...
The Pushshift Reddit Dataset We provide a small sample of the Pushshift Reddit dataset. The sample ...
Reddit is a social news, content rating and discussion website. It's one of the most popular sites o...
Reddit contents and complementary data regarding the r/The_Donald community and its main moderation ...
IMPORTANT: Dataset is now completely accessible from Github: https://github.com/mediatechnologycente...
This data set contains anonymized data collected from Reddit (via the Pushshift API) and StackOverfl...
Abstract With the rapid proliferation of social media sites, researchers have increasingly turned t...
<p>This file contains the posting preferences for over 850,000 active reddit users. This sample was ...
Social media are becoming more popular as a source of data for social science researchers. These dat...
Dataset Metrics Total size of data uncompressed:115901693 bytes Number of objects (submissions):...
The WE1S reddit dataset contains 1,034,174 Reddit comments containing the terms "humanities", "liber...
These four datasets are gathered from Instagram users who were chosen randomly. The MainDataset enc...
This dataset was used in the manuscript "Scaling laws and dynamics of hashtags on Twitter".. The Tw...
The Coronavirus Open Citations Dataset curated by OpenCitations currently contains (as of 16 May 202...
Sets of statistics derived from posts to two sets of online forums, each serving an ideological comm...
Version 159 of the dataset. NOTES: Data for 3/15 - 3/18 was not extracted due to unexpected and unan...
The Pushshift Reddit Dataset We provide a small sample of the Pushshift Reddit dataset. The sample ...
Reddit is a social news, content rating and discussion website. It's one of the most popular sites o...
Reddit contents and complementary data regarding the r/The_Donald community and its main moderation ...
IMPORTANT: Dataset is now completely accessible from Github: https://github.com/mediatechnologycente...
This data set contains anonymized data collected from Reddit (via the Pushshift API) and StackOverfl...
Abstract With the rapid proliferation of social media sites, researchers have increasingly turned t...
<p>This file contains the posting preferences for over 850,000 active reddit users. This sample was ...
Social media are becoming more popular as a source of data for social science researchers. These dat...
Dataset Metrics Total size of data uncompressed:115901693 bytes Number of objects (submissions):...
The WE1S reddit dataset contains 1,034,174 Reddit comments containing the terms "humanities", "liber...
These four datasets are gathered from Instagram users who were chosen randomly. The MainDataset enc...
This dataset was used in the manuscript "Scaling laws and dynamics of hashtags on Twitter".. The Tw...
The Coronavirus Open Citations Dataset curated by OpenCitations currently contains (as of 16 May 202...
Sets of statistics derived from posts to two sets of online forums, each serving an ideological comm...
Version 159 of the dataset. NOTES: Data for 3/15 - 3/18 was not extracted due to unexpected and unan...