Data collection. This dataset contains information on the eprints posted on arXiv from its launch in 1991 until the end of 2019 (1,589,006 unique eprints), together with data on their citations and the associated impact metrics. Here, eprints include preprints, conference proceedings, book chapters, data sets and commentary, i.e. all electronic material that has been posted on arXiv. The content and metadata of the arXiv eprints were retrieved from the arXiv API (https://arxiv.org/help/api/) as of 21 January 2020; the metadata comprise each eprint’s title, authors, abstract, subject category and arXiv ID (arXiv’s own eprint identifier). In addition, the associated citation data were derived from the Semantic Schola...
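As an illustration of the metadata fields named above, the following is a minimal sketch of how an Atom feed of the kind returned by the arXiv API could be parsed into records. It is not the dataset authors' actual pipeline, which the description does not show. The function name `parse_arxiv_feed`, the record keys, and the hand-written `SAMPLE_FEED` (including its placeholder ID, title and author names) are assumptions made for this example; only the Python standard library is used and no network request is made.

```python
import xml.etree.ElementTree as ET

# XML namespaces used in arXiv API Atom responses.
ATOM = "{http://www.w3.org/2005/Atom}"
ARXIV = "{http://arxiv.org/schemas/atom}"

# Hand-written placeholder feed (hypothetical values, not real arXiv data).
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:arxiv="http://arxiv.org/schemas/atom">
  <entry>
    <id>http://arxiv.org/abs/0000.00000v1</id>
    <title>An  example
      title</title>
    <summary>An example abstract.</summary>
    <author><name>A. Author</name></author>
    <author><name>B. Author</name></author>
    <arxiv:primary_category term="cs.DL"/>
  </entry>
</feed>"""


def parse_arxiv_feed(xml_text):
    """Return one dict per <entry> with ID, title, authors, abstract, category."""
    root = ET.fromstring(xml_text)
    records = []
    for entry in root.iter(ATOM + "entry"):
        abs_url = entry.findtext(ATOM + "id", default="")
        primary = entry.find(ARXIV + "primary_category")
        records.append({
            # The arXiv ID is whatever follows "/abs/" in the entry URL;
            # old-style IDs such as "hep-ex/0307015v1" contain a slash.
            "arxiv_id": abs_url.rsplit("/abs/", 1)[-1],
            # Collapse line breaks and repeated spaces inside text fields.
            "title": " ".join(entry.findtext(ATOM + "title", default="").split()),
            "authors": [a.findtext(ATOM + "name", default="")
                        for a in entry.findall(ATOM + "author")],
            "abstract": " ".join(entry.findtext(ATOM + "summary", default="").split()),
            "category": primary.get("term") if primary is not None else None,
        })
    return records


if __name__ == "__main__":
    for record in parse_arxiv_feed(SAMPLE_FEED):
        print(record)
```

In practice the XML text would come from an API query rather than a literal string; the parsing step stays the same.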
This dataset contains citations to published preprints, both before they are published and after the...
This is a full archive of metadata about papers on arxiv.org from 1993-2018, including abstracts. Da...
This dataset is meant to be used for experiments of Authorship Analysis. The dataset consists of abs...
arXiv publications dataset with simulated citation relationships...
This study aims to provide an overview of the citation rate of arXiv.org since its launch in August ...
This article shows an approach to the study of two fundamental aspects of the prepublication of scie...
We propose a new data set based on all publications from all scientific fields available on arXiv.or...
Since its creation in 1991, arXiv has become central to the diffusion of research in a number of fie...
unarXive is a scholarly data set containing publications’ full-text, annotated in-text citations, an...
The main purpose of this article is to reveal the effect of self-archiving on the citation impact of...
The present work has calculated the minimum Open Archive Impact Factors and Open Archive Immediacy I...
[EN] This article shows an approach to the study of two fundamental aspects of the prepublication of...
Physics articles self-archived in arXiv have up to 4 times as much citation impact as articles that ...
This dataset contains citation-based impact indicators (a.k.a, "measures") for ~151M distinct DOIs t...
The rise in the use of the arXiv preprint server (astro-ph) over the past decade has led to a major ...