This work contains the data used to test and evaluate tools for references extraction from papers in PDF format. This work derives from my thesis on the references extraction and parsing tools, where the selected tools are: Anystyle, Cermine, ExCite, Grobid, Pdfssa4met, Scholarcy and Science Parse. The folder PDF_papers contains the 56 papers in PDF used as input dataset. The names of the file are composed by the abridged form of the research field they belong to plus a numeric value which orders them from 1 to 54. As regards z_notes the numbering restarts from 0. These last two files are particular since they do not containing an explicitly named references section. They represent a further test for the tools. These papers have been sele...
These data have been gathered in the context of a study aiming to investigate citation practices for...
Metadata extraction is the process of describing extrinsic and intrinsic qualities of the resource s...
At the Mid Sweden University (MiUN) students are expected to cite relevant and domain specific intel...
Many solutions have been provided to extract bibliographic references from PDF papers. Machine learn...
The aim of this work is to identify all, and only, the tools which, given a full text paper in PDF f...
Slides of the presentation of the paper Cioffi, A., & Peroni, S. (2022). Structured References from...
This paper addresses the problem of extracting and segmenting references from PDF documents. The nov...
This paper evaluates the performance of tools for the extraction of metadata from scientific article...
This paper evaluates the performance of tools for the extraction of metadata from scientific article...
At the Mid Sweden University (MiUN) students are expected to cite relevant and domain specific intel...
Scientific full text papers are usually stored in separate places than their underlying research dat...
There are several open-source tools available to extract the bibliographic references of the Pdf. Th...
Scientific full text papers are usually stored in separate places than their underlying research dat...
International audienceScientific papers potentially offer a wealth of information that allows one to...
As part of a larger project to automatically reference link the online scholarly literature, an atte...
These data have been gathered in the context of a study aiming to investigate citation practices for...
Metadata extraction is the process of describing extrinsic and intrinsic qualities of the resource s...
At the Mid Sweden University (MiUN) students are expected to cite relevant and domain specific intel...
Many solutions have been provided to extract bibliographic references from PDF papers. Machine learn...
The aim of this work is to identify all, and only, the tools which, given a full text paper in PDF f...
Slides of the presentation of the paper Cioffi, A., & Peroni, S. (2022). Structured References from...
This paper addresses the problem of extracting and segmenting references from PDF documents. The nov...
This paper evaluates the performance of tools for the extraction of metadata from scientific article...
This paper evaluates the performance of tools for the extraction of metadata from scientific article...
At the Mid Sweden University (MiUN) students are expected to cite relevant and domain specific intel...
Scientific full text papers are usually stored in separate places than their underlying research dat...
There are several open-source tools available to extract the bibliographic references of the Pdf. Th...
Scientific full text papers are usually stored in separate places than their underlying research dat...
International audienceScientific papers potentially offer a wealth of information that allows one to...
As part of a larger project to automatically reference link the online scholarly literature, an atte...
These data have been gathered in the context of a study aiming to investigate citation practices for...
Metadata extraction is the process of describing extrinsic and intrinsic qualities of the resource s...
At the Mid Sweden University (MiUN) students are expected to cite relevant and domain specific intel...