Many solutions have been proposed for extracting bibliographic references from PDF papers. Machine learning, rule-based, and regular-expression approaches are among the methods most commonly adopted by tools that address this task. This work aims to identify and evaluate all and only the tools which, given a full-text paper in PDF format, can recognise, extract and parse bibliographic references. We identified seven tools: Anystyle, Cermine, ExCite, Grobid, Pdfssa4met, Scholarcy and Science Parse. We compared and evaluated them against a corpus of 56 PDF articles published in 27 subject areas. Overall, Anystyle obtained the best score, followed by Cermine. However, in some subject areas, other tools achieved better results for specific tasks.
Current citation practices observed in articles are very noisy, confusing, and not standardised, mak...
Citations play an essential role in navigating academic literature and following chains of evidence ...
This research aims to develop a module for information retrieval that can trace references from bibl...
The aim of this work is to identify all, and only, the tools which, given a full text paper in PDF f...
This work contains the data used to test and evaluate tools for references extraction from papers in...
Slides of the presentation of the paper Cioffi, A., & Peroni, S. (2022). Structured References from...
There are several open-source tools available to extract the bibliographic references of a PDF. Th...
Automatic bibliographic reference annotation involves the tokenization and ide...
Abstract. A number of algorithms and approaches have been proposed towards the problem of scanning a...
Based on state-of-the-art machine learning techniques, GROBID (GeneRation ...
In this paper, we present the automatic annotation of bibliographical references' zone in papers and...
Citation indices are increasingly being used not only as navigational tools for researchers, but als...
Metadata extraction is the process of describing extrinsic and intrinsic qualities of the resource s...