Softcite software mention extraction from the CORD-19 publications This dataset is the first result of the extraction of software mentions from the set of publications of the CORD-19 corpus (https://allenai.org/data/cord-19) by the Softcite software recognizer, see https://github.com/ourresearch/software-mentions. The CORD-19 version used for this dataset is the one dated 2020-09-11, using the metadata.csv file only. We re-harvested the PDF with https://github.com/kermitt2/article-dataset-builder in order to also extract coordinates of software mentions in the PDF and to take advantage of the latest version of GROBID to produce better full text extraction from PDF. Note: We are working on a new version of this dataset with the CORD-19 v...
These two data file contains information on patent citations for USPTO utility patents granted betwe...
The code accompanying our new dataset of software mentions in biomedical papers (dataset, preprint)....
Software and data have become major components of modern research, which is also reflected by an inc...
Softcite software mention extraction from the CORD-19 publications This dataset is the first resul...
The Softcite dataset is a gold-standard dataset of software mentions in research publications, a fre...
In this paper, we investigate progress toward improved software citation by examining current softwa...
This dataset consists of two JSON files containing: - software-contexts.json: 49195 sentences from ...
In an effort to automate the process of identifying and analyzing the use of software in biomedical ...
International audienceScientific papers potentially offer a wealth of information that allows one to...
Appropriate citation of software plays an important role in academic publications to make research r...
Περιέχει το πλήρες κείμενοBased on state of the art machine learning techniques, GROBID (GeneRation ...
Update: As of March 27, 2020 we have now analyzed 31,527 distinct sources (articles and preprints) f...
This work contains the data used to test and evaluate tools for references extraction from papers in...
International audienceThe Semeval task 5 was an opportunity for experimenting with the key term ex- ...
https://arl.org/Lists/SPARC-OAForum/Message/5806.html We have been working on extracting references ...
These two data file contains information on patent citations for USPTO utility patents granted betwe...
The code accompanying our new dataset of software mentions in biomedical papers (dataset, preprint)....
Software and data have become major components of modern research, which is also reflected by an inc...
Softcite software mention extraction from the CORD-19 publications This dataset is the first resul...
The Softcite dataset is a gold-standard dataset of software mentions in research publications, a fre...
In this paper, we investigate progress toward improved software citation by examining current softwa...
This dataset consists of two JSON files containing: - software-contexts.json: 49195 sentences from ...
In an effort to automate the process of identifying and analyzing the use of software in biomedical ...
International audienceScientific papers potentially offer a wealth of information that allows one to...
Appropriate citation of software plays an important role in academic publications to make research r...
Περιέχει το πλήρες κείμενοBased on state of the art machine learning techniques, GROBID (GeneRation ...
Update: As of March 27, 2020 we have now analyzed 31,527 distinct sources (articles and preprints) f...
This work contains the data used to test and evaluate tools for references extraction from papers in...
International audienceThe Semeval task 5 was an opportunity for experimenting with the key term ex- ...
https://arl.org/Lists/SPARC-OAForum/Message/5806.html We have been working on extracting references ...
These two data file contains information on patent citations for USPTO utility patents granted betwe...
The code accompanying our new dataset of software mentions in biomedical papers (dataset, preprint)....
Software and data have become major components of modern research, which is also reflected by an inc...