Datasets are critical for scientific research, playing a role in replication, reproducibility, and efficiency. Researchers have recently shown that datasets are becoming more important for science to function properly, even serving as artifacts of study themselves. However, citing datasets is not a common or standard practice in spite of recent efforts by data repositories and funding agencies. This greatly affects our ability to track their usage and importance. A potential solution to this problem is to automatically extract dataset mentions from scientific articles. In this work, we propose to achieve such extraction by using a neural network based on a BiLSTM-CRF architecture. Our method achieves F1=0.885 in social science articles rele...
A benefit of the increasingly interconnected world is the amount of information available to pull fr...
This dataset contains impact measures (metrics/indicators) for 104,769,307 scientific articles. In p...
Background Discovering suitable datasets is an important part of health research, particularly for p...
Today, full-texts of scientific articles are often stored in different locations than the used datas...
In this work, we have presented an approach for detecting references to datasets in social sciences ...
Scientific full text papers are usually stored in separate places than their underlying research dat...
Scientific full text papers are usually stored in separate places than their underlying research dat...
Despite the popularity of data-driven research in scientific fields, we are intrigued by the combine...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Over the years, scientific articles have played a vital role in disseminating scientific knowledge. ...
This work demonstrates how neural network models (NNs) can be exploited toward resolving citation li...
Recent advancements in information retrieval systems significantly rely on the context-based feature...
peer reviewedOver the last century, we observe a steady and exponentially growth of scientific publi...
Mathiak, Brigitte, Boland, Katarina. Challenges in Matching Dataset Citation Strings to Datasets in ...
The Inter-university Consortium for Political and Social Research (ICPSR) is developing a computatio...
A benefit of the increasingly interconnected world is the amount of information available to pull fr...
This dataset contains impact measures (metrics/indicators) for 104,769,307 scientific articles. In p...
Background Discovering suitable datasets is an important part of health research, particularly for p...
Today, full-texts of scientific articles are often stored in different locations than the used datas...
In this work, we have presented an approach for detecting references to datasets in social sciences ...
Scientific full text papers are usually stored in separate places than their underlying research dat...
Scientific full text papers are usually stored in separate places than their underlying research dat...
Despite the popularity of data-driven research in scientific fields, we are intrigued by the combine...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Over the years, scientific articles have played a vital role in disseminating scientific knowledge. ...
This work demonstrates how neural network models (NNs) can be exploited toward resolving citation li...
Recent advancements in information retrieval systems significantly rely on the context-based feature...
peer reviewedOver the last century, we observe a steady and exponentially growth of scientific publi...
Mathiak, Brigitte, Boland, Katarina. Challenges in Matching Dataset Citation Strings to Datasets in ...
The Inter-university Consortium for Political and Social Research (ICPSR) is developing a computatio...
A benefit of the increasingly interconnected world is the amount of information available to pull fr...
This dataset contains impact measures (metrics/indicators) for 104,769,307 scientific articles. In p...
Background Discovering suitable datasets is an important part of health research, particularly for p...