We examine the novel task of domain-independent scientific concept extraction from abstracts of scholarly articles and present two contributions. First, we suggest a set of generic scientific concepts that have been identified in a systematic annotation process. This set of concepts is used to annotate, at the phrasal level and in a joint effort with domain experts, a corpus of scientific abstracts from 10 domains of Science, Technology and Medicine. The resulting dataset is used in a set of benchmark experiments to (a) provide baseline performance for this task and (b) examine the transferability of concepts between domains. Second, we present a state-of-the-art deep learning baseline. Further, we propose the active learning strategy for an opt...
Explicit semantic enrichment makes digital scholarly publications potentially easy to find, to navig...
A benefit of the increasingly interconnected world is the amount of information available to pull fr...
In our project about the digital library (DL) of scientific theses, we need to...
We study the problem of extracting terms from research papers, which is an important step...
Motivation: Scholarly biomedical publications report on the findings of a research investigation. Sc...
This work explores and evaluates text and graph mining methods for open domain concept and relation...
We present two complementary annotation schemes for sentence-based annotation of full scientific pap...
In the past few decades, we saw a proliferation of scientific articles available online. This data-r...
This paper describes the process of creating a corpus annotated for concepts a...
Manually analyzing large collections of research articles is a time- and resource-intensive activity...
One of the greatest challenges for search engines and other search tools, which are developed to cop...
For this dissertation two software applications were developed and three experiments were conducted ...
The continuous growth of scientific literature brings innovations and, at the same time, raises new ...
In recent years we have seen the emergence of a variety of scholarly datasets. Typically these captu...
With the large volume of unstructured data that increases continuously on the web, the motivation of...