Predicting the number of citations of scholarly documents is an upcoming task in scholarly document processing. Besides the intrinsic merit of this information, it also has a wider use as an imperfect proxy for quality which has the advantage of being cheaply available for large volumes of scholarly documents. Previous work has dealt with number of citations prediction with relatively small training data sets, or larger datasets but with short, incomplete input text. In this work we leverage the open access ACL Anthology collection in combination with the Semantic Scholar bibliometric database to create a large corpus of scholarly documents with associated citation information and we propose a new citation prediction model called SChuBERT. ...
Accurately parsing citation strings is key to automatically building large-scale citation graphs, so...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Extracting and parsing reference strings from research articles is a challenging task. State-of-the-...
Predicting the number of citations of scholarly documents is an upcoming task in scholarly document ...
Automatic assessment of the quality of scholarly documents is a difficult task with high potential i...
Training recurrent neural networks on long texts, in particular scholarly documents, causes problems...
In most of the cases, scientists depend on previous literature which is relevant to their research f...
In this paper, we study the problem of predicting future ci-tation count of a scientific article aft...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Scientific dissemination is of central importance for the scientific process. This paper presents Ci...
In most of the cases, scientists depend on previous literature which is relevant to their research f...
The impact and significance of a scientific publication is measured mostly by the number of citation...
Accurately parsing citation strings is key to automatically building large-scale citation graphs, so...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Extracting and parsing reference strings from research articles is a challenging task. State-of-the-...
Predicting the number of citations of scholarly documents is an upcoming task in scholarly document ...
Automatic assessment of the quality of scholarly documents is a difficult task with high potential i...
Training recurrent neural networks on long texts, in particular scholarly documents, causes problems...
In most of the cases, scientists depend on previous literature which is relevant to their research f...
In this paper, we study the problem of predicting future ci-tation count of a scientific article aft...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Scientific dissemination is of central importance for the scientific process. This paper presents Ci...
In most of the cases, scientists depend on previous literature which is relevant to their research f...
The impact and significance of a scientific publication is measured mostly by the number of citation...
Accurately parsing citation strings is key to automatically building large-scale citation graphs, so...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Extracting and parsing reference strings from research articles is a challenging task. State-of-the-...