Accurately parsing citation strings is key to automatically building large-scale citation graphs, so a robust citation parser is an essential module in academic search engines. One limitation of the state-of-the-art models (such as ParsCit and Neural-ParsCit) is the lack of a large-scale training corpus. Manually annotating hundreds of thousands of citation strings is laborious and time-consuming. This thesis presents a novel transformer-based citation parser by leveraging the GIANT dataset, consisting of 1 billion synthesized citation strings covering over 1500 citation styles. As opposed to handcrafted features, our model benefits from word embeddings and character-based embeddings by combining the bidirectional long shortterm memory (BiL...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Technology-assisted review (TAR) refers to iterative active learning workflows for document review i...
Acknowledging the importance of citations in scientific literature, in this work we present MinScIE...
Extracting and parsing reference strings from research articles is a challenging task. State-of-the-...
Citation sentences (sentences that cite other papers) play a key role in the summarization of scient...
Predicting the number of citations of scholarly documents is an upcoming task in scholarly document ...
Citation classification aims to identify the purpose of the cited article in the citing article. Pre...
Citations are an important part of scientific papers, and the proper handling of them is indispensab...
We consider the task of reference mining: the detection, extraction and classification of references...
We investigate the effect of varying citation context window sizes on model performance in citation ...
Title from PDF of title page viewed June 14, 2021Thesis advisor: Yugyung LeeVitaIncludes bibliograph...
Research publications reflect advancements in the corresponding research domain. In these research p...
The increased pressure of publications makes it more and more difficult for researchers to find appr...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Transformer-based models have been utilized in natural language processing (NLP) for a wide variety ...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Technology-assisted review (TAR) refers to iterative active learning workflows for document review i...
Acknowledging the importance of citations in scientific literature, in this work we present MinScIE...
Extracting and parsing reference strings from research articles is a challenging task. State-of-the-...
Citation sentences (sentences that cite other papers) play a key role in the summarization of scient...
Predicting the number of citations of scholarly documents is an upcoming task in scholarly document ...
Citation classification aims to identify the purpose of the cited article in the citing article. Pre...
Citations are an important part of scientific papers, and the proper handling of them is indispensab...
We consider the task of reference mining: the detection, extraction and classification of references...
We investigate the effect of varying citation context window sizes on model performance in citation ...
Title from PDF of title page viewed June 14, 2021Thesis advisor: Yugyung LeeVitaIncludes bibliograph...
Research publications reflect advancements in the corresponding research domain. In these research p...
The increased pressure of publications makes it more and more difficult for researchers to find appr...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Transformer-based models have been utilized in natural language processing (NLP) for a wide variety ...
Information retrieval systems for scholarly literature rely heavily not only on text matching but on...
Technology-assisted review (TAR) refers to iterative active learning workflows for document review i...
Acknowledging the importance of citations in scientific literature, in this work we present MinScIE...