In this paper, we consider the task of assigning relevant labels to studies in the social science domain. Manual labelling is an expensive process and prone to human error. Various multi-label text classification machine learning approaches have been proposed to resolve this problem. We introduce a dataset obtained from the Finnish Social Science Archive and comprised of 2968 research studies’ metadata. The metadata of each study includes attributes, such as the “abstract” and the “set of labels”. We used the Bag of Words (BoW), TF-IDF term weighting and pretrained word embeddings obtained from FastText and BERT models to generate the text representations for each study’s abstract field. Our selection of multi-label classification methods i...
Document classification is a large body of search, many approaches were proposed for single label an...
© 2018 Elsevier Ltd Multi-label text categorization refers to the problem of assigning each document...
Multi-label classification is a fast-growing field of machine learning. Recent developments have sho...
In this paper, we consider the task of assigning relevant labels to studies in the social science do...
24th Irish Conference on Artificial Intelligence and Cognitive Science (AICS'16), Dublin, Ireland, 2...
International audienceMulti-label text classification consists in attributing, for each textual docu...
International audiencePre-trained language models have proven to be effective in multi-class text cl...
Multi-label classification is a generalization of a broader concept of multi-class classification in...
Multi-label text categorization is a crucial task in Natural Language Processing, where each text in...
The classification of scientific articles aligned to Sustainable Development Goals is crucial for re...
We created and analyzed a text classification dataset from freely-available web documents from the U...
We created and analyzed a text classification dataset from freely-available web documents from the U...
Machine learning approaches to multi-label document classification have to date largely relied on di...
Abstract Every year, around 28,100 journals publish 2.5 million research publications. Search engine...
Document classification is a large body of search, many approaches were proposed for single label an...
Document classification is a large body of search, many approaches were proposed for single label an...
© 2018 Elsevier Ltd Multi-label text categorization refers to the problem of assigning each document...
Multi-label classification is a fast-growing field of machine learning. Recent developments have sho...
In this paper, we consider the task of assigning relevant labels to studies in the social science do...
24th Irish Conference on Artificial Intelligence and Cognitive Science (AICS'16), Dublin, Ireland, 2...
International audienceMulti-label text classification consists in attributing, for each textual docu...
International audiencePre-trained language models have proven to be effective in multi-class text cl...
Multi-label classification is a generalization of a broader concept of multi-class classification in...
Multi-label text categorization is a crucial task in Natural Language Processing, where each text in...
The classification of scientific articles aligned to Sustainable Development Goals is crucial for re...
We created and analyzed a text classification dataset from freely-available web documents from the U...
We created and analyzed a text classification dataset from freely-available web documents from the U...
Machine learning approaches to multi-label document classification have to date largely relied on di...
Abstract Every year, around 28,100 journals publish 2.5 million research publications. Search engine...
Document classification is a large body of search, many approaches were proposed for single label an...
Document classification is a large body of search, many approaches were proposed for single label an...
© 2018 Elsevier Ltd Multi-label text categorization refers to the problem of assigning each document...
Multi-label classification is a fast-growing field of machine learning. Recent developments have sho...