Manuscript submitted for review.Manually indexing documents for subject-based access is a labour-intensive process. We propose using metadata gathered from bibliographic databases to train algorithms that assist librarians in that work. We have developed Annif, an open source tool and microservice for automated subject indexing. After training it with a subject vocabulary and existing metadata, Annif can be used to assign subject headings for new documents. We have tested Annif with different document collections including scientific papers, old scanned books and contemporary e-books, Q&A pairs from an “ask a librarian” service, Finnish Wikipedia, and the archives of a local newspaper. The results of analysing scientific papers and current ...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
Manually indexing documents for subject-based access is a labour-intensive process. We propose using...
Manuscript accepted on 23 June 2021 for publication in JLIS.itManually indexing documents for subjec...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
In order to improve search for information by people, it is important to have a good idea to what de...
In order to improve search for information by people, it is important to have a good idea to what de...
The Web empowered the authors of grey literature to publish their work on their own. In case of self...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
Manually indexing documents for subject-based access is a labour-intensive process. We propose using...
Manuscript accepted on 23 June 2021 for publication in JLIS.itManually indexing documents for subjec...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
In order to improve search for information by people, it is important to have a good idea to what de...
In order to improve search for information by people, it is important to have a good idea to what de...
The Web empowered the authors of grey literature to publish their work on their own. In case of self...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...