Manually indexing documents for subject-based access is a labour-intensive process. We propose using metadata gathered from bibliographic databases to train algorithms that assist librarians in that work. We have developed Annif, an open source tool and microservice for automated subject indexing. After training it with a subject vocabulary and existing metadata, Annif can be used to assign subject headings for new documents. We have tested Annif with different document collections including scientific papers, old scanned books and contemporary e-books, Q&A pairs from an “ask a librarian” service, Finnish Wikipedia, and the archives of a local newspaper. The results of analysing scientific papers and current books have been reassuring, whil...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
With the explosive growth in the number of electronic documents available on the internet, intranets...
Manuscript submitted for review.Manually indexing documents for subject-based access is a labour-int...
Manuscript accepted on 23 June 2021 for publication in JLIS.itManually indexing documents for subjec...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
In order to improve search for information by people, it is important to have a good idea to what de...
In order to improve search for information by people, it is important to have a good idea to what de...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
Academic libraries are still working based on the traditional documentation techniques of cataloguin...
The Web empowered the authors of grey literature to publish their work on their own. In case of self...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Manual subject indexing in libraries is a time-consuming and costly process and the quality of the a...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
With the explosive growth in the number of electronic documents available on the internet, intranets...
Manuscript submitted for review.Manually indexing documents for subject-based access is a labour-int...
Manuscript accepted on 23 June 2021 for publication in JLIS.itManually indexing documents for subjec...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
In order to improve search for information by people, it is important to have a good idea to what de...
In order to improve search for information by people, it is important to have a good idea to what de...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
Academic libraries are still working based on the traditional documentation techniques of cataloguin...
The Web empowered the authors of grey literature to publish their work on their own. In case of self...
Subject indexing, i.e., the enrichment of metadata records for textual resources with descriptors fr...
Manual subject indexing in libraries is a time-consuming and costly process and the quality of the a...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
Purpose: comparing and examining the quality of the results of tagging, intellectual and automated i...
With the explosive growth in the number of electronic documents available on the internet, intranets...