This release introduces a setting to use only part of the input text for subject indexing: the new input_limit project parameter truncates the input text to the given number of characters. This can improve the quality of the suggestions, as the beginning of a long document typically includes an abstract and introduction. The default value for input_limit is zero, which means that no truncation is performed. Improvements include better handling of cached data in nn_ensemble training and lower memory usage in evaluation through the use of sparse matrices for suggested subjects. Many dependencies have been updated and a few minor issues fixed. New features: 446 Add a backend parameter to limit input characters in suggest 452 Apply the input_limi...
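The truncation behaviour described for input_limit can be illustrated with a minimal sketch. The function name apply_input_limit is hypothetical and is not taken from the Annif codebase; the sketch only assumes what the release note states, namely that the limit counts characters and that zero disables truncation.

```python
def apply_input_limit(text: str, input_limit: int = 0) -> str:
    """Return the text cut to at most input_limit characters.

    Hypothetical helper for illustration only; not Annif's own code.
    A limit of zero means that no truncation is performed.
    """
    if input_limit < 0:
        # Assumption: negative limits are treated as a configuration error.
        raise ValueError("input_limit must be zero or a positive integer")
    if input_limit == 0:
        return text
    return text[:input_limit]


document = "Abstract. Introduction. Then a very long body of text follows."
shortened = apply_input_limit(document, input_limit=22)
unchanged = apply_input_limit(document)
```

With a limit of 22 the sketch keeps only the opening of the document, which is where an abstract and introduction would typically sit; with the default of zero the text passes through unchanged.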
Manually indexing documents for subject-based access is a labour-intensive process. We propose using...
This patch release fixes a bug that prevented training the SVC backend on a fulltext corpus.
Manuscript accepted on 23 June 2021 for publication in JLIS.it. Manually indexing documents for subjec...
This release includes a new STWFSA backend which is a wrapper around STWFSAPY, a lexical algorithm b...
This release includes improvements in training by reducing memory usage and adds the --cached option...
This is a cleanup and bugfix release that renames some methods and commands but brings no real new f...
This release adds a new --jobs parameter for the annif train command, which allows easy control of t...
This release includes a new language filtering feature. This input-transform filters out sentences o...
This release includes a new omikuji backend to support tree-based extreme multilabel classification ...
This release includes a new MLLM backend which is a Python implementation of the Maui-like Lexical M...
This release introduces the hyperopt CLI command for hyperparameter optimization. Initially it can o...
This patch release includes the following changes: #506 Fix NN ensemble training and learning on on...
This is a patch release that fixes two bugs in the 0.43.0 release. There are also some additions to ...
This release includes a new maui backend for integrating Annif with Maui Server, a REST service wrap...
This release includes a new trainable ensemble backend based on a neural network (nn_ensemble) and s...