Abstract Background Supervised learning methods need annotated data in order to generate efficient models. Annotated data, however, is a relatively scarce resource and can be expensive to obtain. For both passive and active learning methods, there is a need to estimate the size of the annotated sample required to reach a performance target. Methods We designed and implemented a method that fits an inverse power law model to points of a given learning curve created using a small annotated training set. Fitting is carried out using nonlinear weighted least squares optimization. The fitted model is then used to predict the classifier's performance and confidence interval for larger sample sizes. For evaluation, the nonlinear weighted curve fit...
Learning models used for prediction purposes are mostly developed without paying much cognizance to ...
<div><p>Clinical trials increasingly employ medical imaging data in conjunction with supervised clas...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
Background: Supervised learning methods need annotated data in order to generate efficient models. A...
The main objective of this paper is to investigate the relationship between the size of training sam...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
The research presented here focuses on modeling machine-learning performance. The thesis introduces ...
Dataset size is considered a major concern in the medical domain, where lack of data is a common occ...
Supervised learning is a machine learning technique used for creating a data prediction model. This ...
In high-dimensional prediction settings, it remains challenging to reliably estimate the test perfor...
In biospectroscopy, suitably annotated and statistically independent samples (e. g. patients, batche...
Currently for small-scale machine learning projects, there is no limit which has been set by its res...
This paper addresses the estimation of a small gallery size that can generate the optimal error esti...
Much research has been conducted in the area of machine learning algorithms; however, the question o...
This paper addresses one of the fundamental problems en-countered in performance prediction for obje...
Learning models used for prediction purposes are mostly developed without paying much cognizance to ...
<div><p>Clinical trials increasingly employ medical imaging data in conjunction with supervised clas...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
Background: Supervised learning methods need annotated data in order to generate efficient models. A...
The main objective of this paper is to investigate the relationship between the size of training sam...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
The research presented here focuses on modeling machine-learning performance. The thesis introduces ...
Dataset size is considered a major concern in the medical domain, where lack of data is a common occ...
Supervised learning is a machine learning technique used for creating a data prediction model. This ...
In high-dimensional prediction settings, it remains challenging to reliably estimate the test perfor...
In biospectroscopy, suitably annotated and statistically independent samples (e. g. patients, batche...
Currently for small-scale machine learning projects, there is no limit which has been set by its res...
This paper addresses the estimation of a small gallery size that can generate the optimal error esti...
Much research has been conducted in the area of machine learning algorithms; however, the question o...
This paper addresses one of the fundamental problems en-countered in performance prediction for obje...
Learning models used for prediction purposes are mostly developed without paying much cognizance to ...
<div><p>Clinical trials increasingly employ medical imaging data in conjunction with supervised clas...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...