We study the performance of Machine Learning (ML) classification techniques. Leveraging the theory of large deviations, we provide the mathematical conditions for a ML classifier to exhibit error probabilities that vanish exponentially, say exp(-n I), where n is the number of informative observations available for testing (or another relevant parameter, such as the size of the target in an image) and I is the error rate. Such conditions depend on the Fenchel-Legendre transform of the cumulant-generating function of the Data-Driven Decision Function (D3F, i.e., what is thresholded before the final binary decision is made) learned in the training phase. As such, the D3F and the related error rate I depend on the given training set. The condit...
The goal of machine learning is to develop predictors that generalize well to test data. Ideally, th...
We study the interaction between input distributions, learning algo-rithms, and finite sample sizes ...
AbstractThe studies of generalization error give possible approaches to estimate the performance of ...
We study the performance of Machine Learning (ML) classification techniques. Leveraging the theory o...
We study the performance -- and specifically the rate at which the error probability converges to ze...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Statistical hypothesis testing is a method to make a decision among two or more hypotheses using mea...
Abstract. This paper reviews the appropriateness for application to large data sets of standard mach...
International audienceThis article discusses the asymptotic performance of classical machine learnin...
Ensembles of machine learning models yield improved system performance as well as robust and interpr...
We study the interaction between input distributions, learning algorithms and finite sample sizes in...
Recent improvements in machine learning methods have significantly advanced many fields in- cluding ...
The mean result of machine learning models is determined by utilizing k-fold cross-validation. The a...
Abstract. We propose a novel approach for the estimation of the size of training sets that are neede...
This thesis addresses three challenge of machine learning: high-dimensional data, label noise and li...
The goal of machine learning is to develop predictors that generalize well to test data. Ideally, th...
We study the interaction between input distributions, learning algo-rithms, and finite sample sizes ...
AbstractThe studies of generalization error give possible approaches to estimate the performance of ...
We study the performance of Machine Learning (ML) classification techniques. Leveraging the theory o...
We study the performance -- and specifically the rate at which the error probability converges to ze...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Statistical hypothesis testing is a method to make a decision among two or more hypotheses using mea...
Abstract. This paper reviews the appropriateness for application to large data sets of standard mach...
International audienceThis article discusses the asymptotic performance of classical machine learnin...
Ensembles of machine learning models yield improved system performance as well as robust and interpr...
We study the interaction between input distributions, learning algorithms and finite sample sizes in...
Recent improvements in machine learning methods have significantly advanced many fields in- cluding ...
The mean result of machine learning models is determined by utilizing k-fold cross-validation. The a...
Abstract. We propose a novel approach for the estimation of the size of training sets that are neede...
This thesis addresses three challenge of machine learning: high-dimensional data, label noise and li...
The goal of machine learning is to develop predictors that generalize well to test data. Ideally, th...
We study the interaction between input distributions, learning algo-rithms, and finite sample sizes ...
AbstractThe studies of generalization error give possible approaches to estimate the performance of ...