We consider a general supervised learning problem with strongly convex and Lipschitz loss and study the problem of model selection aggregation. In particular, given a finite dictionary functions (learners) together with the prior, we generalize the results obtained by Dai, Rigollet and Zhang [Ann. Statist. 40 (2012) 1878–1905] for Gaussian regression with squared loss and fixed design to this learning setup. Specifically, we prove that the Q-aggregation procedure outputs an estimator that satisfies optimal oracle inequalities both in expectation and with high probability. Our proof techniques somewhat de-part from traditional proofs by making most of the standard arguments on the Laplace transform of the empirical process to be controlled. ...
In this dissertation, we explore two fundamental sets of inference problems arising in machine learn...
It is generally believed that ensemble approaches, which combine multiple algorithms or models, can ...
Temporal-Difference off-policy algorithms are among the building blocks of reinforcement learning (R...
Given a finite family of functions, the goal of model selection aggrega-tion is to construct a proce...
We consider the problem of aggregating a general collection of affine estimators for fixed design re...
Given a finite collection of estimators or classifiers, we study the problem of model selection type...
Abstract: We consider the problem of aggregating a general collection of affine estimators for fixed...
We introduce an alternative to the notion of ‘fast rate’ in Learning Theory, which coincides with th...
We consider a general statistical linear inverse problem, where the solution is represented via a kn...
Performing Q-Learning in continuous state-action spaces is a problem still unsolved for many complex...
Abstract: Given a finite collection of estimators or classifiers, we study the problem of model sele...
We develop minimax optimal risk bounds for the general learning task consisting in predicting as wel...
Aggregated hold-out (agghoo) is a method which averages learning rules selected by hold-out (that i...
Given a collection of $M$ different estimators or classifiers, we study the problem of model selecti...
Given a dictionary of $M_n$ initial estimates of the unknown true regression function, we aim to con...
In this dissertation, we explore two fundamental sets of inference problems arising in machine learn...
It is generally believed that ensemble approaches, which combine multiple algorithms or models, can ...
Temporal-Difference off-policy algorithms are among the building blocks of reinforcement learning (R...
Given a finite family of functions, the goal of model selection aggrega-tion is to construct a proce...
We consider the problem of aggregating a general collection of affine estimators for fixed design re...
Given a finite collection of estimators or classifiers, we study the problem of model selection type...
Abstract: We consider the problem of aggregating a general collection of affine estimators for fixed...
We introduce an alternative to the notion of ‘fast rate’ in Learning Theory, which coincides with th...
We consider a general statistical linear inverse problem, where the solution is represented via a kn...
Performing Q-Learning in continuous state-action spaces is a problem still unsolved for many complex...
Abstract: Given a finite collection of estimators or classifiers, we study the problem of model sele...
We develop minimax optimal risk bounds for the general learning task consisting in predicting as wel...
Aggregated hold-out (agghoo) is a method which averages learning rules selected by hold-out (that i...
Given a collection of $M$ different estimators or classifiers, we study the problem of model selecti...
Given a dictionary of $M_n$ initial estimates of the unknown true regression function, we aim to con...
In this dissertation, we explore two fundamental sets of inference problems arising in machine learn...
It is generally believed that ensemble approaches, which combine multiple algorithms or models, can ...
Temporal-Difference off-policy algorithms are among the building blocks of reinforcement learning (R...