ArXiv Subjects:Statistics Theory (math.ST)International audienceHyperparameters tuning and model selection are important steps in machine learning. Unfortunately, classical hyperparameter calibration and model selection procedures are sensitive to outliers and heavy-tailed data. In this work, we construct a selection procedure which can be seen as a robust alternative to cross-validation and is based on a median-of-means principle. Using this procedure, we also build an ensemble method which, trained with algorithms and corrupted heavy-tailed data, selects an algorithm, trains it with a large uncorrupted subsample and automatically tune its hyperparameters. The construction relies on a divide-and-conquer methodology, making this method easi...
International audienceHyperparameter learning has traditionally been a manual task because of the li...
Machine learning models can learn to recognize subtle patterns in complex data, making them useful i...
The bootstrap is a widely used procedure for statistical inference because of its simplicity and att...
ArXiv Subjects:Statistics Theory (math.ST)International audienceHyperparameters tuning and model sel...
The combined algorithm selection and hyperparameter tuning (CASH) problem is characterized by large ...
The development of advanced hyperparameter optimization algorithms, using e.g. Bayesian optimization...
Hyperparameters in machine learning (ML) have received a fair amount of attention, and hyperparamete...
42 pagesThis project studies methods of using data subsampling to perform model selection. Most comm...
Many different machine learning algorithms exist; taking into account each algorithm’s hyperparamete...
Ensemble machine learning methods are often used when the true prediction function is not easily app...
Many different machine learning algorithms exist; taking into account each algorithm's set of hyperp...
Hyperparameters in machine learning (ML) have received a fair amount of attention, and hyperparamete...
Hyperparameters play a crucial role in the model selection of machine learning algorithms. Tuning th...
The robust lasso-type regularized regression is a useful tool for simultaneous estimation and variab...
This work analyzes the effects on support recovery for different choices of the hyper- or regulariza...
International audienceHyperparameter learning has traditionally been a manual task because of the li...
Machine learning models can learn to recognize subtle patterns in complex data, making them useful i...
The bootstrap is a widely used procedure for statistical inference because of its simplicity and att...
ArXiv Subjects:Statistics Theory (math.ST)International audienceHyperparameters tuning and model sel...
The combined algorithm selection and hyperparameter tuning (CASH) problem is characterized by large ...
The development of advanced hyperparameter optimization algorithms, using e.g. Bayesian optimization...
Hyperparameters in machine learning (ML) have received a fair amount of attention, and hyperparamete...
42 pagesThis project studies methods of using data subsampling to perform model selection. Most comm...
Many different machine learning algorithms exist; taking into account each algorithm’s hyperparamete...
Ensemble machine learning methods are often used when the true prediction function is not easily app...
Many different machine learning algorithms exist; taking into account each algorithm's set of hyperp...
Hyperparameters in machine learning (ML) have received a fair amount of attention, and hyperparamete...
Hyperparameters play a crucial role in the model selection of machine learning algorithms. Tuning th...
The robust lasso-type regularized regression is a useful tool for simultaneous estimation and variab...
This work analyzes the effects on support recovery for different choices of the hyper- or regulariza...
International audienceHyperparameter learning has traditionally been a manual task because of the li...
Machine learning models can learn to recognize subtle patterns in complex data, making them useful i...
The bootstrap is a widely used procedure for statistical inference because of its simplicity and att...