International audienceUnbiased assessment of the predictivity of models learnt by supervised machine-learning methods requires knowledge of the learned function over a reserved test set (not used by the learning algorithm). The quality of the assessment depends, naturally, on the properties of the test set and on the error statistic used to estimate the prediction error. In this work we tackle both issues, proposing a new predictivity criterion that carefully weights the individual observed errors to obtain a global error estimate, and using incremental experimental design methods to "optimally" select the test points on which the criterion is computed. Several incremental constructions are studied, including greedy-packing (coffee-house de...
The educational sector faced many types of research in predicting student performance based on super...
In this paper a forecasting model selection scheme is considered which amounts to testing the predic...
Abstract. The use of data analysis competitions for selecting the most appropriate model for a probl...
International audienceUnbiased assessment of the predictivity of models learnt by supervised machine...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
<p>We show the prediction accuracy (that is, the fraction of correct rating predictions) as a functi...
In this thesis, we propose new model evaluation strategies for supervised machine learning. Our main...
The thesis discuses reliability estimation of individual predictions in the supervised learning fram...
Modern data science tools are effective to produce predictions that strongly correlate with response...
Existing criteria for evaluating the adequacy of a predictive model are model-based (e.g. AIC, BIC,...
Predictive accuracy of a model is of key importance in research and to a lay audience. Diverse model...
The educational sector faced many types of research in predicting student performance based on super...
In this paper a forecasting model selection scheme is considered which amounts to testing the predic...
Abstract. The use of data analysis competitions for selecting the most appropriate model for a probl...
International audienceUnbiased assessment of the predictivity of models learnt by supervised machine...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
Comparative ex-ante prediction experiments over expanding subsamples are a popular tool for the task...
<p>We show the prediction accuracy (that is, the fraction of correct rating predictions) as a functi...
In this thesis, we propose new model evaluation strategies for supervised machine learning. Our main...
The thesis discuses reliability estimation of individual predictions in the supervised learning fram...
Modern data science tools are effective to produce predictions that strongly correlate with response...
Existing criteria for evaluating the adequacy of a predictive model are model-based (e.g. AIC, BIC,...
Predictive accuracy of a model is of key importance in research and to a lay audience. Diverse model...
The educational sector faced many types of research in predicting student performance based on super...
In this paper a forecasting model selection scheme is considered which amounts to testing the predic...
Abstract. The use of data analysis competitions for selecting the most appropriate model for a probl...