We consider the problem of choosing between several models in least-squares regression with heteroscedastic data. We prove that any penalization procedure is suboptimal when the penalty is a function of the dimension of the model, at least for some typical heteroscedastic model selection problems. In particular, Mallows' Cp is suboptimal in this framework. On the contrary, optimal model selection is possible with data-driven penalties such as resampling or $V$-fold penalties. Therefore, it is worth estimating the shape of the penalty from data, even at the price of a higher computational cost. Simulation experiments illustrate the existence of a trade-off between statistical accuracy and computational complexity. As a conclusion, we sketch ...
Abstract. Performance bounds for criteria for model selection are devel-oped using recent theory for...
Abstract This paper is mainly devoted to a precise analysis of what kind of penalties should be used...
We consider the problem of model (or variable) selection in the classical regression model using the...
We consider the estimation of a regression function with random design and heteroscedastic noise in ...
International audiencePenalization procedures often suffer from their dependence on multiplying fact...
extended version of http://hal.archives-ouvertes.fr/hal-00125455, with a technical appendixInternati...
International audienceWe build penalized least-squares estimators using the slope heuristic and resa...
58p.We consider the estimation of a bounded regression function with nonparametric heteroscedastic n...
We study model selection strategies based on penalized empirical loss minimization. We point out a...
We investigate the structure of model selection problems via the bias/variance decomposition. In par...
In this paper we investigate penalized least squares methods in linear regression models with heter...
International audienceWe investigate the optimality for model selection of the so-called slope heuri...
In the context of the high-dimensional Gaussian linear regression for ordered variables, we study th...
Cette thèse s'inscrit dans les domaines de la statistique non-asymptotique et de la théorie statisti...
This paper deals with variable selection in the regression or binary classification frameworks. It p...
Abstract. Performance bounds for criteria for model selection are devel-oped using recent theory for...
Abstract This paper is mainly devoted to a precise analysis of what kind of penalties should be used...
We consider the problem of model (or variable) selection in the classical regression model using the...
We consider the estimation of a regression function with random design and heteroscedastic noise in ...
International audiencePenalization procedures often suffer from their dependence on multiplying fact...
extended version of http://hal.archives-ouvertes.fr/hal-00125455, with a technical appendixInternati...
International audienceWe build penalized least-squares estimators using the slope heuristic and resa...
58p.We consider the estimation of a bounded regression function with nonparametric heteroscedastic n...
We study model selection strategies based on penalized empirical loss minimization. We point out a...
We investigate the structure of model selection problems via the bias/variance decomposition. In par...
In this paper we investigate penalized least squares methods in linear regression models with heter...
International audienceWe investigate the optimality for model selection of the so-called slope heuri...
In the context of the high-dimensional Gaussian linear regression for ordered variables, we study th...
Cette thèse s'inscrit dans les domaines de la statistique non-asymptotique et de la théorie statisti...
This paper deals with variable selection in the regression or binary classification frameworks. It p...
Abstract. Performance bounds for criteria for model selection are devel-oped using recent theory for...
Abstract This paper is mainly devoted to a precise analysis of what kind of penalties should be used...
We consider the problem of model (or variable) selection in the classical regression model using the...