<p>With now well-recognized nonnegligible model selection uncertainty, data analysts should no longer be satisfied with the output of a single final model from a model selection process, regardless of its sophistication. To improve reliability and reproducibility in model choice, one constructive approach is to make good use of a sound variable importance measure. Although interesting importance measures are available and increasingly used in data analysis, little theoretical justification has been done. In this article, we propose a new variable importance measure, sparsity oriented importance learning (SOIL), for high-dimensional regression from a sparse linear modeling perspective by taking into account the variable selection uncertainty...
Feature selection in high-dimensional data sets is an open problem with no universal satisfactory me...
From the perspective of econometrics, an accurate variable selection method greatly enhances the rel...
In this thesis we discuss machine learning methods performing automated variable selection for learn...
A situation where training and test samples follow different input distributions is called covariate...
Many statistical problems involve the learning of an importance/effect of a variable for predicting ...
In a regression setting, it is often of interest to quantify the importance of various features in p...
Thesis (Ph.D.)--University of Washington, 2019Assessing the relative contribution of subsets of feat...
Variable importance graphs are great tool to see, in a model, which variables are interesting. Since...
In many problem settings, parameter vectors are not merely sparse, but depen-dent in such a way that...
International audienceAssessing the uncertainty pertaining to the conclusions derived from experimen...
\ud Motivated by the recent trend in ``Big data", we are interested in the case where both $p$, the ...
Random Forests variable importance measures are often used to rank variables by their relevance to a...
The abundance of available digital big data has created new challenges in identifying relevant varia...
The focus is on treating the relationship between a dependent variable $y$ and a $p$-dimensional cov...
International audienceWe consider different approaches for assessing variable importance in clusteri...
Feature selection in high-dimensional data sets is an open problem with no universal satisfactory me...
From the perspective of econometrics, an accurate variable selection method greatly enhances the rel...
In this thesis we discuss machine learning methods performing automated variable selection for learn...
A situation where training and test samples follow different input distributions is called covariate...
Many statistical problems involve the learning of an importance/effect of a variable for predicting ...
In a regression setting, it is often of interest to quantify the importance of various features in p...
Thesis (Ph.D.)--University of Washington, 2019Assessing the relative contribution of subsets of feat...
Variable importance graphs are great tool to see, in a model, which variables are interesting. Since...
In many problem settings, parameter vectors are not merely sparse, but depen-dent in such a way that...
International audienceAssessing the uncertainty pertaining to the conclusions derived from experimen...
\ud Motivated by the recent trend in ``Big data", we are interested in the case where both $p$, the ...
Random Forests variable importance measures are often used to rank variables by their relevance to a...
The abundance of available digital big data has created new challenges in identifying relevant varia...
The focus is on treating the relationship between a dependent variable $y$ and a $p$-dimensional cov...
International audienceWe consider different approaches for assessing variable importance in clusteri...
Feature selection in high-dimensional data sets is an open problem with no universal satisfactory me...
From the perspective of econometrics, an accurate variable selection method greatly enhances the rel...
In this thesis we discuss machine learning methods performing automated variable selection for learn...