With the prevalence of high dimensional data, variable selection is crucial in many real applications. Although various methods have been investigated in the past decades, challenges still remain when tens of thousands of predictor variables are available for modeling. One difficulty arises from the spurious correlation, referring to the phenomenon that the sample correlation between two variables can be large when the dimension is relatively high even if they are independent. While many classical variable selection methods choose a variable based upon its marginal correlation with the response, the existence of spurious correlation may result in a high false discovery rate. On the other hand, when important variables are highly correlated,...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
Modern applications of statistical approaches involve high-dimensional complex data, where variable ...
This paper is concerned with variable selection in linear high-dimensional framework when the set of...
Analysis of interactions between variables in a large data set has recently attracted special attent...
This paper is concerned with variable selection in linear high-dimensional framework when the set of...
Modern applications of statistical approaches involve high-dimensional complex data, where variable ...
This paper is concerned with variable selection in linear high-dimensional framework when the set of...
In the first part of this thesis, we address the question of how new testing methods can be develope...
Over the last two decades, many exciting variable selection methods have been developed for finding ...
Over the last two decades, many exciting variable selection methods have been developed for finding ...
Advancements in information technology have enabled scientists to collect data of unprecedented size...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
Doctor of PhilosophyDepartment of StatisticsHaiyan WangThe advance in technologies has enabled many ...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
Modern applications of statistical approaches involve high-dimensional complex data, where variable ...
This paper is concerned with variable selection in linear high-dimensional framework when the set of...
Analysis of interactions between variables in a large data set has recently attracted special attent...
This paper is concerned with variable selection in linear high-dimensional framework when the set of...
Modern applications of statistical approaches involve high-dimensional complex data, where variable ...
This paper is concerned with variable selection in linear high-dimensional framework when the set of...
In the first part of this thesis, we address the question of how new testing methods can be develope...
Over the last two decades, many exciting variable selection methods have been developed for finding ...
Over the last two decades, many exciting variable selection methods have been developed for finding ...
Advancements in information technology have enabled scientists to collect data of unprecedented size...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
Doctor of PhilosophyDepartment of StatisticsHaiyan WangThe advance in technologies has enabled many ...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...
International audienceThe analysis of high throughput data has renewed the statistical methodology f...