© 2019 Zemei XuStatistical variable selection, also known as feature selection, has become an indispensable tool in many research areas involving machine learning and data mining. The object of statistical variable selection is to select the best subset of predictors for fitting or predicting the response variable from a potentially large collection of candidate predictors. It is particularly important in high-dimensional problems such as cancer genetics, where there are potentially thousands of predictors and only a few are associated with the outcome. Moreover, predictors may have underlying group structures in them, so it is desirable to take the underlying group effects into consideration when performing variable selection and handle th...
Participants from all studies provided written informed consent and each study was approved by the r...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
The main goal of this Thesis is to describe numerous statistical techniques that deal with high-dime...
We propose two multivariate extensions of the Bayesian group lasso for variable selection and estima...
Abstract. Grouping structures arise naturally in many statistical modeling problems. Several methods...
Feature selection is demanded in many modern scientific research problems that use high-dimensional ...
Clinical research often focuses on complex traits in which many variables play a role in mechanisms ...
Variable selection methods are powerful tools in analysis of high dimensional massive data. In bioin...
Building a risk prediction model for a specific subgroup of patients based on high-dimensional molec...
Feature selection is demanded in many modern scientific research problems that use high-dimensional ...
Summary: Penalized model-based clustering has been proposed for high-dimensional but small sample-si...
Group structures arise naturally in a variety of modern data applications and statistical problems i...
For regression problems with grouped covariates, we adapt the idea of sparse group lasso (SGL) [10] ...
Modern big data analytics often involve large data sets in which the features of interest are measur...
Background: The dimension and complexity of high-throughput gene expression data create many challen...
Participants from all studies provided written informed consent and each study was approved by the r...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
The main goal of this Thesis is to describe numerous statistical techniques that deal with high-dime...
We propose two multivariate extensions of the Bayesian group lasso for variable selection and estima...
Abstract. Grouping structures arise naturally in many statistical modeling problems. Several methods...
Feature selection is demanded in many modern scientific research problems that use high-dimensional ...
Clinical research often focuses on complex traits in which many variables play a role in mechanisms ...
Variable selection methods are powerful tools in analysis of high dimensional massive data. In bioin...
Building a risk prediction model for a specific subgroup of patients based on high-dimensional molec...
Feature selection is demanded in many modern scientific research problems that use high-dimensional ...
Summary: Penalized model-based clustering has been proposed for high-dimensional but small sample-si...
Group structures arise naturally in a variety of modern data applications and statistical problems i...
For regression problems with grouped covariates, we adapt the idea of sparse group lasso (SGL) [10] ...
Modern big data analytics often involve large data sets in which the features of interest are measur...
Background: The dimension and complexity of high-throughput gene expression data create many challen...
Participants from all studies provided written informed consent and each study was approved by the r...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
The main goal of this Thesis is to describe numerous statistical techniques that deal with high-dime...