Motivation: Feature selection, identifying a subset of variables that are relevant for predicting a response, is an important and challenging component of many methods in statistics and machine learning. Feature selection is especially difficult and computationally intensive when the number of variables approaches or exceeds the number of samples, as is often the case for many genomic datasets. Results: Here, we introduce a new approach – the Bayesian Ising Approximation (BIA) – to rapidly calculate posterior probabilities for feature relevance in L2 penalized linear regression. In the regime where the regression problem is strongly regularized by the prior, we show that computing the marginal posterior probabilities for features is equiva...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
High-dimensional feature selection arises in many areas of modern sciences. For example, in genomic ...
This thesis is focused on the development of computationally efficient procedures for regression mod...
feature selection for high-dimensional linear regression via the Ising approximation with applicatio...
Identifying small subsets of features that are relevant for prediction and classification tasks is a...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
Abstract Background Many bioinformatics studies aim to identify markers, or features, that can be us...
Inspired by analysis of genomic data, the primary quest is to identify associations between studied ...
Advisors: Sanjib Basu.Committee members: Michael Geline; Balakrishna Hosmane; Alan Polansky; Duchwan...
Abstract. For many classification and regression problems, a large number of features are available ...
Support vector machines (SVM) is a popular classification method for analysis of high dimensional da...
The problem of selecting the most useful features from a great many (eg, thousands) of candidates ar...
Motivated by the problem of identifying correlations between genes or features of two related biolog...
Background: In high density arrays, the identification of relevant genes for disease classification ...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
High-dimensional feature selection arises in many areas of modern sciences. For example, in genomic ...
This thesis is focused on the development of computationally efficient procedures for regression mod...
feature selection for high-dimensional linear regression via the Ising approximation with applicatio...
Identifying small subsets of features that are relevant for prediction and classification tasks is a...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
Abstract Background Many bioinformatics studies aim to identify markers, or features, that can be us...
Inspired by analysis of genomic data, the primary quest is to identify associations between studied ...
Advisors: Sanjib Basu.Committee members: Michael Geline; Balakrishna Hosmane; Alan Polansky; Duchwan...
Abstract. For many classification and regression problems, a large number of features are available ...
Support vector machines (SVM) is a popular classification method for analysis of high dimensional da...
The problem of selecting the most useful features from a great many (eg, thousands) of candidates ar...
Motivated by the problem of identifying correlations between genes or features of two related biolog...
Background: In high density arrays, the identification of relevant genes for disease classification ...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
High-dimensional feature selection arises in many areas of modern sciences. For example, in genomic ...
This thesis is focused on the development of computationally efficient procedures for regression mod...