Abstract. For many classification and regression problems, a large number of features are available for possible use — this is typical of DNA microarray data on gene expression, for example. Often, for computational or other reasons, only a small subset of these features are selected for use in a model, based on some simple measure such as correlation with the response variable. This procedure may introduce an optimistic bias, however, in which the response variable appears to be more predictable than it actually is, because the high correlation of the selected features with the response may be partly or wholly due to chance. We show how this bias can be avoided when using a Bayesian model for the joint distribution of features and response...
Motivated by the problem of identifying correlations between genes or features of two related biolog...
BAYDA is a software package for flexible data analysis in predictive data mining tasks. The mathemat...
The role of feature selection is crucial in many applications. A few of these include computational ...
Description This software is used to predict the binary response based on high dimensional features,...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
Abstract Background Many bioinformatics studies aim to identify markers, or features, that can be us...
The problem of feature selection is critical in several areas of machine learning and data analysis ...
Motivation: Feature selection, identifying a subset of variables that are relevant for predicting a ...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
Background: In high density arrays, the identification of relevant genes for disease classification ...
There has been ever increasing interest in the use of microarray experiments as a basis for the prov...
Advisors: Sanjib Basu.Committee members: Michael Geline; Balakrishna Hosmane; Alan Polansky; Duchwan...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
<p>Bayesian variable selection often assumes normality, but the effects of model misspecification ar...
Machine learning models are difficult to employ in biology-related research. On the one hand, the av...
Motivated by the problem of identifying correlations between genes or features of two related biolog...
BAYDA is a software package for flexible data analysis in predictive data mining tasks. The mathemat...
The role of feature selection is crucial in many applications. A few of these include computational ...
Description This software is used to predict the binary response based on high dimensional features,...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
Abstract Background Many bioinformatics studies aim to identify markers, or features, that can be us...
The problem of feature selection is critical in several areas of machine learning and data analysis ...
Motivation: Feature selection, identifying a subset of variables that are relevant for predicting a ...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
Background: In high density arrays, the identification of relevant genes for disease classification ...
There has been ever increasing interest in the use of microarray experiments as a basis for the prov...
Advisors: Sanjib Basu.Committee members: Michael Geline; Balakrishna Hosmane; Alan Polansky; Duchwan...
In what follows, we introduce two Bayesian models for feature selection in high-dimensional data, sp...
<p>Bayesian variable selection often assumes normality, but the effects of model misspecification ar...
Machine learning models are difficult to employ in biology-related research. On the one hand, the av...
Motivated by the problem of identifying correlations between genes or features of two related biolog...
BAYDA is a software package for flexible data analysis in predictive data mining tasks. The mathemat...
The role of feature selection is crucial in many applications. A few of these include computational ...