Support vector machines (SVM) is a popular classification method for analysis of high dimensional data such as genomics data. Recently, new SVM methods have been developed to achieve variable selection through either frequentist regularization or Bayesian shrinkage. The Bayesian framework provides a probabilistic interpretation for SVM and allows direct uncertainty quantification. In this dissertation, we develop four knowledge-guided SVM methods for the analysis of high dimensional data. In Chapter 1, I first review the theory of SVM and existing methods for incorporating the prior knowledge, represented bby graphs into SVM. Second, I review the terminology on variable selection and limitations of the existing methods for SVM variable sele...
<div>This record contains seven real-world test datasets used in experiments with the Bayesian SVM a...
The last decade has been characterized by an explosion of biological sequence information. When the ...
A core focus of statistics is determining how much of the variation in data may be attributed to the...
Support vector machines (SVM) is a popular classification method for analysis of high dimensional da...
Advances made in computer development along with the curiosity regarding the use of data in the worl...
"July 2014."Dissertation Co-adviser: Dr. Sounak Chakraborty.Dissertation Co-adviser: Dr. (Tony) Jian...
International audienceThis paper describes a novel method for improving classification of support vec...
Motivation: Feature selection, identifying a subset of variables that are relevant for predicting a ...
An essential aspect of medical research is the prediction for a health outcome and the scientific id...
<p>Collections of large volumes of rich and complex data has become ubiquitous in recent years, posi...
<p>In this thesis, we develop some Bayesian sparse learning methods for high dimensional data analys...
Inspired by analysis of genomic data, the primary quest is to identify associations between studied ...
Abstract Background Phenotypic classification is problematic because small samples are ubiquitous; a...
We consider the problem of variable selection in regression modeling in high dimensional spaces wher...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
<div>This record contains seven real-world test datasets used in experiments with the Bayesian SVM a...
The last decade has been characterized by an explosion of biological sequence information. When the ...
A core focus of statistics is determining how much of the variation in data may be attributed to the...
Support vector machines (SVM) is a popular classification method for analysis of high dimensional da...
Advances made in computer development along with the curiosity regarding the use of data in the worl...
"July 2014."Dissertation Co-adviser: Dr. Sounak Chakraborty.Dissertation Co-adviser: Dr. (Tony) Jian...
International audienceThis paper describes a novel method for improving classification of support vec...
Motivation: Feature selection, identifying a subset of variables that are relevant for predicting a ...
An essential aspect of medical research is the prediction for a health outcome and the scientific id...
<p>Collections of large volumes of rich and complex data has become ubiquitous in recent years, posi...
<p>In this thesis, we develop some Bayesian sparse learning methods for high dimensional data analys...
Inspired by analysis of genomic data, the primary quest is to identify associations between studied ...
Abstract Background Phenotypic classification is problematic because small samples are ubiquitous; a...
We consider the problem of variable selection in regression modeling in high dimensional spaces wher...
This thesis responds to the challenges of using a large number, such as thousands, of features in re...
<div>This record contains seven real-world test datasets used in experiments with the Bayesian SVM a...
The last decade has been characterized by an explosion of biological sequence information. When the ...
A core focus of statistics is determining how much of the variation in data may be attributed to the...