High-dimensional biomedical 'omic' datasets are accumulating rapidly from studies aimed at early detection and better management of human disease. These datasets pose tremendous challenges for analysis due to their large number of variables that represent measurements of biochemical molecules, such as proteins and mRNA, from bodily fluids or tissues extracted from a rather small cohort of samples. Machine learning methods have been applied to modeling these datasets including rule learning methods, which have been successful in generating models that are easily interpretable by the scientists. Rule learning methods have typically relied on a frequentist measure of certainty within IF-THEN (propositional) rules. In this dissertation, a Bayes...
One of the key goals of current cancer research is the identification of biologic molecules that all...
Massive amounts of genomic data are created for the advent of Next Generation Sequencing technologie...
The thesis is composed of three independent projects: (i) analyzing transposon-sequencing data to in...
Statistics from the National Cancer Institute indicate that 1 in 8 women will develop Breast cancer ...
High-dimensional biomedical 'omic' datasets are accumulating rapidly from studies aimed at early det...
Chronic pressure overload (PO) due to arterial hypertension can lead to structural changes within th...
dissertationRapidly evolving technologies such as chip arrays and next-generation sequencing are unc...
Roughly thirty percent of coronary artery bypass graft (CABG) patients develop atrial fibrillation (...
In this thesis, a natural probabilistic model has been used to test the quality of motif discovery p...
With large sample sizes, population-based cohorts and biobanks provide an exciting opportunity to id...
In this era of precision medicine, clinicians and researchers critically need the assistance of comp...
Master of ScienceDepartment of Computing and Information SciencesWilliam HsuBayesian networks are gr...
The global information space provided by the World Wide Web has changed dramatically the way knowled...
In the genomic setting, most data have relative small sample size (n) considering large number of co...
This dissertation discusses the use of automated natural language processing (NLP) for characterizat...
One of the key goals of current cancer research is the identification of biologic molecules that all...
Massive amounts of genomic data are created for the advent of Next Generation Sequencing technologie...
The thesis is composed of three independent projects: (i) analyzing transposon-sequencing data to in...
Statistics from the National Cancer Institute indicate that 1 in 8 women will develop Breast cancer ...
High-dimensional biomedical 'omic' datasets are accumulating rapidly from studies aimed at early det...
Chronic pressure overload (PO) due to arterial hypertension can lead to structural changes within th...
dissertationRapidly evolving technologies such as chip arrays and next-generation sequencing are unc...
Roughly thirty percent of coronary artery bypass graft (CABG) patients develop atrial fibrillation (...
In this thesis, a natural probabilistic model has been used to test the quality of motif discovery p...
With large sample sizes, population-based cohorts and biobanks provide an exciting opportunity to id...
In this era of precision medicine, clinicians and researchers critically need the assistance of comp...
Master of ScienceDepartment of Computing and Information SciencesWilliam HsuBayesian networks are gr...
The global information space provided by the World Wide Web has changed dramatically the way knowled...
In the genomic setting, most data have relative small sample size (n) considering large number of co...
This dissertation discusses the use of automated natural language processing (NLP) for characterizat...
One of the key goals of current cancer research is the identification of biologic molecules that all...
Massive amounts of genomic data are created for the advent of Next Generation Sequencing technologie...
The thesis is composed of three independent projects: (i) analyzing transposon-sequencing data to in...