Supervised and unsupervised classification are common topics in machine learning in both scientific and industrial fields, which usually involve three tasks: prediction, exploration, and explanation. False discovery rate (FDR) theory has a close connection to classical classification theory, which must be employed in a sophisticated way to achieve good performance in various contexts. The study aims to explore novel supervised classifiers and unsupervised classification approaches for functional data and high-dimensional data in genome study by using FDR, respectively. One work develops a novel classifier for functional data by casting the classification problem into a multiple testing task, which involves using statistical depth functions....
High-dimensional hypothesis testing is ubiquitous in the biomedical sciences, and informative covari...
In genome-wide studies, hundreds of thousands of hypothesis tests are performed simultaneously. Bonf...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
The technical advancements in genomics, functional magnetic-resonance and other areas of scientific ...
High-throughput screening has become an important mainstay for contemporary biomedical research. A s...
Motivation: The false discovery rate (fdr) is a key tool for statistical assessment of differential ...
The development of high-throughput biological technologies have enabled researchers to simultaneousl...
This thesis focuses on analyzing the type of data returned by two pieces of technology, the older an...
DNA microarray technologies allow us to monitor expression levels of thousands of genes simultaneous...
A large-scale multiple testing problem simultaneously tests thousands or even millions of null hypot...
Motivation Presently available methods that use p-values to estimate or control the false discovery ...
Controlling the false discovery rate (FDR) is a powerful approach to deal with a large number of hyp...
False Discover Rate (FDR) method provides more powerful multiple hypothesis testing criteria than th...
Advances in microarray technology have equipped researchers to measure gene expression levels simult...
This research focuses on using statistical learning methods on high-dimensional biological data anal...
High-dimensional hypothesis testing is ubiquitous in the biomedical sciences, and informative covari...
In genome-wide studies, hundreds of thousands of hypothesis tests are performed simultaneously. Bonf...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
The technical advancements in genomics, functional magnetic-resonance and other areas of scientific ...
High-throughput screening has become an important mainstay for contemporary biomedical research. A s...
Motivation: The false discovery rate (fdr) is a key tool for statistical assessment of differential ...
The development of high-throughput biological technologies have enabled researchers to simultaneousl...
This thesis focuses on analyzing the type of data returned by two pieces of technology, the older an...
DNA microarray technologies allow us to monitor expression levels of thousands of genes simultaneous...
A large-scale multiple testing problem simultaneously tests thousands or even millions of null hypot...
Motivation Presently available methods that use p-values to estimate or control the false discovery ...
Controlling the false discovery rate (FDR) is a powerful approach to deal with a large number of hyp...
False Discover Rate (FDR) method provides more powerful multiple hypothesis testing criteria than th...
Advances in microarray technology have equipped researchers to measure gene expression levels simult...
This research focuses on using statistical learning methods on high-dimensional biological data anal...
High-dimensional hypothesis testing is ubiquitous in the biomedical sciences, and informative covari...
In genome-wide studies, hundreds of thousands of hypothesis tests are performed simultaneously. Bonf...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...