Nowadays it is common to collect large volumes of data in many fields with an extensive amount of variables, but often a small or moderate number of samples. For example, in the analysis of genomic data, the number of genes can be very large, varying from tens of thousands to several millions, whereas the number of samples is several hundreds to thousands. Pharmacogenomics is an example of genomics data analysis that we are considering here. Pharmacogenomics research uses whole-genome genetic information to predict individuals\u27 drug response. Because whole-genome data are high dimensional and their relationships to drug response are complicated, we are developing a variety of nonparametric methods, including variable selection using loca...
For many large-scale datasets it is necessary to reduce dimensionality to the point where further ex...
<div><p>In genetical genomics studies, it is important to jointly analyze gene expression data and g...
It is increasingly common to have measurements from multiple platforms on the same set of samples in...
Nowadays it is common to collect large volumes of data in many fields with an extensive amount of va...
The aim of this paper is to present basic principles of common multivariate statistical approaches t...
Modern biomedical studies generate high-dimensional data, meaning that the number of variables colle...
Variable selection becomes more crucial than before, since high dimensional data are frequently seen...
[EN] Dimension reduction techniques are used to explore genomic data. Due to the large number of var...
High-dimensional data are becoming increasingly available as data collection technology advances. Ov...
Doctor of PhilosophyDepartment of StatisticsCen WuVariable selection from both the frequentist and B...
Doctor of PhilosophyDepartment of StatisticsCen WuVariable selection from both the frequentist and B...
Several statistical problems can be described as estimation problem, where the goal is to learn a se...
Advancements in information technology have enabled scientists to collect data of unprecedented size...
multicollinearity and high dimensionality problems, making it impossible to obtain stable estimates ...
The advent of new genomic technologies has resulted in production of massive data sets. The outcomes...
For many large-scale datasets it is necessary to reduce dimensionality to the point where further ex...
<div><p>In genetical genomics studies, it is important to jointly analyze gene expression data and g...
It is increasingly common to have measurements from multiple platforms on the same set of samples in...
Nowadays it is common to collect large volumes of data in many fields with an extensive amount of va...
The aim of this paper is to present basic principles of common multivariate statistical approaches t...
Modern biomedical studies generate high-dimensional data, meaning that the number of variables colle...
Variable selection becomes more crucial than before, since high dimensional data are frequently seen...
[EN] Dimension reduction techniques are used to explore genomic data. Due to the large number of var...
High-dimensional data are becoming increasingly available as data collection technology advances. Ov...
Doctor of PhilosophyDepartment of StatisticsCen WuVariable selection from both the frequentist and B...
Doctor of PhilosophyDepartment of StatisticsCen WuVariable selection from both the frequentist and B...
Several statistical problems can be described as estimation problem, where the goal is to learn a se...
Advancements in information technology have enabled scientists to collect data of unprecedented size...
multicollinearity and high dimensionality problems, making it impossible to obtain stable estimates ...
The advent of new genomic technologies has resulted in production of massive data sets. The outcomes...
For many large-scale datasets it is necessary to reduce dimensionality to the point where further ex...
<div><p>In genetical genomics studies, it is important to jointly analyze gene expression data and g...
It is increasingly common to have measurements from multiple platforms on the same set of samples in...