Modern data analysis frequently involves multiple large and diverse data sets generated from current high-throughput technologies. An integrative analysis of these sources of information is very promising for improving knowledge discovery in various fields. This dissertation focuses on three distinct challenges in the integration of information. The variables obtained from diverse and novel platforms often have highly non-Gaussian marginal distributions and therefore are challenging to analyze by commonly used methods. The first part introduces an automatic transformation for improving data quality before integrating multiple data sources. For each variable, a new family of parametrizations of the shifted logarithm transformation is propose...
Data integration is the process of extracting information from multiple sources and analyzing differ...
A vast amount of the statistical literature deals with a single sample coming from a distribution wh...
Technological advancements and global data sharing allow for the collection of information from mult...
My thesis is about developing statistical methods by integrating disparate data sources with real da...
This dissertation is centered on the modeling of heterogeneous data which is ubiquitous in this digi...
Integrative analysis is of great interest in modern scientific research. This dissertation mainly fo...
Modern data collection in bioinformatics and other big-data paradigms often incorporates traits deri...
A fundamental aspect of statistics is the integration of data from different sources. Classically, F...
University of Minnesota Ph.D. dissertation. July 2018. Major: Biostatistics. Advisor: Eric Lock. 1 c...
This research is focused on high dimensional data integration by combing test statistics or informa...
Research in several fields now requires the analysis of datasets in which multiple high-dimensional ...
Research in genomics and related fields now often requires the analysis of emph{multi-block} data, i...
Driven by the growth of the Internet, online applications, and data sharing initiatives, available s...
In this dissertation, I develop statistical methods to address three important scientific problems. ...
This article describes an optimal method (conflation) to consolidate data from different experiments...
Data integration is the process of extracting information from multiple sources and analyzing differ...
A vast amount of the statistical literature deals with a single sample coming from a distribution wh...
Technological advancements and global data sharing allow for the collection of information from mult...
My thesis is about developing statistical methods by integrating disparate data sources with real da...
This dissertation is centered on the modeling of heterogeneous data which is ubiquitous in this digi...
Integrative analysis is of great interest in modern scientific research. This dissertation mainly fo...
Modern data collection in bioinformatics and other big-data paradigms often incorporates traits deri...
A fundamental aspect of statistics is the integration of data from different sources. Classically, F...
University of Minnesota Ph.D. dissertation. July 2018. Major: Biostatistics. Advisor: Eric Lock. 1 c...
This research is focused on high dimensional data integration by combing test statistics or informa...
Research in several fields now requires the analysis of datasets in which multiple high-dimensional ...
Research in genomics and related fields now often requires the analysis of emph{multi-block} data, i...
Driven by the growth of the Internet, online applications, and data sharing initiatives, available s...
In this dissertation, I develop statistical methods to address three important scientific problems. ...
This article describes an optimal method (conflation) to consolidate data from different experiments...
Data integration is the process of extracting information from multiple sources and analyzing differ...
A vast amount of the statistical literature deals with a single sample coming from a distribution wh...
Technological advancements and global data sharing allow for the collection of information from mult...