This research evaluates pattern recognition techniques on a subclass of big data where the dimensionality of the input space p is much larger than the number of observations n. Seven gene-expression microarray cancer datasets, where the ratio κ = n/p is less than one, were chosen for evaluation. The statistical and computational challenges inherent with this type of high-dimensional low sample size (HDLSS) data were explored. The capability and performance of a diverse set of machine learning algorithms is presented and compared. The sparsity and collinearity of the data being employed, in conjunction with the complexity of the algorithms studied, demanded rigorous and careful tuning of the hyperparameters and regularization parameters. Thi...
Microarray dataset dimensionality reduction is a prerequisite for avoiding overfitting, and hence de...
This research focuses on using statistical learning methods on high-dimensional biological data anal...
Microarray technologies have gained tremendous interest from researchers in recent years. The proble...
This research evaluates pattern recognition techniques on a subclass of big data where the dimension...
Advances in microarray technology have equipped researchers to measure gene expression levels simult...
During the exploration of high dimension-low-sample-size (HDLSS) data in different fields such as ge...
Doctor of PhilosophyDepartment of StatisticsHaiyan WangThis dissertation considers different methods ...
Background: Machine learning is a powerful approach for describing and predicting classes in microar...
In many technological or industrial fields, the amount of high dimensional data is steadily growing....
Introduction: As part of the MicroArray Quality Control (MAQC)-II project, this analysis examines ho...
Microarray studies are currently a very popular source of biological information. They allow the sim...
Background: In biometric practice, researchers often apply a large number of different methods in a ...
Background: In high density arrays, the identification of relevant genes for disease classification ...
High-dimensional data analysis characterises many contemporary problems in statistics and arise in m...
Background: Data generated using ‘omics’ technologies are characterized by high dimensionality, wher...
Microarray dataset dimensionality reduction is a prerequisite for avoiding overfitting, and hence de...
This research focuses on using statistical learning methods on high-dimensional biological data anal...
Microarray technologies have gained tremendous interest from researchers in recent years. The proble...
This research evaluates pattern recognition techniques on a subclass of big data where the dimension...
Advances in microarray technology have equipped researchers to measure gene expression levels simult...
During the exploration of high dimension-low-sample-size (HDLSS) data in different fields such as ge...
Doctor of PhilosophyDepartment of StatisticsHaiyan WangThis dissertation considers different methods ...
Background: Machine learning is a powerful approach for describing and predicting classes in microar...
In many technological or industrial fields, the amount of high dimensional data is steadily growing....
Introduction: As part of the MicroArray Quality Control (MAQC)-II project, this analysis examines ho...
Microarray studies are currently a very popular source of biological information. They allow the sim...
Background: In biometric practice, researchers often apply a large number of different methods in a ...
Background: In high density arrays, the identification of relevant genes for disease classification ...
High-dimensional data analysis characterises many contemporary problems in statistics and arise in m...
Background: Data generated using ‘omics’ technologies are characterized by high dimensionality, wher...
Microarray dataset dimensionality reduction is a prerequisite for avoiding overfitting, and hence de...
This research focuses on using statistical learning methods on high-dimensional biological data anal...
Microarray technologies have gained tremendous interest from researchers in recent years. The proble...