With the ever-increasing amount of computational power available, so broadens the horizon of statistical problems that can be tackled. However, many practitioners have only an ordinary personal computer on which to do their work. The need for computationally efficient methodology is as pressing as ever, and there remain some questions as-yet without a confident answer for a practitioner working with tight computational constraints. This thesis develops methods for three such problems. The first, introductory, chapter provides an overview of the area and an accessible preamble to the problems these methods address. In the second chapter we address the problem of modelling a high-dimensional linear regression with categorical predictor vari...
High-dimensional statistics is one of the most active research topics in modern statistics. It also ...
We consider the problem of Ising and Gaussian graphical model selection given n i.i.d. samples from ...
Massive high-dimensional data sets are ubiquitous in all scientific disciplines. Extracting meaningf...
This thesis is focused on the development of computationally efficient procedures for regression mod...
Historically, the choice of method for a given statistical problem has been primarily driven by two ...
The last few years have witnessed the rise of the big data era, which features the prevalence of dat...
Several statistical problems can be described as estimation problem, where the goal is to learn a se...
With advances in science and information technologies, many scientific fields are able to meet the c...
If there are extraordinarily large data, too large to fit into a single computer or too expensive to...
With the increasing availability of big data, it is challenging to analyze complex data that is high...
Several strategies for computing the best subset regression models are proposed. Some of the algorit...
We consider the problem of variable selection in high-dimensional linear models where the number of ...
This thesis concerns the analysis of high-dimensional and large-scale data that have become ubiq-uit...
Many authors have argued that identifying parsimonious statistical models (those that are neither ov...
Many variable selection algorithms in data mining utilize ranking and thresholding predictors as a p...
High-dimensional statistics is one of the most active research topics in modern statistics. It also ...
We consider the problem of Ising and Gaussian graphical model selection given n i.i.d. samples from ...
Massive high-dimensional data sets are ubiquitous in all scientific disciplines. Extracting meaningf...
This thesis is focused on the development of computationally efficient procedures for regression mod...
Historically, the choice of method for a given statistical problem has been primarily driven by two ...
The last few years have witnessed the rise of the big data era, which features the prevalence of dat...
Several statistical problems can be described as estimation problem, where the goal is to learn a se...
With advances in science and information technologies, many scientific fields are able to meet the c...
If there are extraordinarily large data, too large to fit into a single computer or too expensive to...
With the increasing availability of big data, it is challenging to analyze complex data that is high...
Several strategies for computing the best subset regression models are proposed. Some of the algorit...
We consider the problem of variable selection in high-dimensional linear models where the number of ...
This thesis concerns the analysis of high-dimensional and large-scale data that have become ubiq-uit...
Many authors have argued that identifying parsimonious statistical models (those that are neither ov...
Many variable selection algorithms in data mining utilize ranking and thresholding predictors as a p...
High-dimensional statistics is one of the most active research topics in modern statistics. It also ...
We consider the problem of Ising and Gaussian graphical model selection given n i.i.d. samples from ...
Massive high-dimensional data sets are ubiquitous in all scientific disciplines. Extracting meaningf...