Data-driven decision making has become a necessary commodity in virtually every domain of human endeavor, fueled by the exponential growth in the availability of data and the rapid increase in our computing power. In principle, if the collected data contain sufficient information, it is possible to build a useful model for making decisions. Nevertheless, there are a few challenges to address to bring it into reality. First, the gathered data can be contaminated by noise, or even by missing values. Second, building a model from data usually involves solving an optimization problem, which may require prohibitively large computational resources. In this thesis, we explore two research directions, motivated by these two challenges. In the fi...
2018-01-18This is the era of big data, where both challenges and opportunities lie ahead for the mac...
Solving the missing-value (MV) problem with small estimation errors in large-scale data environment...
This work considers the problem of learning with missing data. Two main classes of approaches are co...
The last few years have witnessed the rise of the big data era, which features the prevalence of dat...
With the ever-growing data sizes along with the increasing complexity of the modern problem formulat...
Some of the most challenging issues in big data are size, scalability and reliability. Big data, su...
Methods that analyze large-scale data and make predictions based on data are increasingly prevalent ...
This electronic version was submitted by the student author. The certified thesis is available in th...
Missing values arise in most real-world data sets due to the aggregation of multiple sources and int...
Modern learning problems in nature language processing, computer vision, computational biology, etc....
The aim of this thesis is to develop scalable numerical optimization methods that can be used to add...
We explore a general statistical framework for low-rank modeling of matrix-valued data, based on con...
Modern technological advances have prompted massive scale data collection in manymodern fields such ...
This thesis shows how we can exploit low-dimensional structure in high-dimensional statistics and ma...
Missing data imputation is an important issue in machine learning and data mining. In this paper, we...
2018-01-18This is the era of big data, where both challenges and opportunities lie ahead for the mac...
Solving the missing-value (MV) problem with small estimation errors in large-scale data environment...
This work considers the problem of learning with missing data. Two main classes of approaches are co...
The last few years have witnessed the rise of the big data era, which features the prevalence of dat...
With the ever-growing data sizes along with the increasing complexity of the modern problem formulat...
Some of the most challenging issues in big data are size, scalability and reliability. Big data, su...
Methods that analyze large-scale data and make predictions based on data are increasingly prevalent ...
This electronic version was submitted by the student author. The certified thesis is available in th...
Missing values arise in most real-world data sets due to the aggregation of multiple sources and int...
Modern learning problems in nature language processing, computer vision, computational biology, etc....
The aim of this thesis is to develop scalable numerical optimization methods that can be used to add...
We explore a general statistical framework for low-rank modeling of matrix-valued data, based on con...
Modern technological advances have prompted massive scale data collection in manymodern fields such ...
This thesis shows how we can exploit low-dimensional structure in high-dimensional statistics and ma...
Missing data imputation is an important issue in machine learning and data mining. In this paper, we...
2018-01-18This is the era of big data, where both challenges and opportunities lie ahead for the mac...
Solving the missing-value (MV) problem with small estimation errors in large-scale data environment...
This work considers the problem of learning with missing data. Two main classes of approaches are co...