With the growth in size and complexity of data, methods exploiting low-dimensional structure, as well as distributed methods, have been playing an ever important role in machine learning. These approaches offer a natural choice to alleviate the computational burden, albeit typically at a statistical trade-off. In this thesis, we show that a careful utilization of structure of a problem, or bottlenecks of a distributed system, can also provide a statistical advantage in such settings. We do this from the purview of the following three problems: 1. Learning Graphical models with a few hubs: Graphical models are a popular tool to represent multivariate distributions. The task of learning a graphical model entails estimating the graph of condit...
Distributed learning provides an attractive framework for scaling the learning task by sharing the c...
The rise of big data has led to new demands for machine learning (ML) systems to learn complex model...
Over the past two decades graphical models have been widely used as a powerful tool for compactly re...
With the growth in size and complexity of data, methods exploiting low-dimensional structure, as wel...
In recent studies, the generalization properties for distributed learning and random features assume...
A current challenge for data management systems is to support the construction and maintenance of ma...
Critical to high-dimensional statistical estimation is to exploit the structure in the data distribu...
International audienceThe development of cluster computing frameworks has allowed practitioners to s...
Training a large-scale model over a massive data set is an extremely computation and storage intensi...
Distributed machine learning bridges the traditional fields of distributed systems and machine learn...
We study generalization properties of distributed algorithms in the setting of nonparametric regress...
The area of machine learning has made considerable progress over the past decade, enabled by the wid...
A current challenge for data management systems is to support the construction and maintenance of ma...
We live in an age of big data. Analyzing modern data sets can be very difficult because they usually...
ABSTRACTThe rise of big data has led to new demands for machine learning (ML) systems to learn compl...
Distributed learning provides an attractive framework for scaling the learning task by sharing the c...
The rise of big data has led to new demands for machine learning (ML) systems to learn complex model...
Over the past two decades graphical models have been widely used as a powerful tool for compactly re...
With the growth in size and complexity of data, methods exploiting low-dimensional structure, as wel...
In recent studies, the generalization properties for distributed learning and random features assume...
A current challenge for data management systems is to support the construction and maintenance of ma...
Critical to high-dimensional statistical estimation is to exploit the structure in the data distribu...
International audienceThe development of cluster computing frameworks has allowed practitioners to s...
Training a large-scale model over a massive data set is an extremely computation and storage intensi...
Distributed machine learning bridges the traditional fields of distributed systems and machine learn...
We study generalization properties of distributed algorithms in the setting of nonparametric regress...
The area of machine learning has made considerable progress over the past decade, enabled by the wid...
A current challenge for data management systems is to support the construction and maintenance of ma...
We live in an age of big data. Analyzing modern data sets can be very difficult because they usually...
ABSTRACTThe rise of big data has led to new demands for machine learning (ML) systems to learn compl...
Distributed learning provides an attractive framework for scaling the learning task by sharing the c...
The rise of big data has led to new demands for machine learning (ML) systems to learn complex model...
Over the past two decades graphical models have been widely used as a powerful tool for compactly re...