Machine learning and statistics with very large datasets is now a topic of widespread interest, both in academia and industry. Many such tasks can be posed as convex optimization problems, so algorithms for distributed convex optimization serve as a powerful, general-purpose mechanism for training a wide class of models on datasets too large to process on a single machine. In previous work, it has been shown how to solve such problems in such a way that each machine only looks at either a subset of training examples or a subset of features. In this paper, we extend these algorithms by showing how to split problems by both examples and features simultaneously, which is necessary to deal with datasets that are very large in both dimensions. W...
Modern machine learning systems pose several new statistical, scalability, privacy and ethical chall...
We consider unsupervised partitioning problems based explicitly or implicitly on the minimiza-tion o...
Many machine learning algorithms rely heavily on the existence of a good measure of (dis-)similarity...
This paper describes a general purpose method for solving convex optimization problems in a distribu...
Many problems of recent interest in statistics and machine learning can be posed in the framework of...
Over recent years we have seen the appearance of huge datasets that do not fit into memory and do no...
Classical optimization techniques have found widespread use in machine learning. Convex optimization...
Many important machine learning problems are modeled and solved via semidefinite programs; examples ...
We present the Convex Polytope Machine (CPM), a novel non-linear learning algorithm for large-scale ...
This thesis proposes parallel and distributed algorithms for solving very largescale sparse optimiza...
Many important machine learning problems are modeled and solved via semidefinite programs; examples ...
Thesis: Ph. D. in Mathematics and Operations Research, Massachusetts Institute of Technology, Depart...
Training a large-scale model over a massive data set is an extremely computation and storage intensi...
Machine learning has achieved tremendous successes and played increasingly essential roles in many a...
Machine learning is gaining fresh momentum, and has helped us to enhance not only many industrial an...
Modern machine learning systems pose several new statistical, scalability, privacy and ethical chall...
We consider unsupervised partitioning problems based explicitly or implicitly on the minimiza-tion o...
Many machine learning algorithms rely heavily on the existence of a good measure of (dis-)similarity...
This paper describes a general purpose method for solving convex optimization problems in a distribu...
Many problems of recent interest in statistics and machine learning can be posed in the framework of...
Over recent years we have seen the appearance of huge datasets that do not fit into memory and do no...
Classical optimization techniques have found widespread use in machine learning. Convex optimization...
Many important machine learning problems are modeled and solved via semidefinite programs; examples ...
We present the Convex Polytope Machine (CPM), a novel non-linear learning algorithm for large-scale ...
This thesis proposes parallel and distributed algorithms for solving very largescale sparse optimiza...
Many important machine learning problems are modeled and solved via semidefinite programs; examples ...
Thesis: Ph. D. in Mathematics and Operations Research, Massachusetts Institute of Technology, Depart...
Training a large-scale model over a massive data set is an extremely computation and storage intensi...
Machine learning has achieved tremendous successes and played increasingly essential roles in many a...
Machine learning is gaining fresh momentum, and has helped us to enhance not only many industrial an...
Modern machine learning systems pose several new statistical, scalability, privacy and ethical chall...
We consider unsupervised partitioning problems based explicitly or implicitly on the minimiza-tion o...
Many machine learning algorithms rely heavily on the existence of a good measure of (dis-)similarity...