We propose LOCO, a distributed algorithm which solves large-scale ridge reg-ression. LOCO randomly assigns variables to different processing units which do not communicate. Important dependencies between variables are preserved us-ing random projections which are cheap to compute. We show that LOCO has bounded approximation error compared to the exact ridge regression solution in the fixed design setting. Experimentally, in addition to obtaining significant speedups LOCO achieves good predictive accuracy on a variety of large-scale reg-ression problems. Notably LOCO is able to solve a regression problem with 5 billion non-zeros distributed across 128 workers in 25 seconds.
We study the problem of distribution to real regression, where one aims to regress a map-ping f that...
The presence of the multicollinearity problem in the predictor data causes the variance of the ordin...
We propose a new distributed algorithm for em-pirical risk minimization in machine learning. The alg...
We develop a new distributed algorithm to solve the ridge regression problem with feature partitioni...
We study the distributed machine learning problem where the n feature-response pairs are partitioned...
We propose a fast algorithm for ridge regression when the number of features is much larger than the...
We propose a new two stage algorithm LING for large scale regression problems. LING has the same ris...
We live in an age of big data. Analyzing modern data sets can be very difficult because they usually...
Ridge regression is a classical statistical technique that attempts to address the bias-variance tra...
Ridge regression is a technical method to deal with highly correlated data when using regression mod...
We propose a fast algorithm for ridge regression when the number of features is much larger than the...
In several supervised learning applications, it happens that reconstruction methods have to be appli...
Ridge regression method is an improved method when the assumptions of independence of the explanator...
Ridge regression is a technical method to deal with highly correlated data when using regression mod...
Decisions are increasingly taken by both humans and machine learning models. However, machine learni...
We study the problem of distribution to real regression, where one aims to regress a map-ping f that...
The presence of the multicollinearity problem in the predictor data causes the variance of the ordin...
We propose a new distributed algorithm for em-pirical risk minimization in machine learning. The alg...
We develop a new distributed algorithm to solve the ridge regression problem with feature partitioni...
We study the distributed machine learning problem where the n feature-response pairs are partitioned...
We propose a fast algorithm for ridge regression when the number of features is much larger than the...
We propose a new two stage algorithm LING for large scale regression problems. LING has the same ris...
We live in an age of big data. Analyzing modern data sets can be very difficult because they usually...
Ridge regression is a classical statistical technique that attempts to address the bias-variance tra...
Ridge regression is a technical method to deal with highly correlated data when using regression mod...
We propose a fast algorithm for ridge regression when the number of features is much larger than the...
In several supervised learning applications, it happens that reconstruction methods have to be appli...
Ridge regression method is an improved method when the assumptions of independence of the explanator...
Ridge regression is a technical method to deal with highly correlated data when using regression mod...
Decisions are increasingly taken by both humans and machine learning models. However, machine learni...
We study the problem of distribution to real regression, where one aims to regress a map-ping f that...
The presence of the multicollinearity problem in the predictor data causes the variance of the ordin...
We propose a new distributed algorithm for em-pirical risk minimization in machine learning. The alg...