Codes for Distributed Machine Learning

Muthuveeru Subramaniam, Adarsh

Publication date

January 2021

Abstract

The problem considered is that of distributing machine learning operations of matrix multiplication and multivariate polynomial evaluation among computer nodes a.k.a worker nodes some of whom don’t return their outputs or return erroneous outputs. The thesis can be divided into three major parts. In the first part of the thesis, a fault tolerant setup where t worker nodes return erroneous values is considered. For an additive random Gaussian error model, it is shown that for all t < N − K, errors can be corrected with probability 1 for polynomial codes. In the second part of the thesis, a class of codes called random Khatri-Rao-Product (RKRP) codes for distributed matrix multiplication in the presence of stragglers is proposed. The main...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Codes for Distributed Machine Learning

Abstract

Extracted data

Codes for Distributed Machine Learning

Abstract

Extracted data

Related items

Related items