We propose a parameter server system for distributed ML, which follows a Stale Synchronous Parallel (SSP) model of computation that maximizes the time computational workers spend doing useful work on ML algorithms, while still providing correctness guarantees. The parameter server provides an easy-to-use shared interface for read/write access to an ML model’s values (parameters and variables), and the SSP model allows distributed workers to read older, stale versions of these values from a local cache, instead of waiting to get them from central storage. This significantly increases the proportion of time workers spend computing, as opposed to waiting. Furthermore, the SSP model ensures ML algorithm correctness by limiting the maximum...
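The bounded-staleness read path this abstract describes can be pictured with a short sketch. This is a minimal illustration only, under assumed names (SSPParameterCache, fetch_fn, staleness), not the interface of the actual parameter server: a worker at clock t serves a read from its local cache as long as the cached copy is at most `staleness` clocks old, and otherwise falls back to central storage.

```python
import threading

class SSPParameterCache:
    """Sketch of an SSP-style read path (hypothetical names).

    A worker at iteration (clock) t may serve a read from its local
    cache while the cached copy was fetched at a clock no older than
    t - staleness; otherwise it refreshes from central storage.
    """

    def __init__(self, fetch_fn, staleness):
        self.fetch_fn = fetch_fn    # pulls fresh values from the server
        self.staleness = staleness  # maximum allowed clock lag
        self.cache = {}             # key -> (value, clock_when_fetched)
        self.lock = threading.Lock()

    def read(self, key, worker_clock):
        with self.lock:
            entry = self.cache.get(key)
            if entry is not None:
                value, fetched_at = entry
                # A stale read is allowed within the staleness bound.
                if worker_clock - fetched_at <= self.staleness:
                    return value
            # Cache miss or too stale: block on the central store.
            value = self.fetch_fn(key)
            self.cache[key] = (value, worker_clock)
            return value


# Toy usage: a plain dict stands in for central storage.
if __name__ == "__main__":
    server = {"w": 0.0}
    cache = SSPParameterCache(fetch_fn=server.get, staleness=3)
    print(cache.read("w", worker_clock=0))  # fetched from "server"
    server["w"] = 1.0
    print(cache.read("w", worker_clock=2))  # served stale from cache
    print(cache.read("w", worker_clock=5))  # too stale, refetched
```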
Many modern machine learning (ML) algorithms are iterative, converging on a final solution via many...
A major bottleneck to applying advanced ML programs at industrial scales is the migration of an acad...
The most popular framework for distributed training of machine learning models...
In distributed ML applications, shared parameters are usually replicated among computing nodes to mi...
As Machine Learning (ML) applications embrace greater data size and model complexity, practitioners ...
As Machine Learning (ML) applications increase in data size and model complexity, practitioners turn...
Distributed machine learning has typically been approached from a data parallel perspective, where b...
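As a rough illustration of the data-parallel view mentioned above, the sketch below partitions a toy dataset across two simulated workers, computes a gradient on each shard, and averages the shard gradients into one model update. The names (shard_gradient, data_parallel_step) and the toy least-squares model are assumptions for illustration, not part of any system described here.

```python
# Data-parallel sketch (hypothetical names): the dataset is partitioned
# across workers, each worker computes a gradient on its own shard, and
# the shard gradients are averaged into a single update.

def shard_gradient(weight, shard):
    # Least-squares gradient for y ~ w*x over one data shard.
    g = 0.0
    for x, y in shard:
        g += 2.0 * (weight * x - y) * x
    return g / len(shard)

def data_parallel_step(weight, shards, lr=0.01):
    grads = [shard_gradient(weight, s) for s in shards]  # "workers"
    avg = sum(grads) / len(grads)                         # aggregation
    return weight - lr * avg

if __name__ == "__main__":
    data = [(x, 2.0 * x) for x in range(1, 9)]  # ground truth: y = 2x
    shards = [data[0:4], data[4:8]]             # two simulated workers
    w = 0.0
    for _ in range(50):
        w = data_parallel_step(w, shards)
    print(round(w, 3))  # approaches 2.0
```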
Many large-scale machine learning (ML) applications use iterative algorithms to converge on paramet...
With the machine learning applications processing larger...
To keep up with increasing dataset sizes and model complexity, distributed training has become a nec...