To keep up with increasing dataset sizes and model complexity, distributed training has become a necessity for large machine learning tasks. Parameter servers ease the implementation of distributed parameter management, a key concern in distributed training, but they can induce severe communication overhead. To reduce this overhead, distributed machine learning algorithms use techniques to increase parameter access locality (PAL), achieving up to linear speed-ups. However, we found that existing parameter servers provide only limited support for PAL techniques and therefore prevent efficient training. In this paper, we explore whether and to what extent PAL techniques can be supported, and whether such support is beneficial. We pro...
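As a minimal sketch of the parameter-server access pattern discussed above, the following Python snippet illustrates the pull/push interface and why remote parameter accesses dominate communication. The names (`PSClient`, `pull`, `push`) and the in-process key-value store are illustrative assumptions, not the API of any particular system.

```python
import numpy as np

# Minimal sketch of a parameter-server client, assuming a hypothetical
# in-process stand-in for the distributed key-value store. In a real
# deployment each key lives on some server node, so every pull/push on a
# non-local key costs a network round trip; PAL techniques aim to keep the
# keys a worker accesses on (or near) that worker.
class PSClient:
    def __init__(self, dim=8):
        self.dim = dim
        self.store = {}  # key -> parameter vector (stand-in for remote servers)

    def pull(self, keys):
        # Read the current values of the requested parameters.
        return [self.store.setdefault(k, np.zeros(self.dim)) for k in keys]

    def push(self, keys, updates):
        # Apply (sparse) additive updates to the owning servers.
        for k, u in zip(keys, updates):
            self.store[k] = self.store.setdefault(k, np.zeros(self.dim)) + u


# Usage: a worker typically touches only the keys that appear in its local
# data shard, which is exactly the access locality that PAL techniques exploit.
ps = PSClient()
keys = ["w:3", "w:17"]
params = ps.pull(keys)
ps.push(keys, [-0.1 * p for p in params])  # toy gradient step
```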
Distributed machine learning has typically been approached from a data parallel perspective, wher...
Distributed machine learning is becoming increasingly popular for large scale data mining on large s...
In distributed ML applications, shared parameters are usually replicated among computing nodes to...
As Machine Learning (ML) applications embrace greater data size and model complexity, practitioners ...
As Machine Learning (ML) applications increase in data size and model complexity, practitioners turn...
We propose a parameter server system for distributed ML, which follows a Stale Synchronous Parallel ...
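As a rough illustration of the Stale Synchronous Parallel (SSP) model mentioned in this abstract, the sketch below shows the core staleness condition: a worker may run ahead of the slowest worker by at most a fixed bound. The class name and centralized clock table are assumptions for clarity, not the proposed system's interface.

```python
# Hedged sketch of the SSP staleness condition, assuming a centralized clock
# table (real systems track worker progress in a distributed fashion).
class SspClock:
    def __init__(self, num_workers, staleness):
        self.clocks = [0] * num_workers  # per-worker iteration counters
        self.staleness = staleness       # maximum allowed clock gap s

    def can_proceed(self, worker_id):
        # A worker may start its next iteration only if it is at most s
        # clocks ahead of the slowest worker; otherwise it must wait.
        return self.clocks[worker_id] - min(self.clocks) <= self.staleness

    def tick(self, worker_id):
        # Called after a worker finishes an iteration and pushes its updates.
        self.clocks[worker_id] += 1
```

With a staleness bound of 0 this degenerates to Bulk Synchronous Parallel; larger bounds trade freshness of reads for less time spent waiting on stragglers.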
The most popular framework for distributed training of machine learning models...
Large scale machine learning has many characteristics that can be exploited in the system designs to...
In order to exploit the distributed nature of sensors, distributed machine learning has beco...
Many emerging AI applications request distributed machine learning (ML) among edge systems (e.g., Io...
Machine learning (ML) has become a powerful building block for modern services, scientific endeavors...