SCALANA: Automating Scaling Loss Detection with Graph Analysis

Jin, Yuyang
Wang, Haojie
Yu, Teng
Tang, Xiongchao
Hoefler, Torsten
Liu, Xu
Zhai, Jidong

Publication date

January 2020

DOI

10.1109/SC41405.2020.00032

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Abstract

Scaling a parallel program to modern supercomputers is challenging due to inter-process communication, Amdahl's law, and resource contention. Performance analysis tools for finding such scaling bottlenecks are based on either profiling or tracing. Profiling incurs low overhead but does not capture the detailed dependencies needed for root-cause analysis. Tracing collects all information, but at prohibitive overhead. In this work, we design SCALANA, which uses static analysis techniques to achieve the best of both worlds: it enables the analyzability of traces at a cost similar to profiling. SCALANA first leverages static compiler techniques to build a Program Structure Graph, which records the main computation and communication patterns as well as the p...
