One-Shot Tuner for Deep Learning Compilers

Ryu, Jaehun
Park, Eunhyeok
Sung, Hyojin

Publication date

April 2022

Publisher

Association for Computing Machinery, Inc

Abstract

Auto-Tuning DL compilers are gaining ground as an optimizing back-end for DL frameworks. While existing work can generate deep learning models that exceed the performance of hand-Tuned libraries, they still suffer from prohibitively long auto-Tuning time due to repeated hardware measurements in large search spaces. In this paper, we take a neural-predictor inspired approach to reduce the auto-Tuning overhead and show that a performance predictor model trained prior to compilation can produce optimized tensor operation codes without repeated search and hardware measurements. To generate a sample-efficient training dataset, we extend input representation to include task-specific information and to guide data sampling methods to focus on learn...

Extracted data

We use cookies to provide a better user experience.

Data Protection

One-Shot Tuner for Deep Learning Compilers

Abstract

Extracted data

One-Shot Tuner for Deep Learning Compilers

Abstract

Extracted data

Related items

Related items