With the rapid growth of deep learning models and higher expectations for their accuracy and throughput in real-world applications, the demand for profiling and characterizing model inference on different hardware/software stacks has increased significantly. Since model inference characterization on GPUs has already been studied extensively, it is worth exploring how performance-enhancing libraries such as Intel MKL-DNN help boost performance on Intel CPUs. We develop a profiling mechanism to capture MKL-DNN operation calls and formulate the tracing timeline with spans on the server. Through profiling and characterization that give insights into Intel MKL-DNN, we evaluate and demonstrate that its optimization techniques, including blocked ...
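As a rough illustration of what span-based tracing of MKL-DNN calls can look like (a minimal sketch, not the paper's own tooling): MKL-DNN/oneDNN can print one log line per executed primitive when its verbose mode is enabled, and those per-primitive timings can be folded into spans. The environment-variable name and the exact field layout vary across library versions, and `infer.py` is a hypothetical workload script.

```python
import os
import subprocess
import sys

def run_with_mkldnn_verbose(cmd):
    # Enable MKL-DNN verbose logging for the child process. Depending on the
    # library version the variable is MKLDNN_VERBOSE, DNNL_VERBOSE, or
    # ONEDNN_VERBOSE; setting all three is harmless.
    env = dict(os.environ, MKLDNN_VERBOSE="1", DNNL_VERBOSE="1", ONEDNN_VERBOSE="1")
    result = subprocess.run(cmd, env=env, capture_output=True, text=True)
    return result.stdout.splitlines()

def lines_to_spans(lines):
    # Each verbose line is comma-separated and, in the versions I have seen,
    # ends with the primitive's execution time in milliseconds. The index of
    # the primitive-kind field shifts between versions, so treat the label as
    # best-effort. Start times are reconstructed by accumulating durations,
    # since the log carries no wall-clock timestamps.
    spans, cursor = [], 0.0
    for line in lines:
        if not line.startswith(("mkldnn_verbose", "dnnl_verbose", "onednn_verbose")):
            continue
        fields = line.split(",")
        try:
            duration_ms = float(fields[-1])
        except ValueError:
            continue  # version banner or malformed line
        name = fields[2] if len(fields) > 2 else "unknown"
        spans.append({"name": name, "start_ms": cursor, "duration_ms": duration_ms})
        cursor += duration_ms
    return spans

if __name__ == "__main__":
    # Hypothetical usage: trace the MKL-DNN primitives executed by infer.py.
    spans = lines_to_spans(run_with_mkldnn_verbose([sys.executable, "infer.py"]))
    for s in spans[:10]:
        print(f"{s['name']:<20} start={s['start_ms']:9.3f} ms  dur={s['duration_ms']:8.3f} ms")
```

The resulting spans can be fed into any timeline viewer; this only approximates the server-side tracing described in the abstract, which is not detailed enough here to reproduce exactly.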
The spread of deep learning on embedded devices has prompted the development of numerous methods to ...
The reconstruction of charged particle trajectories is an essential component of high energy physics...
Machine Learning (ML) frameworks are tools that facilitate the development and deployment of ML mode...
Thesis (Master's)--University of Washington, 2018. Embedded platforms with integrated graphics process...
Deep learning is widely used in many problem areas, namely computer vision, natural language process...
In this paper, we analyze heterogeneous performance exhibited by some popular deep learning software...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
While providing the same functionality, the various Deep Learning software frameworks available thes...
The deep learning community focuses on training networks for better accuracy on GPU servers. Howev...
Machine learning has been widely used in various application domains such as recommendation, compute...
Machine Learning involves analysing large sets of training data to make predictions and decisions to...
In recent years, machine learning (ML) and, more noticeably, deep learning (DL), have become incre...
A recent effort to explore a neural network inference in FPGAs focusing on low-latency applications ...
GPU is a powerful, pervasive, and indispensable platform for running deep learning (DL) workloads in ...
When executing a deep neural network (DNN), its model parameters are loaded into GPU memory before e...