Deep neural network (DNN) latency characterization is a time-consuming process and adds significant cost to Neural Architecture Search (NAS) processes when searching for efficient convolutional neural networks for embedded vision applications. DNN Latency is a hardware dependent metric and requires direct measurement or inference on target hardware. A recently introduced latency estimation technique known as MAPLE predicts DNN execution time on previously unseen hardware devices by using hardware performance counters. Leveraging these hardware counters in the form of an implicit prior, MAPLE achieves state-of-the-art performance in latency prediction. Here, we propose MAPLE-X which extends MAPLE by incorporating explicit prior knowledge of ...
© 2017 IEEE. Deep neural networks (DNNs) are currently widely used for many artificial intelligence ...
Abstract. We present an estimation methodology, accurately predicting the execution time for a given...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
Modern deep neural networks must demonstrate state-of-the-art accuracy while exhibiting low latency ...
DNNs have been finding a growing number of applications including image classification, speech recog...
This paper presents PreVIous, a methodology to predict the performance of Convolutional Neural Netwo...
With more powerful yet efficient embedded devices and accelerators being available for Deep Neural N...
The ability to accurately predict deep neural network (DNN) inference performance metrics, such as l...
Mixed-precision quantization, where a deep neural network's layers are quantized to different precis...
Deep learning is attracting interest across a variety of domains, including natural language process...
Deep neural networks (DNNs) are a vital tool in pattern recognition and Machine Learning (ML) – solv...
Deep learning applications have been widely adopted on edge devices, to mitigate the privacy and lat...
While providing the same functionality, the various Deep Learning software frameworks available thes...
This work presents an in-depth analysis of the majority of the deep neural networks (DNNs) proposed ...
Large Deep Neural Networks (DNNs) are the backbone of today's artificial intelligence due to their a...
© 2017 IEEE. Deep neural networks (DNNs) are currently widely used for many artificial intelligence ...
Abstract. We present an estimation methodology, accurately predicting the execution time for a given...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
Modern deep neural networks must demonstrate state-of-the-art accuracy while exhibiting low latency ...
DNNs have been finding a growing number of applications including image classification, speech recog...
This paper presents PreVIous, a methodology to predict the performance of Convolutional Neural Netwo...
With more powerful yet efficient embedded devices and accelerators being available for Deep Neural N...
The ability to accurately predict deep neural network (DNN) inference performance metrics, such as l...
Mixed-precision quantization, where a deep neural network's layers are quantized to different precis...
Deep learning is attracting interest across a variety of domains, including natural language process...
Deep neural networks (DNNs) are a vital tool in pattern recognition and Machine Learning (ML) – solv...
Deep learning applications have been widely adopted on edge devices, to mitigate the privacy and lat...
While providing the same functionality, the various Deep Learning software frameworks available thes...
This work presents an in-depth analysis of the majority of the deep neural networks (DNNs) proposed ...
Large Deep Neural Networks (DNNs) are the backbone of today's artificial intelligence due to their a...
© 2017 IEEE. Deep neural networks (DNNs) are currently widely used for many artificial intelligence ...
Abstract. We present an estimation methodology, accurately predicting the execution time for a given...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...