Edge devices are increasingly utilized for deploying deep learning applications on embedded systems. The real-time nature of many applications and the limited resources of edge devices necessitate latency-targeted neural network compression. However, measuring latency on real devices is challenging and expensive. Therefore, this letter presents a novel and efficient framework, named EvoLP, to accurately predict the inference latency of models on edge devices. This predictor can evolve to achieve higher latency prediction precision during the network compression process. Experimental results demonstrate that EvoLP outperforms previous state-of-the-art approaches by being evaluated on three edge devices and four model variants. Moreover, when...
peer reviewedAbstract—Many real world computer vision applications are required to run on hardware w...
With the development of mobile edge computing (MEC), more and more intelligent services and applicat...
Deep neural networks (DNNs) are becoming the core components of many applications running on edge de...
Deep learning applications have been widely adopted on edge devices, to mitigate the privacy and lat...
Edge analytics refers to the application of data analytics and Machine Learning (ML) algorithms on I...
A lot of deep learning applications are desired to be run on mobile devices. Both accuracy and infer...
Mobile networks are evolving towards centralization and cloudification while bringing computing powe...
With more powerful yet efficient embedded devices and accelerators being available for Deep Neural N...
Recent advances in both lightweight deep learning algorithms and edge computing increasingly enable ...
Convolutional neural networks (CNNs) have demonstrated encouraging results in image classification t...
Industry 4.0 and the Industrial Internet of Things (IIoT) growth will result in an explosion of data...
INST: L_042Edge computing is an essential technology to enable machine learning capabilities on IoT ...
Recent years have witnessed a rapid growth of deep-network based services and applications. A practi...
The ability to accurately predict deep neural network (DNN) inference performance metrics, such as l...
The research in real-time segmentation mainly focuses on desktop GPUs. However, autonomous driving ...
peer reviewedAbstract—Many real world computer vision applications are required to run on hardware w...
With the development of mobile edge computing (MEC), more and more intelligent services and applicat...
Deep neural networks (DNNs) are becoming the core components of many applications running on edge de...
Deep learning applications have been widely adopted on edge devices, to mitigate the privacy and lat...
Edge analytics refers to the application of data analytics and Machine Learning (ML) algorithms on I...
A lot of deep learning applications are desired to be run on mobile devices. Both accuracy and infer...
Mobile networks are evolving towards centralization and cloudification while bringing computing powe...
With more powerful yet efficient embedded devices and accelerators being available for Deep Neural N...
Recent advances in both lightweight deep learning algorithms and edge computing increasingly enable ...
Convolutional neural networks (CNNs) have demonstrated encouraging results in image classification t...
Industry 4.0 and the Industrial Internet of Things (IIoT) growth will result in an explosion of data...
INST: L_042Edge computing is an essential technology to enable machine learning capabilities on IoT ...
Recent years have witnessed a rapid growth of deep-network based services and applications. A practi...
The ability to accurately predict deep neural network (DNN) inference performance metrics, such as l...
The research in real-time segmentation mainly focuses on desktop GPUs. However, autonomous driving ...
peer reviewedAbstract—Many real world computer vision applications are required to run on hardware w...
With the development of mobile edge computing (MEC), more and more intelligent services and applicat...
Deep neural networks (DNNs) are becoming the core components of many applications running on edge de...