In this work, we extend the auto-tuning process of the state-of-the-art TVM framework with XFeatur; a tool that extracts new meaningful hardware-related features that improve the quality of the representation of the search space and consequently improve the accuracy of its prediction algorithm. These new features provide information about the amount of thread-level parallelism, shared memory usage, register usage, dynamic instruction count and memory access dependencies. Optimizing ResNet-18 with the proposed features improves the quality of the search space representation by 63% on average and a maximum of 2× for certain tasks, while it reduces the tuning time by 9% (approximately 1.1 hours) and produces configurations that have equal or b...
Deep Neural Networks (DNNs) have shown superior accuracy at the expense of high memory and computati...
Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extrac...
Machine learning is becoming increasingly common in our society, from recommendation systems, audio ...
The process of optimizing the latency of DNN operators with ML models and hardware-in-the-loop, call...
Cataloged from PDF version of article.In recent times, large high-dimensional datasets have become u...
Hardware systems integrated with deep neural networks (DNNs) are deemed to pave the way for future a...
The spread of deep learning on embedded devices has prompted the development of numerous methods to ...
The spread of deep learning on embedded devices has prompted the development of numerous methods to ...
Deep Neural Networks (DNNs) have been traditionally designed by human experts in a painstaking and e...
The proliferation of AI across a variety of domains (vision, language, speech, recommendations, game...
Deep Learning is increasingly being adopted by industry for computer vision applications running on ...
Recent advances in algorithm-hardware co-design for deep neural networks (DNNs) have demonstrated th...
peer reviewedDeep Neural Networks (DNNs) are intensively used to solve a wide variety of complex pro...
Deep Neural Networks (DNNs) are extremely computationally demanding, which presents a large barrier ...
Tuning and optimising the operations executed in deep learning frameworks is a fundamental task in a...
Deep Neural Networks (DNNs) have shown superior accuracy at the expense of high memory and computati...
Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extrac...
Machine learning is becoming increasingly common in our society, from recommendation systems, audio ...
The process of optimizing the latency of DNN operators with ML models and hardware-in-the-loop, call...
Cataloged from PDF version of article.In recent times, large high-dimensional datasets have become u...
Hardware systems integrated with deep neural networks (DNNs) are deemed to pave the way for future a...
The spread of deep learning on embedded devices has prompted the development of numerous methods to ...
The spread of deep learning on embedded devices has prompted the development of numerous methods to ...
Deep Neural Networks (DNNs) have been traditionally designed by human experts in a painstaking and e...
The proliferation of AI across a variety of domains (vision, language, speech, recommendations, game...
Deep Learning is increasingly being adopted by industry for computer vision applications running on ...
Recent advances in algorithm-hardware co-design for deep neural networks (DNNs) have demonstrated th...
peer reviewedDeep Neural Networks (DNNs) are intensively used to solve a wide variety of complex pro...
Deep Neural Networks (DNNs) are extremely computationally demanding, which presents a large barrier ...
Tuning and optimising the operations executed in deep learning frameworks is a fundamental task in a...
Deep Neural Networks (DNNs) have shown superior accuracy at the expense of high memory and computati...
Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extrac...
Machine learning is becoming increasingly common in our society, from recommendation systems, audio ...