Deep-learning accelerators are increasingly popular. There are two prevalent accelerator architectures: one based on general matrix multiplication units and the other on convolution cores. However, Tensor Virtual Machine (TVM), a widely used deep-learning compiler stack, does not support the latter. This paper proposes a general framework for extending TVM to support deep-learning accelerators with convolution cores. We have applied it to two well-known accelerators: Nvidia\u27s NVDLA and Bitmain\u27s BM1880 successfully. Deep-learning workloads can now be readily deployed to these accelerators through TVM and executed efficiently. This framework can extend TVM to other accelerators with minimum effort
Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Deep learning-based solutions and, in particular, deep neural networks (DNNs) are at the heart of se...
Deep learning algorithms are gaining popularity in autonomous systems. These systems typically have ...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
There has been a surge in the demand for a Domain Specific Architecture due to wide ranging deep lea...
The size of neural networks a GPU can train is limited by the GPU’s memory capacity. Although GPU vi...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
The flourish of deep learning frameworks and hardware platforms has been demanding an efficient comp...
Machine learning has gained success in many application domains including medical data analysis, fin...
Computational intensive applications such as pattern recognition, and natural language processing, a...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
This report shows the steps needed for one to implement a deep Learning hardware accelerator based o...
Popular Machine Learning (ML) and High Performance Computing (HPC) workloads contribute to a signifi...
Machine Learning (ML) frameworks are tools that facilitate the development and deployment of ML mode...
Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Deep learning-based solutions and, in particular, deep neural networks (DNNs) are at the heart of se...
Deep learning algorithms are gaining popularity in autonomous systems. These systems typically have ...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
There has been a surge in the demand for a Domain Specific Architecture due to wide ranging deep lea...
The size of neural networks a GPU can train is limited by the GPU’s memory capacity. Although GPU vi...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
The flourish of deep learning frameworks and hardware platforms has been demanding an efficient comp...
Machine learning has gained success in many application domains including medical data analysis, fin...
Computational intensive applications such as pattern recognition, and natural language processing, a...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
This report shows the steps needed for one to implement a deep Learning hardware accelerator based o...
Popular Machine Learning (ML) and High Performance Computing (HPC) workloads contribute to a signifi...
Machine Learning (ML) frameworks are tools that facilitate the development and deployment of ML mode...
Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Deep learning-based solutions and, in particular, deep neural networks (DNNs) are at the heart of se...