International audienceThe Deep learning processor (DLP), especially ASIC-based accelerators, have been proved to be a promising device for accelerating the computation of deep learning algorithms. However, the learning cost of mastering these DLPs is high as they use different programming interfaces. On the other hand, many deep learning frameworks are proposed to ease the burden of developing deep learning algorithms, but few of them support DLPs. Due to the special features in DLPs, it is hard to integrate a DLP into existed frameworks.In this paper, we propose an intermediate representation (called DLIR) to bridge the gap between DL frameworks and DLPs. DLIR is a tensor-based language with built-in tensor intrinsics that can be directly ...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Computational intensive applications such as pattern recognition, and natural language processing, a...
Multi-level intermediate representations (MLIR) show great promise for reducing the cost of building...
Deploying deep learning models on various devices has become an important topic. The wave of hardwar...
Deep Learning (DL) has created a growing demand for simpler ways to develop complex models and effic...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
Deep-learning accelerators are increasingly popular. There are two prevalent accelerator architectur...
To respond to the need for efficient training and inference of deep neural networks, a plethora of d...
Deep learning has made significant breakthroughs in various fields of artificial intelligence. Howev...
We present a library that provides optimized implementations for deep learning primitives. Deep lear...
130 pagesOver the past decade, machine learning (ML) with deep neural networks (DNNs) has become ext...
The MAtrix, TEnsor, and Deep-learning Optimized Routines (MATEDOR) project seeks to develop software...
©2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for al...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Computational intensive applications such as pattern recognition, and natural language processing, a...
Multi-level intermediate representations (MLIR) show great promise for reducing the cost of building...
Deploying deep learning models on various devices has become an important topic. The wave of hardwar...
Deep Learning (DL) has created a growing demand for simpler ways to develop complex models and effic...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
Deep-learning accelerators are increasingly popular. There are two prevalent accelerator architectur...
To respond to the need for efficient training and inference of deep neural networks, a plethora of d...
Deep learning has made significant breakthroughs in various fields of artificial intelligence. Howev...
We present a library that provides optimized implementations for deep learning primitives. Deep lear...
130 pagesOver the past decade, machine learning (ML) with deep neural networks (DNNs) has become ext...
The MAtrix, TEnsor, and Deep-learning Optimized Routines (MATEDOR) project seeks to develop software...
©2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for al...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Computational intensive applications such as pattern recognition, and natural language processing, a...