The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the existing deep learning compilers, TVM is well known for its efficiency in code generation and optimization across diverse hardware devices. In the meanwhile, the Sunway many-core processor renders itself as a competitive candidate for its attractive computational power in both scientific computing and deep learning workloads. This paper combines the trends in these two directions. Specifically, we propose swTVM that extends the original TVM to support ahead-of-time compilation for architecture requiring cross-compilation suc...
Machine learning has gained success in many application domains including medical data analysis, fin...
This paper presents a compiler flow to map Deep Convolutional Networks (ConvNets) to a highly specia...
Neural networks are becoming more and more popular in scientific field and in the industry. It is mo...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
Deep-learning accelerators are increasingly popular. There are two prevalent accelerator architectur...
Deep learning algorithms are gaining popularity in autonomous systems. These systems typically have ...
Deploying deep learning models on various devices has become an important topic. The wave of hardwar...
The rapid advancement of Artificial intelligence (AI) is making our everyday life easier with smart ...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
Shorterm Memory AI model shows that during the silent period of memory, the brain can use the short-...
High-performance tensor programs are crucial to guarantee efficient execution of deep neural network...
Advances in high-performance computer architecture design have been a major driver for the rapid evo...
Machine learning has been widely used in various application domains such as recommendation, compute...
Training deep learning (DL) models is a highly compute-intensive task since it involves operating on...
Over the last years, deep learning architectures have gained attention by winning important interna...
Machine learning has gained success in many application domains including medical data analysis, fin...
This paper presents a compiler flow to map Deep Convolutional Networks (ConvNets) to a highly specia...
Neural networks are becoming more and more popular in scientific field and in the industry. It is mo...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
Deep-learning accelerators are increasingly popular. There are two prevalent accelerator architectur...
Deep learning algorithms are gaining popularity in autonomous systems. These systems typically have ...
Deploying deep learning models on various devices has become an important topic. The wave of hardwar...
The rapid advancement of Artificial intelligence (AI) is making our everyday life easier with smart ...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
Shorterm Memory AI model shows that during the silent period of memory, the brain can use the short-...
High-performance tensor programs are crucial to guarantee efficient execution of deep neural network...
Advances in high-performance computer architecture design have been a major driver for the rapid evo...
Machine learning has been widely used in various application domains such as recommendation, compute...
Training deep learning (DL) models is a highly compute-intensive task since it involves operating on...
Over the last years, deep learning architectures have gained attention by winning important interna...
Machine learning has gained success in many application domains including medical data analysis, fin...
This paper presents a compiler flow to map Deep Convolutional Networks (ConvNets) to a highly specia...
Neural networks are becoming more and more popular in scientific field and in the industry. It is mo...