Boosting the runtime performance of deep neural networks (DNNs) is critical due to their wide adoption in real-world tasks. Existing approaches to optimizing the tensor algebra expression of a DNN only consider expressions representable by a fixed set of predefined operators, missing possible optimization opportunities between general expressions. We propose OLLIE, the first derivation-based tensor program optimizer. OLLIE optimizes tensor programs by leveraging transformations between general tensor algebra expressions, enabling a significantly larger expression search space that includes those supported by prior work as special cases. OLLIE uses a hybrid derivation-based optimizer that effectively combines explorative and guided derivatio...
The spread of deep learning on embedded devices has prompted the development of numerous methods to ...
This paper presents a state-of-the-art overview on how to architect, design, and optimize Deep Neura...
Deep Neural Networks (DNNs) have revolutionized many aspects of our lives. The use of DNNs is becomi...
High-performance tensor programs are crucial to guarantee efficient execution of deep neural network...
Deploying deep learning models on various devices has become an important topic. The wave of hardwar...
Auto-scheduling for tensor programs is a process where a search algorithm automatically explores can...
Training and inference efficiency of deep neural networks highly rely on the performance of tensor o...
Memory efficiency is crucial in training deep learning networks on resource-restricted devices. Duri...
The emergence of deep learning has launched many works in deep learning accelerators. To fully reali...
Automatic optimization for tensor programs becomes increasingly important as we deploy deep learning...
Popular Machine Learning (ML) and High Performance Computing (HPC) workloads contribute to a signifi...
Deep Neural Networks (DNNs) have proved to be a convenient and powerful tool for a wide range of p...
A standard hardware bottleneck when training deep neural networks is GPU memory. The bulk of memory ...
140 pages. Tensor algebra lives at the heart of big data applications. Where classical machine learnin...