International audienceA wide range of scientific and machine learning applications depend on highly optimized implementations of tensor computations. Exploiting the full capacity of a given processor architecture remains a challenging task, due to the complexity of the microarchitectural features that come into play when seeking near-peak performance. Among the state-of-the-art techniques for loop transformations for performance optimization, AutoScheduler [34] tends to outperform other systems. It often yields higher performance as compared to vendor libraries, but takes a large number of runs to converge, while also involving a complex training environment. In this paper, we define a structured configuration space that enables much faster...
Automatic optimization for tensor programs becomes increasingly important as we deploy deep learning...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
Deep learning algorithms are gaining popularity in autonomous systems. These systems typically have ...
International audienceA wide range of scientific and machine learning applications depend on highly ...
Optimizing the implementation of tensor computations is essential to exploiting the full capacity of...
Today, scientific computing plays an important role in scientific research. People build supercomput...
Auto-Tuning DL compilers are gaining ground as an optimizing back-end for DL frameworks. While exist...
Auto-scheduling for tensor programs is a process where a search algorithm automatically explores can...
In this article, a new method is provided for accelerating the execution of convolution layers in De...
At the heart of deep learning training and inferencing are computationally intensive primitives such...
High-performance tensor programs are crucial to guarantee efficient execution of deep neural network...
Data-intensive programs deal with big chunks of data and often contain compute-intensive characteris...
In high-performance computing, excellent node-level performance is required for the efficient use of...
The emergence of deep learning has launched many works in deep learning accelerators. To fully reali...
Deep Neural Networks (DNNs) have revolutionized many aspects of our lives. The use of DNNs is becomi...
Automatic optimization for tensor programs becomes increasingly important as we deploy deep learning...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
Deep learning algorithms are gaining popularity in autonomous systems. These systems typically have ...
International audienceA wide range of scientific and machine learning applications depend on highly ...
Optimizing the implementation of tensor computations is essential to exploiting the full capacity of...
Today, scientific computing plays an important role in scientific research. People build supercomput...
Auto-Tuning DL compilers are gaining ground as an optimizing back-end for DL frameworks. While exist...
Auto-scheduling for tensor programs is a process where a search algorithm automatically explores can...
In this article, a new method is provided for accelerating the execution of convolution layers in De...
At the heart of deep learning training and inferencing are computationally intensive primitives such...
High-performance tensor programs are crucial to guarantee efficient execution of deep neural network...
Data-intensive programs deal with big chunks of data and often contain compute-intensive characteris...
In high-performance computing, excellent node-level performance is required for the efficient use of...
The emergence of deep learning has launched many works in deep learning accelerators. To fully reali...
Deep Neural Networks (DNNs) have revolutionized many aspects of our lives. The use of DNNs is becomi...
Automatic optimization for tensor programs becomes increasingly important as we deploy deep learning...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
Deep learning algorithms are gaining popularity in autonomous systems. These systems typically have ...