International audienceDeep learning frameworks automate the deployment, distribution, synchronization, memory allocation, andhardware acceleration of models represented as graphs of computational operators. These operators wraphigh-performance libraries such as cuDNN or NNPACK. When the computation does not match any prede-fined library call, custom operators must be implemented, often at high engineering cost and performancepenalty, limiting the pace of innovation. To address this productivity gap, we propose and evaluate: (1) adomain-specific language with a tensor notation close to the mathematics of deep learning; (2) a Just-In-Time optimizing compiler based on the polyhedral framework; (3) carefully coordinated linear optimizationand e...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide...
At the heart of deep learning training and inferencing are computationally intensive primitives such...
We present a library that provides optimized implementations for deep learning primitives. Deep lear...
Computational intensive applications such as pattern recognition, and natural language processing, a...
peer reviewedWith renewed global interest for Artificial Intelligence (AI) methods, the past decade ...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
Deep Neural Networks (DNNs) have revolutionized many aspects of our lives. The use of DNNs is becomi...
Computers are powerful tools which perform fast, accurate calculations over huge sets of data. Howev...
International audienceAutomatic parallel code generation from high-level abstractions such as those ...
Rapid progress in deep learning is leading to a diverse set of quickly changing models, with a drama...
Neural networks stand out from artificial intelligence because they can complete challenging tasks, ...
Machine Learning (ML) frameworks are tools that facilitate the development and deployment of ML mode...
International audienceGeometric methods rely on tensors that can be encoded using a symbolic formula...
Deep learning frameworks optimize the computation graphs and intra-operator computations to boost th...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide...
At the heart of deep learning training and inferencing are computationally intensive primitives such...
We present a library that provides optimized implementations for deep learning primitives. Deep lear...
Computational intensive applications such as pattern recognition, and natural language processing, a...
peer reviewedWith renewed global interest for Artificial Intelligence (AI) methods, the past decade ...
Thesis (Ph.D.)--University of Washington, 2022As the scaling and performance demands for deep learni...
Deep Neural Networks (DNNs) have revolutionized many aspects of our lives. The use of DNNs is becomi...
Computers are powerful tools which perform fast, accurate calculations over huge sets of data. Howev...
International audienceAutomatic parallel code generation from high-level abstractions such as those ...
Rapid progress in deep learning is leading to a diverse set of quickly changing models, with a drama...
Neural networks stand out from artificial intelligence because they can complete challenging tasks, ...
Machine Learning (ML) frameworks are tools that facilitate the development and deployment of ML mode...
International audienceGeometric methods rely on tensors that can be encoded using a symbolic formula...
Deep learning frameworks optimize the computation graphs and intra-operator computations to boost th...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide...
At the heart of deep learning training and inferencing are computationally intensive primitives such...