Artifact for the paper "Near-Optimal Sparse Allreduce for Distributed Deep Learning", published in PPoPP'22
The potential to solve complex problems along with the performance that deep learning offers has mad...
Decentralized learning algorithms empower interconnected devices to share data and computational res...
This artifact generates figures of the submitted draft of “Register Tiling for Unstructured Sparsity...
Communication overhead is one of the major obstacles to train large deep learning models at scale. G...
The artifact for the paper Sequential Reasoning for Optimizing Compilers Under Weak Memory Concurren...
Artifact for the paper titled "Multicore Parallelism in Permanence-based Community Detection Algorit...
In data-parallel optimization of machine learning models, workers collaborate to improve their estim...
This is the research artifact for the SC23 paper "Unified Communication Optimization Strategies for ...
This thesis presents a few methods to accelerate the inference of Deep Neural Networks that are lar...
This archive includes source code and benchmarks for the paper: "G-Sparse: Compiler-Driven Acceleration...
We propose FlexReduce, an efficient and flexible all-reduce algorithm for distributed deep learning ...
The success of deep learning may be attributed in large part to remarkable growth in the size and co...
This is the artifact that accompanies the paper "Visibility Algorithms for Dynamic Dependence Analys...
This thesis proposes parallel and distributed algorithms for solving very large-scale sparse optimiza...