Accelerating and scaling the training of deep neural networks (DNNs) is critical to keep up with growing datasets, reduce training times, and enable training on memory-constrained problems where parallelism is necessary. In this thesis, I present a set of techniques that can leverage large high-performance computing systems for fast training of DNNs. I first introduce a suite of algorithms to exploit additional parallelism in convolutional layers when training, expanding beyond the standard sample-wise data-parallel approach to include spatial parallelism and channel and filter parallelism. Next, I present optimizations to communication frameworks to reduce communication overheads at large scales. Finally, I discuss communication quantizati...
With increasing data and model complexities, the time required to train neural networks has become p...
The big-data is an oil of this century. A high amount of computational power is required to get know...
Deep neural network models are commonly used in various real-life applications due to their high pre...
Accelerating and scaling the training of deep neural networks (DNNs) is critical to keep up with gro...
Deep Neural Network (DNN) frameworks use distributed training to enable faster time to convergence a...
Deep Neural Network (DNN) frameworks use distributed training to enable faster time to convergence a...
The rapid growth of data and ever increasing model complexity of deep neural networks (DNNs) have en...
Distributed deep learning becomes very common to reduce the overall training time by exploiting mult...
In recent years, machine learning (ML) and, more noticeably, deep learning (DL), have be- come incre...
In recent years, machine learning (ML) and, more noticeably, deep learning (DL), have be- come incre...
Deep Neural Networks (DNNs) enable computers to excel across many different applications such as ima...
Graph Convolutional Networks (GCNs) are extensively utilized for deep learning on graphs. The large ...
Graph Convolutional Networks (GCNs) are extensively utilized for deep learning on graphs. The large ...
Deep neural networks (DNNs) have led to significant advancements in machine learning. With deep str...
Convolutional Neural Networks (CNNs) have shown to be powerful classi cation tools in tasks that ra...
With increasing data and model complexities, the time required to train neural networks has become p...
The big-data is an oil of this century. A high amount of computational power is required to get know...
Deep neural network models are commonly used in various real-life applications due to their high pre...
Accelerating and scaling the training of deep neural networks (DNNs) is critical to keep up with gro...
Deep Neural Network (DNN) frameworks use distributed training to enable faster time to convergence a...
Deep Neural Network (DNN) frameworks use distributed training to enable faster time to convergence a...
The rapid growth of data and ever increasing model complexity of deep neural networks (DNNs) have en...
Distributed deep learning becomes very common to reduce the overall training time by exploiting mult...
In recent years, machine learning (ML) and, more noticeably, deep learning (DL), have be- come incre...
In recent years, machine learning (ML) and, more noticeably, deep learning (DL), have be- come incre...
Deep Neural Networks (DNNs) enable computers to excel across many different applications such as ima...
Graph Convolutional Networks (GCNs) are extensively utilized for deep learning on graphs. The large ...
Graph Convolutional Networks (GCNs) are extensively utilized for deep learning on graphs. The large ...
Deep neural networks (DNNs) have led to significant advancements in machine learning. With deep str...
Convolutional Neural Networks (CNNs) have shown to be powerful classi cation tools in tasks that ra...
With increasing data and model complexities, the time required to train neural networks has become p...
The big-data is an oil of this century. A high amount of computational power is required to get know...
Deep neural network models are commonly used in various real-life applications due to their high pre...