Convolutional neural networks [3] have proven useful in many domains, including computer vision [1, 4, 5], audio processing [6, 7] and natural language processing [8]. These powerful models come at great cost in training time, however. Currently, long training periods make experimentation difficult and time consuming.

In this work, we consider a standard architecture [1] trained on the ImageNet dataset [2] for classification and investigate methods to speed convergence by parallelizing training across multiple GPUs. Our experiments use up to 4 NVIDIA TITAN GPUs with 6GB of RAM each. While our experiments are performed on a single server, our GPUs have disjoint memory spaces, and just as in the distributed setting, communication overheads are an important consideration.
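To make the setting concrete, the sketch below shows synchronous data-parallel SGD, the simplest way to parallelize training across GPUs with disjoint memory: each GPU holds a full replica of the model, computes gradients on its own shard of the mini-batch, and all replicas average their gradients before every weight update. This is a minimal illustrative sketch in PyTorch, not the specific scheme developed in this paper; the function name train_step and the surrounding setup are assumptions, and it presumes the process group has already been initialized (e.g., via dist.init_process_group).

import torch
import torch.distributed as dist

def train_step(model, loss_fn, inputs, targets, lr):
    # One synchronous data-parallel step on this worker's batch shard.
    # Assumes `model` is an identical replica on every worker.
    model.zero_grad()
    loss = loss_fn(model(inputs), targets)
    loss.backward()
    # Average gradients across workers. This all-reduce is the
    # communication overhead: each step moves a gradient the size
    # of the entire model between GPUs.
    world_size = dist.get_world_size()
    for p in model.parameters():
        dist.all_reduce(p.grad, op=dist.ReduceOp.SUM)
        p.grad.div_(world_size)
    # Plain SGD update; every replica applies the same averaged
    # gradient, so the replicas stay synchronized.
    with torch.no_grad():
        for p in model.parameters():
            p.add_(p.grad, alpha=-lr)
    return loss.item()

Whether such a scheme pays off depends on the ratio of per-step computation to the volume of gradient traffic; with disjoint GPU memory spaces, that traffic is precisely the communication overhead referred to above.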