We propose a method to parallelize the training of a convolutional neural network by using a CUDA-based cluster. We attain a substantial increase in the performance of the algorithm itself. We investigate the feasibility of using batch versus online mode training and provide a performance comparison between them. Furthermore, we propose an implementation of an alternative algorithm to compute local gradients which increases the level of parallelism. To conclude, we give a set of best practices for implementing Convolutional Neural Networks on the cluster.
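As a concrete illustration of the batch-versus-online distinction that abstract compares, the following is a minimal NumPy sketch, not the paper's implementation; the linear model, synthetic data, and learning rate are assumptions for illustration. Online mode updates the weights after every sample and is inherently sequential, while batch mode first accumulates independent per-sample gradients, which is exactly the work a GPU or cluster can distribute.

```python
# Minimal sketch contrasting online (per-sample) and batch (accumulated)
# gradient-descent updates. Model, data, and learning rate are
# illustrative assumptions, not the paper's network or hyperparameters.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(64, 8))          # 64 samples, 8 features
y = X @ rng.normal(size=8)            # synthetic linear targets
lr = 0.01

def grad(w, x, t):
    """Gradient of the squared error 0.5*(x.w - t)^2 w.r.t. w."""
    return (x @ w - t) * x

# Online mode: update weights after every sample (inherently sequential).
w_online = np.zeros(8)
for x, t in zip(X, y):
    w_online -= lr * grad(w_online, x, t)

# Batch mode: accumulate gradients over the whole batch, update once.
# The per-sample gradients are independent, so this loop is what a
# GPU or cluster can evaluate in parallel before a single reduction.
w_batch = np.zeros(8)
g = np.mean([grad(w_batch, x, t) for x, t in zip(X, y)], axis=0)
w_batch -= lr * g
```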
Long training times and non-ideal performance have been a major impediment to further continuing the u...
We propose a new integrated method of exploiting model, batch and domain parallelism for the trainin...
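Of the three forms of parallelism named above, batch (data) parallelism is the easiest to sketch. The snippet below is a hedged illustration under assumed names and sizes, not the integrated method from that paper: one mini-batch is sharded across simulated workers, each computes its gradient independently, and the shard gradients are averaged so the update matches a full-batch step. Model and domain parallelism would additionally partition the network's layers and the input domain, respectively.

```python
# Sketch of the batch-parallel component: shard one mini-batch across
# simulated workers, compute per-shard gradients independently, then
# average them (the "all-reduce" step). Worker count, model, and loss
# are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(32, 4))
y = rng.normal(size=32)
w = np.zeros(4)
num_workers = 4

def shard_gradient(w, X_shard, y_shard):
    """Mean squared-error gradient over one worker's shard."""
    residual = X_shard @ w - y_shard
    return X_shard.T @ residual / len(y_shard)

# Each worker sees only its shard; with equal shard sizes the average
# of shard gradients equals the gradient over the full mini-batch.
grads = [shard_gradient(w, Xs, ys)
         for Xs, ys in zip(np.array_split(X, num_workers),
                           np.array_split(y, num_workers))]
w -= 0.05 * np.mean(grads, axis=0)
```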
Deep neural network models can achieve greater performance in numerous machine learning tasks by rai...
Convolutional neural networks (CNNs) have proven to be powerful classification tools in tasks th...
I present a new way to parallelize the training of convolutional neural networks across multiple GPU...
Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide...
Convolutional neural networks [3] have proven useful in many domains, including computer vision [1,...
We present a technique for parallelizing the training of neural networks. Our technique is designed ...
Deep learning algorithms base their success on building high learning capacity models with millions ...
Deep Neural Network (DNN) frameworks use distributed training to enable faster time to convergence a...
The purpose of this thesis is to determine the performance of convolutional neural networks in class...
Graphics Processing Units (GPUs) have been used for accelerating graphics calculations as well as...
A parallel Back-Propagation (BP) neural network training technique using Compute Unified Device Archi...
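The step such CUDA-based back-propagation schemes parallelize is the local-gradient (delta) computation: within one layer, every neuron's delta depends only on the deltas of the layer above, so a whole layer maps to independent element-wise and matrix operations, roughly one GPU thread per neuron. Below is a hedged NumPy sketch of that structure; the two-layer sigmoid network and its sizes are illustrative assumptions, not the technique's actual configuration.

```python
# Sketch of the backward-pass step that CUDA implementations map onto
# one thread per neuron: all deltas within a layer are independent
# given the layer above, so they reduce to matrix/element-wise ops.
# Layer sizes, sigmoid activation, and weights are illustrative.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(2)
x = rng.normal(size=16)               # input
W1 = rng.normal(size=(8, 16)) * 0.1   # hidden-layer weights
W2 = rng.normal(size=(4, 8)) * 0.1    # output-layer weights
t = rng.normal(size=4)                # target

# Forward pass.
z1 = W1 @ x; a1 = sigmoid(z1)
z2 = W2 @ a1; a2 = sigmoid(z2)

# Backward pass: each line is embarrassingly parallel across neurons.
delta2 = (a2 - t) * a2 * (1 - a2)          # output-layer local gradients
delta1 = (W2.T @ delta2) * a1 * (1 - a1)   # hidden-layer local gradients

# Weight gradients are outer products, again fully parallel per element.
gW2 = np.outer(delta2, a1)
gW1 = np.outer(delta1, x)
```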