This thesis presents several methods to accelerate the inference of large, sparse Deep Neural Networks on GPUs. Deep Neural Networks are now widely used in applications across many fields, such as computer vision and speech recognition. They tend to be more accurate when the model is larger, with more layers and neurons, but a larger model is costly to transfer and to store in limited fast memory, and the additional computation slows inference. The first problem can be addressed by sparse networks of comparable accuracy that contain fewer weights and are therefore smaller, and this thesis intends...
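The storage saving described above comes from keeping only the nonzero weights. As a minimal illustration (not the thesis's actual method), the sketch below stores a pruned weight matrix in the standard compressed sparse row (CSR) format and uses it for a matrix-vector product; the function names are hypothetical.

```python
import numpy as np

def dense_to_csr(W):
    """Convert a dense weight matrix to CSR arrays (values, column indices, row pointers)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in W:
        for j, w in enumerate(row):
            if w != 0.0:
                values.append(w)
                col_idx.append(j)
        row_ptr.append(len(values))  # end of this row in the values array
    return np.array(values), np.array(col_idx), np.array(row_ptr)

def csr_matvec(values, col_idx, row_ptr, x):
    """Compute y = W @ x using only the stored nonzeros."""
    y = np.zeros(len(row_ptr) - 1)
    for i in range(len(y)):
        start, end = row_ptr[i], row_ptr[i + 1]
        y[i] = values[start:end] @ x[col_idx[start:end]]
    return y

# A 4x4 layer with 75% of its weights pruned to zero:
# only 4 values (plus index arrays) are stored instead of 16.
W = np.array([[0, 2, 0, 0],
              [0, 0, 0, 3],
              [1, 0, 0, 0],
              [0, 0, 4, 0]], dtype=float)
vals, cols, ptrs = dense_to_csr(W)
x = np.ones(4)
assert np.allclose(csr_matvec(vals, cols, ptrs, x), W @ x)
```

On a GPU, the same idea underlies sparse kernels such as SpMV/SpMM, where the irregular memory access pattern of the index arrays is what makes efficient acceleration challenging.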
Deep neural networks have proven to be particularly effective in visual and audio recognition tasks....
Neural networks become harder and slower to train as their depth increases. As deep neur...
Recently, sparse training methods have started to be established as a de facto approach for training...
Convolutional neural networks (CNNs) are often pruned to achieve faster training and inference speed...
In trained deep neural networks, unstructured pruning can reduce redundant weights to lower storage ...
This work is focused on the pruning of some convolutional neural networks (CNNs) and improving their...
Artificial neural networks (ANNs) have emerged as a hot topic in the research community. Despite the ...
In recent years, Deep Neural Networks (DNNs) have become an area of high interest due to their ground...
Machine learning has been widely used in various application domains such as recommendation, compute...
Over the past few years, deep neural networks have been at the center of attention in machine learn...
Hardware accelerators for neural network inference can exploit common data properties for performanc...
High computational complexity and a large memory footprint hinder the adoption of convolutional neural n...
Doctor of Philosophy, Department of Computer Science, Arslan Munir. Deep neural networks (DNNs) have gaine...
Accelerating and scaling the training of deep neural networks (DNNs) is critical to keep up with gro...
Machine learning has achieved great success in recent years, especially the deep learning algorithms...