Recently, there has been a push to perform deep learning (DL) computations on the edge rather than in the cloud due to latency, network connectivity, energy consumption, and privacy concerns. However, state-of-the-art deep neural networks (DNNs) require vast amounts of computational power, data, and energy, all of which are limited on edge devices. This limitation has motivated the design of domain-specific architectures (DSAs) that implement DL-specific hardware optimizations. Traditionally, DNNs have run on 32-bit floating-point numbers; however, a body of research has shown that DNNs are surprisingly robust and do not require all 32 bits. Instead, using quantization, networks can run at extremely low bit widths (1-8 bits) with fair accurac...
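To make the quantization idea above concrete, here is a minimal sketch (not drawn from any of the cited works) of symmetric uniform quantization of a weight tensor to a configurable bit width; the function name `uniform_quantize` and its parameters are illustrative assumptions, not an API from the papers.

```python
import numpy as np

def uniform_quantize(weights: np.ndarray, bits: int = 8):
    """Symmetric uniform quantization of a float32 tensor to `bits`-bit integers.

    Minimal sketch for illustration: one scale per tensor, no zero point,
    no quantization-aware training. Assumes 2 <= bits <= 8.
    """
    assert 2 <= bits <= 8, "sketch supports 2-8 bit quantization"
    qmax = 2 ** (bits - 1) - 1                  # e.g. 127 for 8 bits
    scale = np.max(np.abs(weights)) / qmax      # map the largest |w| to qmax
    q = np.clip(np.round(weights / scale), -qmax, qmax).astype(np.int8)
    return q, scale

# Dequantizing q.astype(np.float32) * scale recovers an approximation of weights.
```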
The severe on-chip memory limitations are currently preventing the deployment of the most acc...
Over the last ten years, the rise of deep learning has redefined the state-of-the-art in many comput...
Quantized neural networks are well known for reducing latency, power consumption, and model size wit...
Model quantization is a widely used technique to compress and accelerate deep neural netwo...
The large computing and memory cost of deep neural networks (DNNs) often precludes their use in reso...
Deep Neural Network (DNN) inference based on quantized narrow-precision integer data represents a pr...
The deployment of Quantized Neural Networks (QNN) on advanced microcontrollers requires optimized so...
Deep Neural Network (DNN) models are now commonly used to automate and optimize complicated tasks in...
Mixed-precision quantization, where a deep neural network's layers are quantized to different precis...
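As a concrete illustration of the mixed-precision idea, the snippet below estimates the memory saved when different layers are assigned different bit widths; the layer names, parameter counts, and bit widths are assumed for the sketch and are not figures from the cited work.

```python
# Hypothetical per-layer bit-width assignment for mixed-precision quantization.
# Layer names, parameter counts, and bit widths are illustrative only.
layer_params = {"conv1": 9_408, "block1": 147_456, "block2": 589_824, "fc": 512_000}
layer_bits   = {"conv1": 8,     "block1": 4,       "block2": 2,       "fc": 8}

fp32_bytes  = sum(4 * n for n in layer_params.values())  # 32-bit float baseline
mixed_bytes = sum(layer_bits[name] * n / 8 for name, n in layer_params.items())

print(f"fp32 weights:            {fp32_bytes / 1e6:.2f} MB")
print(f"mixed-precision weights: {mixed_bytes / 1e6:.2f} MB "
      f"({fp32_bytes / mixed_bytes:.1f}x smaller)")
```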
With the surging popularity of edge computing, the need to efficiently perform neural network infere...
Deep neural networks (DNNs) are a key technology nowadays and the main driving factor for many recen...
Machine learning, and specifically Deep Neural Networks (DNNs) impact all parts of daily life. Altho...
Quantization of deep neural networks is a common way to optimize the networks for deployment on ener...