This paper presents a deep learning approach which evaluates accuracy and inference time speedups in deep convolutional neural networks under various network quantizations. Quantized networks can result in much faster inference time allowing them to be deployed in real time on an embedded system such as a robot. We evaluate networks with activations quantized to 1, 2, 4, and 8-bits and binary weights. We found that network quantization can yield a significant speedup for a small drop in classification accuracy. Specifically, modifying one of our networks to use an 8-bit quantized input layer and 2-bit activations in hidden layers, we calculate a theoretical 9.9x speedup in exchange for an F1 score decrease of just 3.4% relative to a f...
Este trabalho foi financiado pelo Concurso Anual para Projetos de Investigação, Desenvolvimento, Ino...
Machine learning, and specifically Deep Neural Networks (DNNs) impact all parts of daily life. Altho...
The aim of this master thesis is modeling of neural network accelerators with HW support for quantiz...
This paper presents a deep learning approach which evaluates accuracy and inference time speedups in...
Convolutional neural networks (CNN) are state of the art machine learning models used for various co...
Deep neural networks performed greatly for many engineering problems in recent years. However, power...
Deep Learning is moving to edge devices, ushering in a new age of distributed Artificial Intelligenc...
Deep Learning is moving to edge devices, ushering in a new age of distributed Artificial Intelligenc...
Paper number 134 entitled "Evaluating the Use of Interpretable Quantized Convolutional Neural Networ...
Deep learning algorithms have seen success in a wide variety of applications, such as machine transl...
Convolutional Neural Networks (CNNs) are state-of-the-art in numerous computer vision tasks such as ...
Hardware accelerators for neural network inference can exploit common data properties for performanc...
Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of proble...
Deep convolutional neural networks (ConvNets), which are at the heart of many new emerging applicati...
Machine learning, and specifically Deep Neural Networks (DNNs) impact all parts of daily life. Altho...
Este trabalho foi financiado pelo Concurso Anual para Projetos de Investigação, Desenvolvimento, Ino...
Machine learning, and specifically Deep Neural Networks (DNNs) impact all parts of daily life. Altho...
The aim of this master thesis is modeling of neural network accelerators with HW support for quantiz...
This paper presents a deep learning approach which evaluates accuracy and inference time speedups in...
Convolutional neural networks (CNN) are state of the art machine learning models used for various co...
Deep neural networks performed greatly for many engineering problems in recent years. However, power...
Deep Learning is moving to edge devices, ushering in a new age of distributed Artificial Intelligenc...
Deep Learning is moving to edge devices, ushering in a new age of distributed Artificial Intelligenc...
Paper number 134 entitled "Evaluating the Use of Interpretable Quantized Convolutional Neural Networ...
Deep learning algorithms have seen success in a wide variety of applications, such as machine transl...
Convolutional Neural Networks (CNNs) are state-of-the-art in numerous computer vision tasks such as ...
Hardware accelerators for neural network inference can exploit common data properties for performanc...
Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of proble...
Deep convolutional neural networks (ConvNets), which are at the heart of many new emerging applicati...
Machine learning, and specifically Deep Neural Networks (DNNs) impact all parts of daily life. Altho...
Este trabalho foi financiado pelo Concurso Anual para Projetos de Investigação, Desenvolvimento, Ino...
Machine learning, and specifically Deep Neural Networks (DNNs) impact all parts of daily life. Altho...
The aim of this master thesis is modeling of neural network accelerators with HW support for quantiz...