This work presents CascadeCNN, an automated toolflow that pushes the quantisation limits of any given CNN model, aiming to perform high-throughput inference. A two-stage architecture tailored for any given CNN-FPGA pair is generated, consisting of a low-and high-precision unit in a cascade. A confidence evaluation unit is employed to identify misclassified cases from the excessively low-precision unit and forward them to the high-precision unit for re-processing. Experiments demonstrate that the proposed toolflow can achieve a performance boost up to 55% for VGG-16 and 48% for AlexNet over the baseline design for the same resource budget and accuracy, without the need of retraining the model or accessing the training data
International audienceThe wide landscape of memory-hungry and compute-intensive Convolutional Neural...
In the past decade, Convolutional Neural Networks (CNNs) have demonstrated state-of-the-art performa...
Arithmetic precision scaling is mandatory to deploy Convolutional Neural Networks (CNNs) on resource...
Convolutional Neural Networks constitute a promi-nent AI model for classification tasks, serving a b...
Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of proble...
Due to the huge success and rapid development of convolutional neural networks (CNNs), there is a gr...
This thesis explores Convolutional Neural Network (CNN) inference accelerator architecture for FPGAs...
In the past decade, research has shown that CNN inference can be considerably sped up via dedicated ...
Large-scale convolutional neural networks (CNNs) suffer from very long training times, spanning from...
Convolutional Neural Network (CNN) inference has gained a significant amount of traction for perform...
Modern deep Convolutional Neural Networks (CNNs) are computationally demanding, yet real application...
abstract: The rapid improvement in computation capability has made deep convolutional neural network...
The development of machine learning has made a revolution in various applications such as object det...
Convolutional neural networks (CNN) are state of the art machine learning models used for various co...
Over recent years, deep learning paradigms such as convolutional neural networks (CNNs) have shown g...
International audienceThe wide landscape of memory-hungry and compute-intensive Convolutional Neural...
In the past decade, Convolutional Neural Networks (CNNs) have demonstrated state-of-the-art performa...
Arithmetic precision scaling is mandatory to deploy Convolutional Neural Networks (CNNs) on resource...
Convolutional Neural Networks constitute a promi-nent AI model for classification tasks, serving a b...
Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of proble...
Due to the huge success and rapid development of convolutional neural networks (CNNs), there is a gr...
This thesis explores Convolutional Neural Network (CNN) inference accelerator architecture for FPGAs...
In the past decade, research has shown that CNN inference can be considerably sped up via dedicated ...
Large-scale convolutional neural networks (CNNs) suffer from very long training times, spanning from...
Convolutional Neural Network (CNN) inference has gained a significant amount of traction for perform...
Modern deep Convolutional Neural Networks (CNNs) are computationally demanding, yet real application...
abstract: The rapid improvement in computation capability has made deep convolutional neural network...
The development of machine learning has made a revolution in various applications such as object det...
Convolutional neural networks (CNN) are state of the art machine learning models used for various co...
Over recent years, deep learning paradigms such as convolutional neural networks (CNNs) have shown g...
International audienceThe wide landscape of memory-hungry and compute-intensive Convolutional Neural...
In the past decade, Convolutional Neural Networks (CNNs) have demonstrated state-of-the-art performa...
Arithmetic precision scaling is mandatory to deploy Convolutional Neural Networks (CNNs) on resource...