Deep learning such as Convolutional Neural Networks (CNNs) are an important workload increasingly demanding high-performance hardware acceleration. One distinguishing feature of deep learnng workload is that it is inherently resilient to small numerical errors and works very well with low precision hardware. Thus we propose a novel method, called Double MAC, to theoretically double the computation rate of CNN accelerators by packing two multiply-and-accumulate (MAC) operations into one DSP block of off-the-shelf FPGAs. There are several technical challenges, which we overcome by exploiting the mode of operation in the CNN accelerator. We have validated our method through FPGA synthesis and Verilog simulation, and evaluated our method by app...
FPGA-based heterogeneous computing platform, due to its extreme logic reconfigurability, emerges to ...
In the past few years we have experienced an extremely rapid growth of modern applications based on ...
Stochastic computing (SC) is a promising computing paradigm that can help address both the uncertain...
This paper presents a novel method to double the computation rate of convolutional neural network (C...
Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of proble...
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a wide range of ...
Due to the huge success and rapid development of convolutional neural networks (CNNs), there is a gr...
This thesis presents the results of an architectural study on the design of FPGA- based architecture...
Deep Learning (DL) has become best-in-class for numerous applications but at a high computational co...
Modern deep Convolutional Neural Networks (CNNs) are computationally demanding, yet real application...
Over recent years, deep learning paradigms such as convolutional neural networks (CNNs) have shown g...
Convolutional neural networks (CNNs) are one of the most successful machine-learning techniques for ...
Convolutional Neural Networks (CNNs) are a nature-inspired model, extensively employed in a broad ra...
The increasing use of machine learning algorithms, such as Convolutional Neural Networks (CNNs), mak...
Convolutional Neural Network (CNN) is a deep learning algorithm extended from Artificial Neural Netw...
FPGA-based heterogeneous computing platform, due to its extreme logic reconfigurability, emerges to ...
In the past few years we have experienced an extremely rapid growth of modern applications based on ...
Stochastic computing (SC) is a promising computing paradigm that can help address both the uncertain...
This paper presents a novel method to double the computation rate of convolutional neural network (C...
Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of proble...
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a wide range of ...
Due to the huge success and rapid development of convolutional neural networks (CNNs), there is a gr...
This thesis presents the results of an architectural study on the design of FPGA- based architecture...
Deep Learning (DL) has become best-in-class for numerous applications but at a high computational co...
Modern deep Convolutional Neural Networks (CNNs) are computationally demanding, yet real application...
Over recent years, deep learning paradigms such as convolutional neural networks (CNNs) have shown g...
Convolutional neural networks (CNNs) are one of the most successful machine-learning techniques for ...
Convolutional Neural Networks (CNNs) are a nature-inspired model, extensively employed in a broad ra...
The increasing use of machine learning algorithms, such as Convolutional Neural Networks (CNNs), mak...
Convolutional Neural Network (CNN) is a deep learning algorithm extended from Artificial Neural Netw...
FPGA-based heterogeneous computing platform, due to its extreme logic reconfigurability, emerges to ...
In the past few years we have experienced an extremely rapid growth of modern applications based on ...
Stochastic computing (SC) is a promising computing paradigm that can help address both the uncertain...