Graph Convolutional Networks (GCNs) have shown great results but come with large computation costs and memory overhead. Recently, sampling-based approaches have been proposed to alter input sizes, which allows large GCN workloads to align to hardware constraints. Motivated by this flexibility, this thesis proposes an FPGA-based GCN accelerator, along with a novel sparse matrix format and multiple software-hardware co-optimizations to improve training efficiency. First, all feature and adjacency matrices of GCN are quantized from 32-bit floating point to 16-bit signed integers. Next, the non-linear operations are simplified to better fit the FPGA computation, and reusable intermediate results are identified and stored to eliminate redundant ...
Convolution Neural Network (CNN) is a special kind of neural network that is inspired by the behavio...
Graph neural networks (GNNs) are emerging as a powerful technique for modeling graph structures. Due...
This work proposes a novel reconfigurable architecture for reducing the latency of JEDI-net, a Graph...
Graph convolutional networks (GCNs) have demonstrated their excellent algorithmic performance in var...
Over recent years, deep learning paradigms such as convolutional neural networks (CNNs) have shown g...
The parallel nature of FPGA makes it a promising candidate to accelerate machine learning tasks. The...
This thesis explores Convolutional Neural Network (CNN) inference accelerator architecture for FPGAs...
High computational complexity and large memory footprint hinder the adoption of convolution neural n...
Graph neural networks have become increasingly popular in recent years due to their ability to natur...
2018-10-16Graph analytics has drawn much research interest because of its broad applicability from m...
The rapid advancement of Artificial intelligence (AI) is making our everyday life easier with smart ...
In recent years deep learning algorithms have shown extremely high performance on machine learning t...
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a wide range of ...
Convolutional Deep Neural Networks (DNNs) are reported to show outstanding recognition performance i...
In the past two years, various graph convolution neural networks (GCNs) accelerators have emerged, e...
Convolution Neural Network (CNN) is a special kind of neural network that is inspired by the behavio...
Graph neural networks (GNNs) are emerging as a powerful technique for modeling graph structures. Due...
This work proposes a novel reconfigurable architecture for reducing the latency of JEDI-net, a Graph...
Graph convolutional networks (GCNs) have demonstrated their excellent algorithmic performance in var...
Over recent years, deep learning paradigms such as convolutional neural networks (CNNs) have shown g...
The parallel nature of FPGA makes it a promising candidate to accelerate machine learning tasks. The...
This thesis explores Convolutional Neural Network (CNN) inference accelerator architecture for FPGAs...
High computational complexity and large memory footprint hinder the adoption of convolution neural n...
Graph neural networks have become increasingly popular in recent years due to their ability to natur...
2018-10-16Graph analytics has drawn much research interest because of its broad applicability from m...
The rapid advancement of Artificial intelligence (AI) is making our everyday life easier with smart ...
In recent years deep learning algorithms have shown extremely high performance on machine learning t...
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a wide range of ...
Convolutional Deep Neural Networks (DNNs) are reported to show outstanding recognition performance i...
In the past two years, various graph convolution neural networks (GCNs) accelerators have emerged, e...
Convolution Neural Network (CNN) is a special kind of neural network that is inspired by the behavio...
Graph neural networks (GNNs) are emerging as a powerful technique for modeling graph structures. Due...
This work proposes a novel reconfigurable architecture for reducing the latency of JEDI-net, a Graph...