Abstract We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with convolutional layers on field-programmable gate arrays (FPGAs). By extending the hls4ml library, we demonstrate an inference latency of 5 µs using convolutional architectures, targeting microsecond latency applications like those at the CERN Large Hadron Collider. Considering benchmark models trained on the Street View House Numbers Dataset, we demonstrate various methods for model compression in order to fit the computational constraints of a typical FPGA device used in trigger and data acquisition systems of particle detectors. In particular, we discuss pruning and quantization-aware training, and demonstrate how re...
Thesis (Master's)--University of Washington, 2021Field programmable gate arrays (FPGAs) offer a flex...
Graph neural networks have been shown to achieve excellent performance for several crucial tasks in ...
With the evolution of machine learning algorithms they are seeing a wider use in traditional signal ...
We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with ...
We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with ...
Recent results at the Large Hadron Collider (LHC) have pointed to enhanced physics capabilities thro...
Recent results at the Large Hadron Collider (LHC) have pointed to enhanced physics capabilities thro...
Recent results at the Large Hadron Collider (LHC) have pointed to enhanced physics capabilities thro...
Artificial neural networks are becoming a standard tool for data analysis, but their potential remai...
This thesis explores Convolutional Neural Network (CNN) inference accelerator architecture for FPGAs...
Although the quest for more accurate solutions is pushing deep learning research towards larger and ...
Machine learning methods are ubiquitous in particle physics and have proven to be very performant. O...
Abstract Resource utilization plays a crucial role for successful implementation of fast real-time i...
Resource utilization plays a crucial role for successful implementation of fast real-time inference ...
Resource utilization plays a crucial role for successful implementation of fast real-time inference ...
Thesis (Master's)--University of Washington, 2021Field programmable gate arrays (FPGAs) offer a flex...
Graph neural networks have been shown to achieve excellent performance for several crucial tasks in ...
With the evolution of machine learning algorithms they are seeing a wider use in traditional signal ...
We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with ...
We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with ...
Recent results at the Large Hadron Collider (LHC) have pointed to enhanced physics capabilities thro...
Recent results at the Large Hadron Collider (LHC) have pointed to enhanced physics capabilities thro...
Recent results at the Large Hadron Collider (LHC) have pointed to enhanced physics capabilities thro...
Artificial neural networks are becoming a standard tool for data analysis, but their potential remai...
This thesis explores Convolutional Neural Network (CNN) inference accelerator architecture for FPGAs...
Although the quest for more accurate solutions is pushing deep learning research towards larger and ...
Machine learning methods are ubiquitous in particle physics and have proven to be very performant. O...
Abstract Resource utilization plays a crucial role for successful implementation of fast real-time i...
Resource utilization plays a crucial role for successful implementation of fast real-time inference ...
Resource utilization plays a crucial role for successful implementation of fast real-time inference ...
Thesis (Master's)--University of Washington, 2021Field programmable gate arrays (FPGAs) offer a flex...
Graph neural networks have been shown to achieve excellent performance for several crucial tasks in ...
With the evolution of machine learning algorithms they are seeing a wider use in traditional signal ...