Acceleration of machine learning models is proving to be an important application for FPGAs. Unfortunately, debugging such models during training or inference is difficult. Software simulations of a machine learning system may be of insufficient detail to provide meaningful debug insight, or may require infeasibly long run-times. Thus, it is often desirable to debug the accelerated model while it is running on real hardware. Effective on-chip debug often requires instrumenting a design with additional circuitry to store run-time data, consuming valuable chip resources. Previous work has developed methods to perform lossy compression of signals by exploiting machine learning specific knowledge, thereby increasing the amount of debug context ...
Design verification is an essential step in the development of any product. It ensures that the prod...
This paper presents Lane Compression, a lightweight lossless compression technique for machine learn...
This thesis introduces novel frameworks for automated customization of two classes of machine learni...
Recent years have seen a dramatic increase in the use of hardware accelerators to perform machine le...
Summarization: In this work we consider the slow and tedious phase of hardware debugging in FPGAs. T...
The capacity of on-chip trace buffers employed for embedded logic analysis limits the observation wi...
The size of on-chip trace buffers used for at-speed sili-con debug limits the observation window in ...
Electronic devices make up a vital part of our lives. These are seen from mobiles, laptops, computer...
Electronic devices have come to permeate every aspect of our daily lives, and at the heart of each d...
ABSTRACT In the multicore era, capturing execution traces of processors is indispensable to debuggin...
High-Level Synthesis (HLS) has emerged as a promising technology that allows designers to create a d...
abstract: Machine learning is a powerful tool for processing and understanding the vast amounts of d...
Parallel hardware accelerators, for example Graphics Processor Units, have limited on-chip memory ca...
One of the key challenges in deploying ML models on embedded systems are the numerous resource const...
130 pagesOver the past decade, machine learning (ML) with deep neural networks (DNNs) has become ext...
Design verification is an essential step in the development of any product. It ensures that the prod...
This paper presents Lane Compression, a lightweight lossless compression technique for machine learn...
This thesis introduces novel frameworks for automated customization of two classes of machine learni...
Recent years have seen a dramatic increase in the use of hardware accelerators to perform machine le...
Summarization: In this work we consider the slow and tedious phase of hardware debugging in FPGAs. T...
The capacity of on-chip trace buffers employed for embedded logic analysis limits the observation wi...
The size of on-chip trace buffers used for at-speed sili-con debug limits the observation window in ...
Electronic devices make up a vital part of our lives. These are seen from mobiles, laptops, computer...
Electronic devices have come to permeate every aspect of our daily lives, and at the heart of each d...
ABSTRACT In the multicore era, capturing execution traces of processors is indispensable to debuggin...
High-Level Synthesis (HLS) has emerged as a promising technology that allows designers to create a d...
abstract: Machine learning is a powerful tool for processing and understanding the vast amounts of d...
Parallel hardware accelerators, for example Graphics Processor Units, have limited on-chip memory ca...
One of the key challenges in deploying ML models on embedded systems are the numerous resource const...
130 pagesOver the past decade, machine learning (ML) with deep neural networks (DNNs) has become ext...
Design verification is an essential step in the development of any product. It ensures that the prod...
This paper presents Lane Compression, a lightweight lossless compression technique for machine learn...
This thesis introduces novel frameworks for automated customization of two classes of machine learni...