Deep Neural Networks (DNNs) computation-hungry algorithms demand hardware platforms capable of meeting rigid power and timing requirements. We introduce the Serial-MAC-engine (SMAC-engine), a fully-digital hardware accelerator for inference of quantized DNNs suitable for integration in a heterogeneous System-on-Chip (SoC). The accelerator is completely embedded in the form of a Hardware Processing Engine (HWPE) in the PULPissimo platform, a RISCV-based programmable architecture that targets the computational requirements of IoT applications. The SMAC-engine supports configurable precision for both weights (8/6/4 bits) and activations (8/4 bits), with scalable performance. Results in 65 nm technology demonstrate that the serial-MAC approach ...
RISC-V is an open-source instruction set and now has been examined as a universal standard to unify ...
The current trend for deep learning has come with an enormous computational need for billions of Mul...
The use of neural networks, machine learning, or artificial intelligence, in its broadest and most c...
Deep Neural Networks (DNNs) computation-hungry algorithms demand hardware platforms capable of meeti...
Heavily quantized fixed-point arithmetic is becoming a common approach to deploy Convolutional Neura...
The size of neural networks in deep learning techniques is increasing and varies significantly accor...
Deep neural networks (DNN) are achieving state-of-the-art performance in many artificial intelligenc...
Over the last ten years, the rise of deep learning has redefined the state-of-the-art in many comput...
As AI applications become more prevalent and powerful, the performance of deep learning neural netwo...
The current trend for deep learning has come with an enormous computational need for billions of Mul...
AI and deep learning are experiencing explosive growth in almost every domain involving analysis of ...
With the rapid development of the Internet of things (IoT), networks, software, and computing platfo...
Machine Learning (ML) functions are becoming ubiquitous in latency- and privacy-sensitive IoT applic...
On-chip DNN inference and training at the Extreme-Edge (TinyML) impose strict latency, throughput, a...
International audienceAs the depth of DNN increases, the need for DNN calculations for the storage a...
RISC-V is an open-source instruction set and now has been examined as a universal standard to unify ...
The current trend for deep learning has come with an enormous computational need for billions of Mul...
The use of neural networks, machine learning, or artificial intelligence, in its broadest and most c...
Deep Neural Networks (DNNs) computation-hungry algorithms demand hardware platforms capable of meeti...
Heavily quantized fixed-point arithmetic is becoming a common approach to deploy Convolutional Neura...
The size of neural networks in deep learning techniques is increasing and varies significantly accor...
Deep neural networks (DNN) are achieving state-of-the-art performance in many artificial intelligenc...
Over the last ten years, the rise of deep learning has redefined the state-of-the-art in many comput...
As AI applications become more prevalent and powerful, the performance of deep learning neural netwo...
The current trend for deep learning has come with an enormous computational need for billions of Mul...
AI and deep learning are experiencing explosive growth in almost every domain involving analysis of ...
With the rapid development of the Internet of things (IoT), networks, software, and computing platfo...
Machine Learning (ML) functions are becoming ubiquitous in latency- and privacy-sensitive IoT applic...
On-chip DNN inference and training at the Extreme-Edge (TinyML) impose strict latency, throughput, a...
International audienceAs the depth of DNN increases, the need for DNN calculations for the storage a...
RISC-V is an open-source instruction set and now has been examined as a universal standard to unify ...
The current trend for deep learning has come with an enormous computational need for billions of Mul...
The use of neural networks, machine learning, or artificial intelligence, in its broadest and most c...