The use of a wide Single-Instruction-Multiple-Data (SIMD) architecture is a promising approach to build energy-efficient high performance embedded processors. In this paper, based on our design framework for low-power SIMD processors, we propose a multiply-accumulate (MAC) unit with variable number of accumulator registers. The proposed MAC unit exploits both the merits of merged operation and register tiling. A Convolutional Neural Network (CNN) is a popular learning based algorithm due to its flexibility and high accuracy. However, a CNN-based application is often computationally intensive as it applies convolution operations extensively on a large data set. In this work, a CNN-based intelligent learning application is analyzed and mapped...
This paper presents a novel method to double the computation rate of convolutional neural network (C...
Heavily quantized fixed-point arithmetic is becoming a common approach to deploy Convolutional Neura...
Convolution Neural Networks (CNN) are used in many applications ranging from real-time object detect...
The use of a wide Single-Instruction-Multiple-Data (SIMD) architecture is a promising approach to bu...
Single-Instruction-Multiple-Data (SIMD) architectures, which exploit data-level parallelism (DLP), a...
Energy efficiency has become one of the most important topics in computing. To meet the ever increas...
International audienceMachine learning algorithms are compute-and memory-intensive. Their execution ...
Convolutional neural networks (CNNs) are one of the most successful machine-learning techniques for ...
Deep convolutional neural networks (CNNs) have shown strong abilities in the application of artifici...
Convolutional Neural Network (CNN) has attained high accuracy and it has been widely employed in ima...
Machine Learning is finding applications in a wide variety of areas ranging from autonomous cars to ...
Convolutional Neural Networks (CNNs) are hierarchical biologically-inspired models that may be taugh...
The Winograd or Cook-Toom class of algorithms help to reduce the overall compute complexity of many ...
This thesis presents the results of an architectural study on the design of FPGA- based architecture...
As machine learning algorithms play an ever increasing role in today's technology, more demands are ...
This paper presents a novel method to double the computation rate of convolutional neural network (C...
Heavily quantized fixed-point arithmetic is becoming a common approach to deploy Convolutional Neura...
Convolution Neural Networks (CNN) are used in many applications ranging from real-time object detect...
The use of a wide Single-Instruction-Multiple-Data (SIMD) architecture is a promising approach to bu...
Single-Instruction-Multiple-Data (SIMD) architectures, which exploit data-level parallelism (DLP), a...
Energy efficiency has become one of the most important topics in computing. To meet the ever increas...
International audienceMachine learning algorithms are compute-and memory-intensive. Their execution ...
Convolutional neural networks (CNNs) are one of the most successful machine-learning techniques for ...
Deep convolutional neural networks (CNNs) have shown strong abilities in the application of artifici...
Convolutional Neural Network (CNN) has attained high accuracy and it has been widely employed in ima...
Machine Learning is finding applications in a wide variety of areas ranging from autonomous cars to ...
Convolutional Neural Networks (CNNs) are hierarchical biologically-inspired models that may be taugh...
The Winograd or Cook-Toom class of algorithms help to reduce the overall compute complexity of many ...
This thesis presents the results of an architectural study on the design of FPGA- based architecture...
As machine learning algorithms play an ever increasing role in today's technology, more demands are ...
This paper presents a novel method to double the computation rate of convolutional neural network (C...
Heavily quantized fixed-point arithmetic is becoming a common approach to deploy Convolutional Neura...
Convolution Neural Networks (CNN) are used in many applications ranging from real-time object detect...