In order to deliver high performance efficiently, modern processors include dedicated hardware to accelerate different application domains. For example, several recent processors include dedicated Machine Learning (ML) accelerators. However, while adding dedicated hardware improves efficiency compared to general-purpose CPUs, it also requires a larger area, making it unfeasible for smaller devices. Therefore, exploring ways to use the existing hardware for different functionalities becomes desirable in those setups. In this work, we explore the reuse of the components in a Vector Processing Unit (VPU) to offer the functionality of a Systolic Array (SA) for General Matrix Multiplication (GEMM), a kernel extensively used in machine learning, ...
A soft vector processor (SVP) is an overlay on top of FPGAs that allows data- parallel algorithms to...
Recently, availability of big data and enormous processing power along with maturing of the applied ...
Convolution Neural Networks (CNN) are used in many applications ranging from real-time object detect...
Research on artificial neural networks (ANNs) has been carried out for more than five decades. A ren...
Today’s computer systems develop towards less energy consumption while keeping high performance. The...
In this thesis, we propose a new systolic architecture which is based on the Faddeev\u27s algorithm....
The Smith Waterman algorithm is used to perform local alignment on biological sequences by calculati...
One of the key kernels in scientific applications is the Sparse Matrix Vector Multiplication (SMVM)....
The dissemination of multi-core architectures and the later irruption of massively parallel devices,...
This paper presents architecture for matrix multiplication optimized to be integrated as an accelera...
In this paper, we present an advanced algorithm-hardware co-optimization method for designing an eff...
Conventional sequential processing on software with a general purpose CPU has become significantly i...
With the explosion of AI in recent years, there has been an exponential rise in the demand for compu...
Support vector machines (SVM) are a popular class of supervised models in machine learning. The asso...
As artificial intelligence (AI) and machine learning (ML) technologies disrupt a wide range of indus...
A soft vector processor (SVP) is an overlay on top of FPGAs that allows data- parallel algorithms to...
Recently, availability of big data and enormous processing power along with maturing of the applied ...
Convolution Neural Networks (CNN) are used in many applications ranging from real-time object detect...
Research on artificial neural networks (ANNs) has been carried out for more than five decades. A ren...
Today’s computer systems develop towards less energy consumption while keeping high performance. The...
In this thesis, we propose a new systolic architecture which is based on the Faddeev\u27s algorithm....
The Smith Waterman algorithm is used to perform local alignment on biological sequences by calculati...
One of the key kernels in scientific applications is the Sparse Matrix Vector Multiplication (SMVM)....
The dissemination of multi-core architectures and the later irruption of massively parallel devices,...
This paper presents architecture for matrix multiplication optimized to be integrated as an accelera...
In this paper, we present an advanced algorithm-hardware co-optimization method for designing an eff...
Conventional sequential processing on software with a general purpose CPU has become significantly i...
With the explosion of AI in recent years, there has been an exponential rise in the demand for compu...
Support vector machines (SVM) are a popular class of supervised models in machine learning. The asso...
As artificial intelligence (AI) and machine learning (ML) technologies disrupt a wide range of indus...
A soft vector processor (SVP) is an overlay on top of FPGAs that allows data- parallel algorithms to...
Recently, availability of big data and enormous processing power along with maturing of the applied ...
Convolution Neural Networks (CNN) are used in many applications ranging from real-time object detect...