The Gustavson’s algorithm (i.e., the row-wise product algorithm) shows its potential as the backbone algorithm for sparse matrix-matrix multiplication (SpMM) on hardware accelerators. However, it still suffers from irregular memory accesses and thus its performance is bounded by the off-chip memory traffic. Previous works mainly focus on high bandwidth memory-based architectures and are not suitable for embedded FPGAs with traditional DDR. In this work, we propose an efficient Gustavson-based SpMM accelerator on embedded FPGAs with element-wise parallelism and access pattern-aware caches. First of all, we analyze the parallelism of the Gustavson’s algorithm and propose to perform the algorithm with element-wise parallelism, which reduces th...
Sparse linear algebra is an important kernel in many different applications. Among various sparse ge...
The widespread adoption of massively parallel processors over the past decade has fundamentally tran...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
Sparse-matrix sparse-matrix multiplication (SpMM) is an important kernel in multiple areas, e.g., da...
Sparse matrix-vector multiplication (SMVM) is a fundamental operation in many scientific and enginee...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
This dissertation presents an architecture to accelerate sparse matrix linear algebra,which is among...
Sparse matrix-vector multiplication (SpMV) is of paramount importance in both scientific and enginee...
Sparse Matrix-Vector Multiplication (SpMxV) is a widely used mathematical operation in many high-per...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
In this whitepaper, we propose outer-product-parallel and inner-product-parallel sparse matrix-matri...
Accessing the memory efficiently to keep up with the data processing rate is a well known problem in...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
Sparse general matrix multiplication (SpGEMM) is a fundamental building block for many real-world ap...
Sparse linear algebra is an important kernel in many different applications. Among various sparse ge...
The widespread adoption of massively parallel processors over the past decade has fundamentally tran...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
Sparse-matrix sparse-matrix multiplication (SpMM) is an important kernel in multiple areas, e.g., da...
Sparse matrix-vector multiplication (SMVM) is a fundamental operation in many scientific and enginee...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
This dissertation presents an architecture to accelerate sparse matrix linear algebra,which is among...
Sparse matrix-vector multiplication (SpMV) is of paramount importance in both scientific and enginee...
Sparse Matrix-Vector Multiplication (SpMxV) is a widely used mathematical operation in many high-per...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
In this whitepaper, we propose outer-product-parallel and inner-product-parallel sparse matrix-matri...
Accessing the memory efficiently to keep up with the data processing rate is a well known problem in...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
Sparse general matrix multiplication (SpGEMM) is a fundamental building block for many real-world ap...
Sparse linear algebra is an important kernel in many different applications. Among various sparse ge...
The widespread adoption of massively parallel processors over the past decade has fundamentally tran...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...