We contribute to the optimization of the sparse matrix-vector product on graphics processing units by introducing a variant of the coordinate sparse matrix layout that compresses the integer representation of the matrix indices. In addition, we employ a look-ahead table to avoid the storage of repeated numerical values in the sparse matrix, yielding a more compact data representation that is easier to maintain in the cache. Our evaluation on the two most recent generations of NVIDIA GPUs, the V100 and the A100 architectures, shows considerable performance improvements over the kernels for the sparse matrix-vector product in cuSPARSE (CUDA 11.0.167).This work was partially sponsored by the EU H2020 project 732631 OPRECOMP and project TIN2017...
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
Sparse matrix computations are ubiquitous in scientific computing; General-Purpose computing on Grap...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
We contribute to the optimization of the sparse matrix-vector product by introducing a variant of th...
Efficient processing of Irregular Matrices on Single Instruction, Multiple Data (SIMD)-type architec...
Sparse matrix-vector multiplication is an integral part of many scientific algorithms. Several studi...
AbstractExisting formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming...
General purpose computation on graphics processing unit (GPU) is prominent in the high performance c...
Abstract. A new format for storing sparse matrices is proposed for efficient sparse matrix-vector (S...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
Sparse matrix–vector multiplication (SpMV) is of singular importance in sparse linear algebra, which...
Sparse linear algebra kernels play a critical role in numerous applications, covering from exascale ...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
Sparse matrix computations are ubiquitous in scientific computing; General-Purpose computing on Grap...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
We contribute to the optimization of the sparse matrix-vector product by introducing a variant of th...
Efficient processing of Irregular Matrices on Single Instruction, Multiple Data (SIMD)-type architec...
Sparse matrix-vector multiplication is an integral part of many scientific algorithms. Several studi...
AbstractExisting formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming...
General purpose computation on graphics processing unit (GPU) is prominent in the high performance c...
Abstract. A new format for storing sparse matrices is proposed for efficient sparse matrix-vector (S...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
Sparse matrix–vector multiplication (SpMV) is of singular importance in sparse linear algebra, which...
Sparse linear algebra kernels play a critical role in numerous applications, covering from exascale ...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
Sparse matrix computations are ubiquitous in scientific computing; General-Purpose computing on Grap...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...