The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many iterative algorithms for solving scientific and engineering problems. One of the main challenges of SpMV is its memory-boundedness. Although compression has been proposed previously to improve SpMV performance on CPUs, its use has not been demonstrated on the GPU because of the serial nature of many compression and decompression schemes. In this paper, we introduce a family of bit-representation-optimized (BRO) compression schemes for representing sparse matrices on GPUs. The proposed schemes, BRO-ELL, BRO-COO, and BRO-HYB, perform compression on index data and help to speed up SpMV on GPUs through reduction of memory traffic. Furthermore, ...
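To make the idea of index compression in the abstract above concrete, the following is a minimal sketch in Python. It is not the BRO-ELL, BRO-COO, or BRO-HYB scheme itself, whose details are not given here. It assumes a simple stand-in technique: delta-encode each row's sorted column indices and pack the deltas with a per-row fixed bit width. All function names (`delta_encode_row`, `compress_row`, `spmv_compressed`) are hypothetical and chosen for illustration only.

```python
# Illustrative sketch only: delta encoding plus fixed-width bit packing of
# column indices. This is an assumed stand-in, not the BRO format itself.

def delta_encode_row(cols):
    """Return the first column index followed by gaps between sorted indices."""
    deltas = [cols[0]] if cols else []
    for prev, cur in zip(cols, cols[1:]):
        deltas.append(cur - prev)
    return deltas

def bits_needed(values):
    """Smallest bit width (at least 1) that can hold every value."""
    return max(1, max(values, default=0).bit_length())

def pack(values, width):
    """Pack non-negative integers into one integer, `width` bits each."""
    word = 0
    for i, v in enumerate(values):
        word |= v << (i * width)
    return word

def unpack(word, width, count):
    """Inverse of pack: extract `count` fields of `width` bits each."""
    mask = (1 << width) - 1
    return [(word >> (i * width)) & mask for i in range(count)]

def compress_row(cols, vals):
    """Compress one row: packed deltas, their bit width, and the values."""
    deltas = delta_encode_row(cols)
    width = bits_needed(deltas)
    return pack(deltas, width), width, list(vals)

def spmv_compressed(rows, x):
    """Compute y = A x, decoding column indices on the fly for each row."""
    y = []
    for packed, width, vals in rows:
        col = 0
        acc = 0.0
        for delta, v in zip(unpack(packed, width, len(vals)), vals):
            col += delta          # running sum recovers the column index
            acc += v * x[col]
        y.append(acc)
    return y

# A 3x4 example matrix given row by row as (column indices, values).
rows = [
    compress_row([0, 2], [1.0, 2.0]),
    compress_row([1], [3.0]),
    compress_row([0, 1, 3], [4.0, 5.0, 6.0]),
]
y = spmv_compressed(rows, [1.0, 2.0, 3.0, 4.0])
```

In this sketch the gaps between neighbouring column indices are usually much smaller than the indices themselves, so they fit in fewer bits. That is the general reason index compression can reduce memory traffic in a memory-bound kernel such as SpMV. A GPU implementation would also have to arrange the packed data so that threads can decode it in parallel, which this sequential sketch does not attempt.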
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
We implement a promising algorithm for sparse-matrix sparse-vector multiplication (SpMSpV) on the GP...
Abstract—This paper presents a performance modeling and optimization analysis tool to predict and op...
Abstract. The sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
Sparse matrix-vector multiplication is an integral part of many scientific algorithms. Several studi...
Abstract. Sparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multip...
Graphics processing units (GPUs) have delivered a remarkable performance for a variety of high perfo...
Many-core GPUs provide high computing ability and substantial bandwidth; however, optimizing irregul...
10.1145/2503210.2503234 International Conference for High Performance Computing, Networking, Storage ...
Sparse matrix-vector multiplication (spMVM) is the dominant operation in many sparse solvers. We in...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...