AbstractExisting formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming their corresponding implementations on multi-core CPUs. In this paper, we present a new format called Sliced COO (SCOO) and an effcient CUDA implementation to perform SpMV on the GPU. While previous work shows experiments on small to medium-sized sparse matrices, we perform evaluations on large sparse matrices. We compared SCOO performance to existing formats of the NVIDIA Cusp library. Our resutls on a Fermi GPU show that SCOO outperforms the COO and CSR format for all tested matrices and the HYB format for all tested unstructured matrices. Furthermore, comparison to a Sandy-Bridge CPU shows that SCOO on a Fermi GPU outperforms the multi-th...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
Abstract—We present new adaptive format for storing sparse matrices on GPU. We compare it with sever...
The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many i...
AbstractExisting formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming...
General purpose computation on graphics processing unit (GPU) is prominent in the high performance c...
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
Efficient processing of Irregular Matrices on Single Instruction, Multiple Data (SIMD)-type architec...
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multip...
Abstract—This paper presents a performance modeling and optimization analysis tool to predict and op...
Sparse matrix-vector multiplication is an integral part of many scientific algorithms. Several studi...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
Sparse matrix multiplication is a common operation in linear algebra and an important element of oth...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
Abstract—We present new adaptive format for storing sparse matrices on GPU. We compare it with sever...
The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many i...
AbstractExisting formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming...
General purpose computation on graphics processing unit (GPU) is prominent in the high performance c...
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
Efficient processing of Irregular Matrices on Single Instruction, Multiple Data (SIMD)-type architec...
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multip...
Abstract—This paper presents a performance modeling and optimization analysis tool to predict and op...
Sparse matrix-vector multiplication is an integral part of many scientific algorithms. Several studi...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
Sparse matrix multiplication is a common operation in linear algebra and an important element of oth...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
Abstract—We present new adaptive format for storing sparse matrices on GPU. We compare it with sever...
The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many i...