Sparse Matrix-Vector Multiplication (SpMV) is an important computational kernel in scientific applications that tends to perform poorly on modern processors because of irregular memory accesses. GPU have evolved into a very attractive hardware platform for general purpose computations due to their high floating-point computation performance, which results in that GPGPU becomes the hot and popular topic in HPC. Therefore, we need to parallelize and optimize SpMV on GPGPU to get a better performance. In this paper, we studied the register-level blocking algorithm and the heuristic algorithm to optimize SpMV performance on GPGPU. Based on AMD Stream Computing, We propose an automatic performance tuning SpMV software package on GPGPU, with its ...
Many-core GPUs provide high computing ability and substantial bandwidth; however, optimizing irregul...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multip...
Abstract—This paper presents a performance modeling and optimization analysis tool to predict and op...
Sparse matrix-vector multiplication (SpMV) is a key operation in scientific computing and engineerin...
The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many i...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
Graphics processing units (GPUs) have delivered a remarkable performance for a variety of high perfo...
SpMV is a key linear algebra algorithm and has been widely used in many important application domain...
Sparse matrix-vector multiplication (spMVM) is the dominant operation in many sparse solvers. We in...
Accelerators such as the Graphic Processing Unit (GPU) have increasingly seen use by the science and...
We implement a promising algorithm for sparse-matrix sparse-vector multiplication (SpMSpV) on the GP...
Many-core GPUs provide high computing ability and substantial bandwidth; however, optimizing irregul...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multip...
Abstract—This paper presents a performance modeling and optimization analysis tool to predict and op...
Sparse matrix-vector multiplication (SpMV) is a key operation in scientific computing and engineerin...
The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many i...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
Graphics processing units (GPUs) have delivered a remarkable performance for a variety of high perfo...
SpMV is a key linear algebra algorithm and has been widely used in many important application domain...
Sparse matrix-vector multiplication (spMVM) is the dominant operation in many sparse solvers. We in...
Accelerators such as the Graphic Processing Unit (GPU) have increasingly seen use by the science and...
We implement a promising algorithm for sparse-matrix sparse-vector multiplication (SpMSpV) on the GP...
Many-core GPUs provide high computing ability and substantial bandwidth; however, optimizing irregul...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...