This paper presents an integrated analytical and profile-based cross-architecture performance modeling tool to specifically provide inter-architecture performance prediction for Sparse Matrix-Vector Multiplication (SpMV) on NVIDIA GPU architectures. To design and construct the tool, we investigate the inter-architecture relative performance for multiple SpMV kernels. For a sparse matrix, based on its SpMV kernel performance measured on a reference architecture, our cross-architecture performance modeling tool can accurately predict its SpMV kernel performance on a target architecture. The prediction results can effectively assist researchers in making choice of an appropriate architecture that best fits their needs from a wide range of avai...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
We develop a microbenchmark-based performance model for NVIDIA GeForce 200-series GPUs. Our model id...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
Abstract—This paper presents a performance modeling and optimization analysis tool to predict and op...
AbstractThis paper presents unique modeling algorithms of performance prediction for sparse matrix-v...
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multip...
Predicting the runtime of a sparse matrix-vector multiplication (SpMV) for different sparse matrix f...
Predicting the runtime of a sparse matrix-vector multiplication (SpMV) for different sparse matrix f...
Graphics processing units (GPUs) have delivered a remarkable performance for a variety of high perfo...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
The sparse Matrix-Vector multiplication is a key operation in science and engineering along with th...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
General purpose computation on graphics processing unit (GPU) is prominent in the high performance c...
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
We develop a microbenchmark-based performance model for NVIDIA GeForce 200-series GPUs. Our model id...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
Abstract—This paper presents a performance modeling and optimization analysis tool to predict and op...
AbstractThis paper presents unique modeling algorithms of performance prediction for sparse matrix-v...
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multip...
Predicting the runtime of a sparse matrix-vector multiplication (SpMV) for different sparse matrix f...
Predicting the runtime of a sparse matrix-vector multiplication (SpMV) for different sparse matrix f...
Graphics processing units (GPUs) have delivered a remarkable performance for a variety of high perfo...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
The sparse Matrix-Vector multiplication is a key operation in science and engineering along with th...
Abstract. Graphics Processing Units (GPUs) are massive data parallel processors. High performance co...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
General purpose computation on graphics processing unit (GPU) is prominent in the high performance c...
The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many hi...
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Com...
We develop a microbenchmark-based performance model for NVIDIA GeForce 200-series GPUs. Our model id...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...