Recently, the Intel Xeon Phi coprocessor has received increasing attention in high performance computing due to its simple programming model and highly parallel architecture. In this paper, we implement sparse matrix vector multiplication (SpMV) for scale-free matrices on the Xeon Phi architecture and optimize its performance. Scale-free sparse matrices are widely used in various application domains, such as in the study of social networks, gene networks and web graphs. We propose a novel SpMV format called vectorized hybrid COO+CSR (VHCC). Our SpMV implementation employs 2D jagged partitioning, tiling and vectorized prefix sum computations to improve hardware resource utilization, and thus overall performance. As the achieved performance d...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Sparse Matrix-Vector Multiplication (SpMxV) is a widely used mathematical operation in many high-per...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
In this paper, we propose a lightweight optimization methodology for the ubiquitous sparse matrix-ve...
Accelerators such as the Graphic Processing Unit (GPU) have increasingly seen use by the science and...
Abstract. Intel Xeon Phi is a recently released high-performance co-processor which features 61 core...
Sparse matrix-vector multiplication (spMV) is a fundamental building block of iterative solvers in m...
In this whitepaper, we propose outer-product-parallel and inner-product-parallel sparse matrix-matri...
Abstract. The Sparse Matrix-Vector Multiplication is the key operation in many iterative methods. Th...
Efficiently exploiting SIMD vector units is one of the most important aspects in achieving high perf...
Due to ill performance on many devices, sparse matrix-vector multiplication (SpMV) normally requires...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
AbstractExisting formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Sparse Matrix-Vector Multiplication (SpMxV) is a widely used mathematical operation in many high-per...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
In this paper, we propose a lightweight optimization methodology for the ubiquitous sparse matrix-ve...
Accelerators such as the Graphic Processing Unit (GPU) have increasingly seen use by the science and...
Abstract. Intel Xeon Phi is a recently released high-performance co-processor which features 61 core...
Sparse matrix-vector multiplication (spMV) is a fundamental building block of iterative solvers in m...
In this whitepaper, we propose outer-product-parallel and inner-product-parallel sparse matrix-matri...
Abstract. The Sparse Matrix-Vector Multiplication is the key operation in many iterative methods. Th...
Efficiently exploiting SIMD vector units is one of the most important aspects in achieving high perf...
Due to ill performance on many devices, sparse matrix-vector multiplication (SpMV) normally requires...
AbstractThe sparse matrix-vector multiplication (SpMV) is a fundamental kernel used in computational...
AbstractExisting formats for Sparse Matrix-Vector Multiplication (SpMV) on the GPU are outperforming...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Sparse Matrix-Vector Multiplication (SpMxV) is a widely used mathematical operation in many high-per...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...