In this whitepaper, we propose outer-product-parallel and inner-product-parallel sparse matrix-matrix multiplication (SpMM) algorithms for the Xeon Phi architecture. We discuss the trade-offs between these two parallelization schemes for the Xeon Phi architecture. We also propose two hypergraph-partitioning-based matrix partitioning and row/column reordering methods that achieve temporal locality in these two parallelization schemes. Both HP models try to minimize the total number of transfers from/to the memory while maintaining balance on computational loads of threads. The experimental results performed for realistic SpMM instances show that the Intel MIC architecture has the potential for attaining high performance in irregular applicat...
Several manufacturers have already started to commercialize near-bank Processing-In-Memory (PIM) arc...
Parallel sparse matrix-matrix multiplication algorithms (PSpGEMM) spend most of their running time o...
This dissertation presents an architecture to accelerate sparse matrix linear algebra,which is among...
Exploiting spatial and temporal localities is investigated for efficient row-by-row parallelization ...
In this paper, we propose a lightweight optimization methodology for the ubiquitous sparse matrix-ve...
Sparse matrix-vector and matrix-transpose-vector multiplication (SpMMTV) repeatedly performed as z ←...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
Recently, the Intel Xeon Phi coprocessor has received increasing attention in high performance compu...
Abstract. Intel Xeon Phi is a recently released high-performance co-processor which features 61 core...
We investigate outer-product--parallel, inner-product--parallel, and row-by-row-product--parallel fo...
The Gustavson’s algorithm (i.e., the row-wise product algorithm) shows its potential as the backbone...
Cataloged from PDF version of article.Thesis (M.S.): Bilkent University, Department of Computer Engi...
Sparse matrix-vector multiplication (SpMxV) is a kernel operation widely used in iterative linear so...
Several manufacturers have already started to commercialize near-bank Processing-In-Memory (PIM) arc...
Parallel sparse matrix-matrix multiplication algorithms (PSpGEMM) spend most of their running time o...
This dissertation presents an architecture to accelerate sparse matrix linear algebra,which is among...
Exploiting spatial and temporal localities is investigated for efficient row-by-row parallelization ...
In this paper, we propose a lightweight optimization methodology for the ubiquitous sparse matrix-ve...
Sparse matrix-vector and matrix-transpose-vector multiplication (SpMMTV) repeatedly performed as z ←...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
Recently, the Intel Xeon Phi coprocessor has received increasing attention in high performance compu...
Abstract. Intel Xeon Phi is a recently released high-performance co-processor which features 61 core...
We investigate outer-product--parallel, inner-product--parallel, and row-by-row-product--parallel fo...
The Gustavson’s algorithm (i.e., the row-wise product algorithm) shows its potential as the backbone...
Cataloged from PDF version of article.Thesis (M.S.): Bilkent University, Department of Computer Engi...
Sparse matrix-vector multiplication (SpMxV) is a kernel operation widely used in iterative linear so...
Several manufacturers have already started to commercialize near-bank Processing-In-Memory (PIM) arc...
Parallel sparse matrix-matrix multiplication algorithms (PSpGEMM) spend most of their running time o...
This dissertation presents an architecture to accelerate sparse matrix linear algebra,which is among...