Sparse triangular solve (SpTRSV) is one of the most important kernels in many real-world applications. Currently, much research on parallel SpTRSV focuses on level-set construction for reducing the number of inter-level synchronizations. However, the out-of-control data reuse and high cost for global memory or shared cache access in inter-level synchronization have been largely neglected in existing work. In this paper, we propose a novel data layout called Sparse Level Tile to make all data reuse under control, and design a Producer-Consumer pairing method to make any inter-level synchronization only happen in very fast register communication. We implement our data layout and algorithms on an SW26010 many-core processor, which is the main...
A few parallel algorithms for solving triangular systems resulting from parallel factorization of sp...
Sparse direct solvers play a vital role in large-scale high performance scientific and engineering c...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Abstract. The last decade has seen rapid growth of single-chip multi-processors (CMPs), which have b...
International audienceWe consider techniques to improve the performance of parallel sparse triangula...
Sparse triangular solve (SpTRSV) is an extensively studied computational kernel. An important obstac...
We propose a parallel sparse triangular linear system solver based on the Spike algorithm. Sparse tr...
We implement a promising algorithm for sparse-matrix sparse-vector multiplication (SpMSpV) on the GP...
Many numerical optimisation problems rely on fast algorithms for solving sparse triangular systems o...
We design and develop a work-efficient multithreaded algorithm for sparse matrix-sparse vector multi...
In earlier work we have introduced the “Recursive Sparse Blocks ” (RSB) sparse matrix storage scheme...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
The level-set method is one of the most popular techniques for capturing and tracking deformable int...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
A few parallel algorithms for solving triangular systems resulting from parallel factorization of sp...
Sparse direct solvers play a vital role in large-scale high performance scientific and engineering c...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Abstract. The last decade has seen rapid growth of single-chip multi-processors (CMPs), which have b...
International audienceWe consider techniques to improve the performance of parallel sparse triangula...
Sparse triangular solve (SpTRSV) is an extensively studied computational kernel. An important obstac...
We propose a parallel sparse triangular linear system solver based on the Spike algorithm. Sparse tr...
We implement a promising algorithm for sparse-matrix sparse-vector multiplication (SpMSpV) on the GP...
Many numerical optimisation problems rely on fast algorithms for solving sparse triangular systems o...
We design and develop a work-efficient multithreaded algorithm for sparse matrix-sparse vector multi...
In earlier work we have introduced the “Recursive Sparse Blocks ” (RSB) sparse matrix storage scheme...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
Sparse matrix-vector multiplication (SpMV) is an important ker-nel in many scientific applications a...
The level-set method is one of the most popular techniques for capturing and tracking deformable int...
AbstractSparse matrix vector multiplication (SpMV) is the dominant kernel in scientific simulations....
A few parallel algorithms for solving triangular systems resulting from parallel factorization of sp...
Sparse direct solvers play a vital role in large-scale high performance scientific and engineering c...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...