The permanent is an important characteristic of a matrix and it has been used in many applications. Unfortunately, it is a hard to compute and hard to approximate the immanant. For dense/full matrices, the fastest exact algorithm, Ryser, has O(2n−1n) complexity. In this work, a parallel algorithm, SkipPer, is proposed to exploit the sparsity within the input matrix as much as possible. SkipPer restructures the matrix to reduce the overall work, skips the unnecessary steps, and employs a coarse-grain, shared-memory parallelization with dynamic scheduling. The experiments show that SkipPer increases the performance of exact permanent computation up to 140× compared to the naive version for general matrices. Furthermore, thanks to the coarse-g...
We identify the challenges that are special to parallel sparse matrix-matrix multiplication (PSpGEMM...
Algorithms are often parallelized based on data dependence analysis manually or by means of parallel...
We design and develop a work-efficient multithreaded algorithm for sparse matrix-sparse vector multi...
The permanent is an important characteristic of a matrix and it has been used in many applications. ...
Permanent -just like determinant-, is an important numeric value in order to understand matrix chara...
International audienceWe discuss efficient shared memory parallelization of sparse matrix computatio...
Gary Kumfert and Alex Pothen have improved the quality and run time of two ordering algorithms for m...
International audienceIn this paper, we propose a generic method of automatic parallelization for sp...
AbstractThe matrix-vector multiplication operation is the kernel of most numerical algorithms.Typica...
Problems in the class of unstructured sparse matrix computations are characterized by highly irregul...
The sparse matrix--vector multiplication is an important kernel, but is hard to efficiently execute ...
This book is primarily intended as a research monograph that could also be used in graduate courses ...
Several fine grained parallel algorithms were developed and compared to compute the Cholesky factori...
Sparse-matrix solution is a dominant part of execution time in simulating VLSI circuits by a detaile...
We give a parallel algorithm for the problem of computing the row minima of a totally monotone two-d...
We identify the challenges that are special to parallel sparse matrix-matrix multiplication (PSpGEMM...
Algorithms are often parallelized based on data dependence analysis manually or by means of parallel...
We design and develop a work-efficient multithreaded algorithm for sparse matrix-sparse vector multi...
The permanent is an important characteristic of a matrix and it has been used in many applications. ...
Permanent -just like determinant-, is an important numeric value in order to understand matrix chara...
International audienceWe discuss efficient shared memory parallelization of sparse matrix computatio...
Gary Kumfert and Alex Pothen have improved the quality and run time of two ordering algorithms for m...
International audienceIn this paper, we propose a generic method of automatic parallelization for sp...
AbstractThe matrix-vector multiplication operation is the kernel of most numerical algorithms.Typica...
Problems in the class of unstructured sparse matrix computations are characterized by highly irregul...
The sparse matrix--vector multiplication is an important kernel, but is hard to efficiently execute ...
This book is primarily intended as a research monograph that could also be used in graduate courses ...
Several fine grained parallel algorithms were developed and compared to compute the Cholesky factori...
Sparse-matrix solution is a dominant part of execution time in simulating VLSI circuits by a detaile...
We give a parallel algorithm for the problem of computing the row minima of a totally monotone two-d...
We identify the challenges that are special to parallel sparse matrix-matrix multiplication (PSpGEMM...
Algorithms are often parallelized based on data dependence analysis manually or by means of parallel...
We design and develop a work-efficient multithreaded algorithm for sparse matrix-sparse vector multi...