We have ported the numerical factorization and triangular solve phases of the sparse direct solver STRUMPACK to GPUs. STRUMPACK implements sparse LU factorization using the multifrontal algorithm, which performs most of its work as dense linear algebra operations on so-called frontal matrices of various sizes. Our GPU implementation offloads these dense linear algebra operations, as well as the sparse scatter–gather operations between frontal matrices. For the larger frontal matrices, it relies on vendor libraries such as cuBLAS and cuSOLVER for NVIDIA GPUs and rocBLAS and rocSOLVER for AMD GPUs. For the smaller frontal matrices, we developed custom CUDA and HIP kernels to reduce kernel launch overhead. Overall, hig...
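To make the two kinds of work named above concrete, here is a minimal CPU-only sketch in plain Python of one multifrontal step: a dense partial LU factorization of a frontal matrix, followed by the scatter-add ("extend-add") of its Schur complement into a parent front. It is an illustration under stated assumptions, not STRUMPACK's implementation: the function names `partial_factor` and `extend_add`, the list-of-lists storage, the absence of pivoting, and the example matrices are all hypothetical.

```python
def partial_factor(front, k):
    """Eliminate the first k pivots of a dense front in place (no pivoting).

    Returns the trailing (n-k) x (n-k) Schur complement, i.e. the
    contribution block that is passed on to the parent front.
    Assumption: pivots are nonzero; a real solver would pivot.
    """
    n = len(front)
    for p in range(k):
        pivot = front[p][p]
        for i in range(p + 1, n):
            front[i][p] /= pivot  # entry of the L factor
            for j in range(p + 1, n):
                # rank-1 update of the trailing submatrix
                front[i][j] -= front[i][p] * front[p][j]
    return [row[k:] for row in front[k:]]


def extend_add(parent, parent_idx, child_cb, child_idx):
    """Scatter-add a child's contribution block into the parent front.

    parent_idx and child_idx list the global row/column index of each
    local row/column; every child index must appear in parent_idx.
    """
    pos = {g: loc for loc, g in enumerate(parent_idx)}
    for i, gi in enumerate(child_idx):
        for j, gj in enumerate(child_idx):
            parent[pos[gi]][pos[gj]] += child_cb[i][j]


# Hypothetical child front on global indices [0, 1, 3]; eliminate index 0.
child = [[4.0, 2.0, 2.0],
         [2.0, 5.0, 3.0],
         [2.0, 3.0, 6.0]]
cb = partial_factor(child, 1)          # contribution block on indices [1, 3]

# Hypothetical parent front on global indices [1, 2, 3], initially zero.
parent = [[0.0] * 3 for _ in range(3)]
extend_add(parent, [1, 2, 3], cb, [1, 3])
```

In a GPU setting, the dense loops in `partial_factor` correspond to the operations that the abstract says are handled by vendor libraries or custom kernels, while `extend_add` corresponds to the sparse scatter–gather step between fronts.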
The sparse solver has become the bottleneck of SPICE simulators. There has been little work on GPU-based sp...
Abstract. Linear systems must be solved in many scientific applications, and the solution of t...
Abstract. This paper describes our progress in developing software for performing parallel LU factorization...
For many finite element problems, when represented as sparse matrices, iterative solvers are found t...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Abstract. We present a new sparse linear solver for GPUs. It is designed to work with structured spa...
We present a new sparse linear solver for GPUs. It is designed to work with st...
In this poster, a GPU-accelerated sparse multifrontal solver for structurally symmetric matrices is ...
The original publication is available at www.springerlink.com. A wide class of g...
To appear. A wide class of numerical methods needs to solve a linear system, whe...