Partial solution variant of the cyclic reduction (PSCR) method is a direct solver that can be applied to certain types of separable block tridiagonal linear systems. Such linear systems arise, e.g., from the Poisson and the Helmholtz equations discretized with bilinear finite-elements. Furthermore, the separability of the linear system entails that the discretization domain has to be rectangular and the discretization mesh orthogonal. A generalized graphics processing unit (GPU) implementation of the PSCR method is presented. The numerical results indicate up to 24-fold speedups when compared to an equivalent CPU implementation that utilizes a single CPU core. Attained floating point performance is analyzed using roofline performance analys...
A conventional block cyclic reduction algorithm operates by halving the size of the linear system at...
none2siIn this paper we describe a new tridiagonal equation solver, based on a rank-one updating str...
none2siIn this paper we describe a new tridiagonal equation solver, based on a rank-one updating str...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
AbstractA parallel version of the cyclic reduction algorithm for the solution of tridiagonal linear ...
this paper, the wrap-around partitioning methodology, originally proposed by Hegland [1], is conside...
A method is described to solve the systems of tridiagonal linear equations that result from discrete...
Two block cyclic reduction linear system solvers are considered and implemented using the OpenCL fr...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
A conventional block cyclic reduction algorithm operates by halving the size of the linear system at...
none2siIn this paper we describe a new tridiagonal equation solver, based on a rank-one updating str...
none2siIn this paper we describe a new tridiagonal equation solver, based on a rank-one updating str...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
AbstractA parallel version of the cyclic reduction algorithm for the solution of tridiagonal linear ...
this paper, the wrap-around partitioning methodology, originally proposed by Hegland [1], is conside...
A method is described to solve the systems of tridiagonal linear equations that result from discrete...
Two block cyclic reduction linear system solvers are considered and implemented using the OpenCL fr...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
A conventional block cyclic reduction algorithm operates by halving the size of the linear system at...
none2siIn this paper we describe a new tridiagonal equation solver, based on a rank-one updating str...
none2siIn this paper we describe a new tridiagonal equation solver, based on a rank-one updating str...