Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications. The standard Thomas algorithm for solving such systems is inherently serial forming a bottleneck in computation. Algorithms such as cyclic reduction and SPIKE reduce a single large tridiagonal system into multiple small independent systems which can be solved in parallel. We have developed portable cyclic reduction and SPIKE algorithm OpenCL implementations with the intent to target a range of co-processors in a heterogeneous computing environment including Field Programmable Gate Arrays (FPGAs), Graphics Processing Units (GPUs) and other multi-core processors. In this paper, we evaluate these designs in the context of solver performance, r...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
While parallel computers offer significant computational performance, it is generally nec-essary to ...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Solving diagonally dominant tridiagonal linear systems is a common problem in scientific high-perfor...
The primary motivation for this research is to determine the feasibility of targeting FPGAs for use ...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
AbstractA parallel version of the cyclic reduction algorithm for the solution of tridiagonal linear ...
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
Partial solution variant of the cyclic reduction (PSCR) method is a direct solver that can be applie...
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
AbstractA parallel version of the cyclic reduction algorithm for the solution of tridiagonal linear ...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
While parallel computers offer significant computational performance, it is generally nec-essary to ...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Solving diagonally dominant tridiagonal linear systems is a common problem in scientific high-perfor...
The primary motivation for this research is to determine the feasibility of targeting FPGAs for use ...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
AbstractA parallel version of the cyclic reduction algorithm for the solution of tridiagonal linear ...
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
Partial solution variant of the cyclic reduction (PSCR) method is a direct solver that can be applie...
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
AbstractA parallel version of the cyclic reduction algorithm for the solution of tridiagonal linear ...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
While parallel computers offer significant computational performance, it is generally nec-essary to ...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...