A compact scheme is a discretization scheme that is advantageous in obtaining highly accurate solutions. However, the resulting systems from compact schemes are tridiagonal systems that are difficult to solve efficiently on parallel computers. Considering the almost symmetric Toeplitz structure, a parallel algorithm, simple parallel prefix (SPP), is proposed. The SPP algorithm requires less memory than the conventional LU decomposition and is highly efficient on parallel machines. It consists of a prefix communication pattern and AXPY operations. Both the computation and the communication can be truncated without degrading the accuracy when the system is diagonally dominant. A formal accuracy study was conducted to provide a simple truncati...
AbstractThe performance of parallel algorithms implementing the block SOR iterative method, used for...
A general approach to solve boundary value problems numerically in a parallel environment is discuss...
AbstractWe present a parallel prefix algorithm which uses (2(p + 1)p (p + 1) + 2)n − 1 arithmetic an...
A compact scheme is a discretization scheme that is advantageous in obtaining highly accurate soluti...
This paper proposes an efficient parallel computing approach based on a high-order accurate compact ...
AbstractThe recursive doubling algorithm as developed by Stone can be used to solve a tridiagonal li...
Compact finite difference schemes are widely used in the direct numerical simulation of fluid flows ...
A conservative approach for MPI-based parallelization of tridiagonal compact schemes is developed in...
While parallel computers offer significant computational performance, it is generally necessary to e...
AbstractDiagonally dominant tridiagonal Toeplitz systems of linear equations arise in many applicati...
Direct numerical simulation (DNS) of complex flows require solving the problem on parallel machines ...
We examine the super compact symmetric finite difference scheme (SCSFD) and compare it with traditio...
[[abstract]]Solving circulant Toeplitz tridiagonal systems arises in many engineering applications. ...
A new family of parallel schemes for directly solving linear systems is presented and analyzed. It i...
A new parallel numerical scheme for solving incompressible steady-state flows is presented. The algo...
AbstractThe performance of parallel algorithms implementing the block SOR iterative method, used for...
A general approach to solve boundary value problems numerically in a parallel environment is discuss...
AbstractWe present a parallel prefix algorithm which uses (2(p + 1)p (p + 1) + 2)n − 1 arithmetic an...
A compact scheme is a discretization scheme that is advantageous in obtaining highly accurate soluti...
This paper proposes an efficient parallel computing approach based on a high-order accurate compact ...
AbstractThe recursive doubling algorithm as developed by Stone can be used to solve a tridiagonal li...
Compact finite difference schemes are widely used in the direct numerical simulation of fluid flows ...
A conservative approach for MPI-based parallelization of tridiagonal compact schemes is developed in...
While parallel computers offer significant computational performance, it is generally necessary to e...
AbstractDiagonally dominant tridiagonal Toeplitz systems of linear equations arise in many applicati...
Direct numerical simulation (DNS) of complex flows require solving the problem on parallel machines ...
We examine the super compact symmetric finite difference scheme (SCSFD) and compare it with traditio...
[[abstract]]Solving circulant Toeplitz tridiagonal systems arises in many engineering applications. ...
A new family of parallel schemes for directly solving linear systems is presented and analyzed. It i...
A new parallel numerical scheme for solving incompressible steady-state flows is presented. The algo...
AbstractThe performance of parallel algorithms implementing the block SOR iterative method, used for...
A general approach to solve boundary value problems numerically in a parallel environment is discuss...
AbstractWe present a parallel prefix algorithm which uses (2(p + 1)p (p + 1) + 2)n − 1 arithmetic an...