Block iterative methods are extremely important as smoothers for multigrid methods, as preconditioners for Krylov methods, and as solvers for diagonally dominant linear systems. Developing robust and efficient smoother algorithms suitable for current and evolving GPU and multicore CPU systems is a significant challenge. We address this issue in the case of constant-coefficient stencils arising in the solution of elliptic partial differential equations on structured 3D uniform and adaptively refined grids. Robust, highly parallel implementations of block Jacobi and chaotic block Gauss-Seidel algorithms with exact inversion of the blocks are developed using different parallelization techniques. Experimental results for NVIDIA Fermi/Kepler GPU...
Abstract. Numerical solutions of nonlinear partial differential equations frequently rely on iterati...
Efficient solution of partial differential equations require a match between the algorithm and the t...
Finite Element problems are often solved using multigrid techniques. The most time consuming part of...
Gauss-Seidel is a popular multigrid smoother as it is provably optimal on structured grids and exhib...
Many scientific applications require the solution of large and sparse linear systems of equations us...
AbstractIn this paper we investigate how stencil computations can be implemented on state-of-the-art...
Abstract: This paper offers an algorithm of calculation of points of a computational front...
In this paper, we analyze the potential of asynchronous relaxation methods on Graphics Processing Un...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
For solving systems of linear algebraic equations with block-tridiagonal matrices arising in geoelec...
The solution of elliptic partial differential equations is a common performance bottleneck in scient...
It was recently shown that block-circulant preconditioners applied to a conjugate gradient method us...
We propose an efficient and robust algorithm to solve the steady Euler equations on unstructured gri...
Parallel 2-point and 3-point block method will simultaneously compute the numerical solutions at two...
Efficient solution of partial differential equations require a match between the algorithm and the t...
Abstract. Numerical solutions of nonlinear partial differential equations frequently rely on iterati...
Efficient solution of partial differential equations require a match between the algorithm and the t...
Finite Element problems are often solved using multigrid techniques. The most time consuming part of...
Gauss-Seidel is a popular multigrid smoother as it is provably optimal on structured grids and exhib...
Many scientific applications require the solution of large and sparse linear systems of equations us...
AbstractIn this paper we investigate how stencil computations can be implemented on state-of-the-art...
Abstract: This paper offers an algorithm of calculation of points of a computational front...
In this paper, we analyze the potential of asynchronous relaxation methods on Graphics Processing Un...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
For solving systems of linear algebraic equations with block-tridiagonal matrices arising in geoelec...
The solution of elliptic partial differential equations is a common performance bottleneck in scient...
It was recently shown that block-circulant preconditioners applied to a conjugate gradient method us...
We propose an efficient and robust algorithm to solve the steady Euler equations on unstructured gri...
Parallel 2-point and 3-point block method will simultaneously compute the numerical solutions at two...
Efficient solution of partial differential equations require a match between the algorithm and the t...
Abstract. Numerical solutions of nonlinear partial differential equations frequently rely on iterati...
Efficient solution of partial differential equations require a match between the algorithm and the t...
Finite Element problems are often solved using multigrid techniques. The most time consuming part of...