International audienceThis paper focuses on the resolution of a large number of small symmetric linear systems and its parallel implementation on single precision on GPUs. The computations involved by each linear system are independent from the others and the number of unknowns does not exceed 64. For this purpose, we present the adaptation to our context of largely used methods that include: LDLt, House-holder reduction to a tridiagonal matrix, parallel cyclic reduction that is not a power of two and the divide and conquer algorithm for tridiagonal eigenprob-lems. We not only detail the implementation and optimization of each method but we also compare the sustainability of each solution and its performance which include both parallel comp...
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. ...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Jack Dongarra z We present a new parallel implementation of a divide and conquer algo-rithm for comp...
International audienceThis paper focuses on the resolution of a large number of small symmetric line...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
We illustrate how linear algebra calculations can be enhanced by statistical techniques in the case ...
International audienceThis paper studies the performance of different algorithms for solving a dense...
International audienceWe illustrate how linear algebra calculations can be enhanced by statistical t...
AbstractThe recursive doubling algorithm as developed by Stone can be used to solve a tridiagonal li...
Banded linear systems with large bandwidths can be solved by similar methods as full linear systems....
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
Many eigenvalue and eigenvector algorithms begin with reducing the input matrix into a tridiagonal ...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
AbstractDiagonally dominant tridiagonal Toeplitz systems of linear equations arise in many applicati...
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. ...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Jack Dongarra z We present a new parallel implementation of a divide and conquer algo-rithm for comp...
International audienceThis paper focuses on the resolution of a large number of small symmetric line...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
We illustrate how linear algebra calculations can be enhanced by statistical techniques in the case ...
International audienceThis paper studies the performance of different algorithms for solving a dense...
International audienceWe illustrate how linear algebra calculations can be enhanced by statistical t...
AbstractThe recursive doubling algorithm as developed by Stone can be used to solve a tridiagonal li...
Banded linear systems with large bandwidths can be solved by similar methods as full linear systems....
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
Many eigenvalue and eigenvector algorithms begin with reducing the input matrix into a tridiagonal ...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
AbstractDiagonally dominant tridiagonal Toeplitz systems of linear equations arise in many applicati...
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. ...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Jack Dongarra z We present a new parallel implementation of a divide and conquer algo-rithm for comp...