The solving of tridiagonal systems is one of the most computationally expensive parts in many applications, so that multiple studies have explored the use of NVIDIA GPUs to accelerate such computation. However, these studies have mainly focused on using parallel algorithms to compute such systems, which can efficiently exploit the shared memory and are able to saturate the GPUs capacity with a low number of systems, presenting a poor scalability when dealing with a relatively high number of systems. The gtsvStridedBatch routine in the cuSPARSE NVIDIA package is one of these examples, which is used as reference in this article. We propose a new implementation (cuThomasBatch) based on the Thomas algorithm. Unlike other algorithms, the Thomas ...
The computational demands of multivariate clustering grow rapidly, and therefore processing large da...
Many numerical optimisation problems rely on fast algorithms for solving sparse triangular systems o...
This thesis contributes to the field of sparse linear algebra, graph applications, and preconditione...
The solving of tridiagonal systems is one of the most computationally expensive parts in many applic...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
[Abstract] Current Graphics Processing Units (GPUs) are capable of obtaining high computational perf...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
Understand the human brain is one of the century challenges. On this work we are going to achieve a ...
Many eigenvalue and eigenvector algorithms begin with reducing the input matrix into a tridiagonal ...
In this paper we present how recent hardware revisions and newly introduced approaches to thread col...
Video cards have now outgrown their purpose of being only a simple tool for graphic display. With th...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
With serial, or sequential, computational operations\u27 growth rate slowing over the past few years...
The computational demands of multivariate clustering grow rapidly, and therefore processing large da...
Many numerical optimisation problems rely on fast algorithms for solving sparse triangular systems o...
This thesis contributes to the field of sparse linear algebra, graph applications, and preconditione...
The solving of tridiagonal systems is one of the most computationally expensive parts in many applic...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
[Abstract] Current Graphics Processing Units (GPUs) are capable of obtaining high computational perf...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
Understand the human brain is one of the century challenges. On this work we are going to achieve a ...
Many eigenvalue and eigenvector algorithms begin with reducing the input matrix into a tridiagonal ...
In this paper we present how recent hardware revisions and newly introduced approaches to thread col...
Video cards have now outgrown their purpose of being only a simple tool for graphic display. With th...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
With serial, or sequential, computational operations\u27 growth rate slowing over the past few years...
The computational demands of multivariate clustering grow rapidly, and therefore processing large da...
Many numerical optimisation problems rely on fast algorithms for solving sparse triangular systems o...
This thesis contributes to the field of sparse linear algebra, graph applications, and preconditione...