This thesis contributes to the field of sparse linear algebra, graph applications, and preconditioners for Krylov iterative solvers of sparse linear equation systems, by providing a (block) tridiagonal solver library, a generalized sparse matrix-vector implementation, a linear forest extraction, and a multiplicative preconditioner based on tridiagonal solves. The tridiagonal library, which supports (scaled) partial pivoting, outperforms cuSPARSE's tridiagonal solver by factor five while completely utilizing the available GPU memory bandwidth. For the performance optimized solving of multiple right-hand sides, the explicit factorization of the tridiagonal matrix can be computed. The extraction of a weighted linear forest (union of disjoint p...
University of Minnesota Ph.D. dissertation. June 2015. Major: Computer Science. Advisor: Yousef Saad...
AbstractWe formalize the concept of parallel factorization as a set of scalar factorizations. By mea...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
With the breakdown of Dennard scaling in the mid-2000s and the end of Moore's law on the horizon, th...
Abstract. In this paper, we develop, study and implement a restricted additive Schwarz (RAS) precond...
[EN] We consider the computation of a few eigenpairs of a generalized eigenvalue problem Ax = lambda...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Many scientific applications require the solution of large and sparse linear systems of equations us...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-32149-3_18In...
AbstractWe present the recurrence formulas for computing the approximate inverse factors of tridiago...
Many scientific applications require the solution of large and sparse linear systems of equations us...
University of Minnesota Ph.D. dissertation. June 2015. Major: Computer Science. Advisor: Yousef Saad...
AbstractWe formalize the concept of parallel factorization as a set of scalar factorizations. By mea...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Tridiagonal solvers are important building blocks for a wide range of scientific applications that a...
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
With the breakdown of Dennard scaling in the mid-2000s and the end of Moore's law on the horizon, th...
Abstract. In this paper, we develop, study and implement a restricted additive Schwarz (RAS) precond...
[EN] We consider the computation of a few eigenpairs of a generalized eigenvalue problem Ax = lambda...
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
Many scientific applications require the solution of large and sparse linear systems of equations us...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-32149-3_18In...
AbstractWe present the recurrence formulas for computing the approximate inverse factors of tridiago...
Many scientific applications require the solution of large and sparse linear systems of equations us...
University of Minnesota Ph.D. dissertation. June 2015. Major: Computer Science. Advisor: Yousef Saad...
AbstractWe formalize the concept of parallel factorization as a set of scalar factorizations. By mea...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...