The emergence of multicore architectures and highly scalable platforms motivates the development of novel algorithms and techniques that emphasize concurrency and are tolerant of deep memory hierarchies, as opposed to minimizing raw FLOP counts. While direct solvers are reliable, they are often slow and memory-intensive for large problems. Iterative solvers, on the other hand, are more efficient but, in the absence of robust preconditioners, lack reliability. While preconditioners based on incomplete factorizations (whenever they exist) are effective for many problems, their parallel scalability is generally limited. In this paper, we advocate the use of banded preconditioners instead and introduce a reordering strategy that enables their ...
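To make the idea of a banded preconditioner concrete, the following is a minimal sketch and not the method of the paper. It assumes the large entries of the matrix already lie near the diagonal and does not attempt the reordering strategy the abstract mentions. The helper names (`extract_band`, `solve_tridiagonal`, `preconditioned_richardson`, `build_example`) are hypothetical, and a simple preconditioned Richardson iteration stands in for whatever Krylov method would be used in practice. The band is fixed to half-bandwidth 1 so that a plain Thomas solve suffices.

```python
def extract_band(A, k):
    """Return a copy of dense matrix A keeping only entries with |i - j| <= k."""
    n = len(A)
    return [[A[i][j] if abs(i - j) <= k else 0.0 for j in range(n)]
            for i in range(n)]


def solve_tridiagonal(M, r):
    """Solve M z = r for tridiagonal M with the Thomas algorithm.

    No pivoting is done, so this assumes M is diagonally dominant.
    """
    n = len(r)
    diag = [M[i][i] for i in range(n)]
    rhs = list(r)
    # Forward elimination of the sub-diagonal.
    for i in range(1, n):
        w = M[i][i - 1] / diag[i - 1]
        diag[i] -= w * M[i - 1][i]
        rhs[i] -= w * rhs[i - 1]
    # Back substitution using the untouched super-diagonal.
    z = [0.0] * n
    z[n - 1] = rhs[n - 1] / diag[n - 1]
    for i in range(n - 2, -1, -1):
        z[i] = (rhs[i] - M[i][i + 1] * z[i + 1]) / diag[i]
    return z


def matvec(A, x):
    """Dense matrix-vector product."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]


def preconditioned_richardson(A, b, tol=1e-10, max_iter=200):
    """Iterate x <- x + M^{-1} (b - A x), where M is the tridiagonal band of A."""
    M = extract_band(A, 1)
    x = [0.0] * len(b)
    for it in range(max_iter):
        r = [bi - axi for bi, axi in zip(b, matvec(A, x))]
        if max(abs(v) for v in r) < tol:
            return x, it
        z = solve_tridiagonal(M, r)
        x = [xi + zi for xi, zi in zip(x, z)]
    return x, max_iter


def build_example(n=6):
    """Illustrative test matrix: strong tridiagonal part plus weak fill elsewhere."""
    A = [[0.05] * n for _ in range(n)]
    for i in range(n):
        A[i][i] = 4.0
        if i > 0:
            A[i][i - 1] = -1.0
        if i < n - 1:
            A[i][i + 1] = -1.0
    return A


if __name__ == "__main__":
    A = build_example()
    x_true = [float(i + 1) for i in range(len(A))]
    b = matvec(A, x_true)
    x, iters = preconditioned_richardson(A, b)
    print("iterations:", iters)
    print("max error:", max(abs(xi - ti) for xi, ti in zip(x, x_true)))
```

The sketch converges quickly only because the entries dropped from the band are small relative to the diagonal. When heavy entries sit far from the diagonal, the band captures little of the matrix and the iteration can stall or diverge.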
This paper addresses the parallelization of the preconditioned iterative methods that use explicit p...
This contribution outlines an approach that draws on general purpose graphics processing unit (GPGPU...
Banded linear systems with large bandwidths can be solved by methods similar to those used for full linear systems....
With the availability of large-scale parallel platforms comprising tens of thousands of processors and...
A popular class of preconditioners is known as incomplete factorizations. They can be thought of as ...
We review current methods for preconditioning systems of equations for their solution using iterativ...
Solving large-scale systems of linear equations [A]{x} = {b} is one of the most expensive and cr...
We present implementation details of a reordering strategy for permuting elements whose absolute val...
Sequential and parallel algorithms based on the LU factorization or the QR factorization have been i...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
In this paper, we introduce a class of recursive multilevel preconditioning strategies suited for so...
University of Minnesota Ph.D. dissertation. December 2011. Major: Scientific Computation. Advisor: ...
University of Minnesota Ph.D. dissertation. June 2015. Major: Computer Science. Advisor: Yousef Saad...