Abstract. Numerical solutions of nonlinear partial differential equations frequently rely on iterative Newton-Krylov methods, which linearize a finite-difference stencil-based discretization of a problem, producing a sparse matrix with regular structure. Knowledge of this structure can be used to exploit parallelism and locality of reference on modern cache-based multi- and many-core architectures, achieving high performance for computations underlying commonly used iterative linear solvers. In this paper we describe our approach to sparse matrix data structure design and our implementation of the kernels underlying iterative linear solvers in PETSc. We also describe autotuning of CUDA implementations based on high-level descriptions of the...
Abstract. The limiting factor for efficiency of sparse linear solvers is the memory bandwidth. In th...
AbstractFinite-Differencing and other regular and direct approaches to solving partial differential ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
Abstract PETSc is a scalable solver library for the solution of algebraic equations arising from the...
International audienceIn this paper, we present and analyze parallel substructuring methods based on...
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
The research conducted in this thesis provides a robust implementation of a preconditioned iterative...
Abstract. We present a new sparse linear solver for GPUs. It is designed to work with structured spa...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Abstract—Krylov subspace solvers are often the method of choice when solving sparse linear systems i...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
IEEE Computer SocietyInternational audienceThe main objective of this work consists in analyzing sub...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. ...
Finite-Differencing and other regular and direct approaches to solving partial differential equation...
Abstract. The limiting factor for efficiency of sparse linear solvers is the memory bandwidth. In th...
AbstractFinite-Differencing and other regular and direct approaches to solving partial differential ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
Abstract PETSc is a scalable solver library for the solution of algebraic equations arising from the...
International audienceIn this paper, we present and analyze parallel substructuring methods based on...
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
The research conducted in this thesis provides a robust implementation of a preconditioned iterative...
Abstract. We present a new sparse linear solver for GPUs. It is designed to work with structured spa...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Abstract—Krylov subspace solvers are often the method of choice when solving sparse linear systems i...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
IEEE Computer SocietyInternational audienceThe main objective of this work consists in analyzing sub...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. ...
Finite-Differencing and other regular and direct approaches to solving partial differential equation...
Abstract. The limiting factor for efficiency of sparse linear solvers is the memory bandwidth. In th...
AbstractFinite-Differencing and other regular and direct approaches to solving partial differential ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...