International audienceIn this paper, we aim at exploiting the power computing of a graphics processing unit (GPU) cluster for solving large sparse linear systems. We implement the parallel algorithm of the generalized minimal residual iterative method using the Compute Unified Device Architecture programming language and the MPI parallel environment. The experiments show that a GPU cluster is more efficient than a CPU cluster. In order to optimize the performances, we use a compressed storage format for the sparse vectors and the hypergraph partitioning. These solutions improve the spatial and temporal localization of the shared data between the computing nodes of the GPU cluster
Hybrid nodes containing GPUs are rapidly becoming the norm in parallel machines. We have conducted s...
Abstract. We present a new sparse linear solver for GPUs. It is designed to work with structured spa...
Solving triangular systems is the building block for preconditioned GMRES algorithm. Inexact precond...
International audienceIn this paper, we aim at exploiting the power computing of a graphics processi...
Or the past few years, the clusters equipped with GPUs have become attractive tools for high perform...
International audienceScientific applications very often rely on solving one or more linear systems....
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
Depuis quelques années, les grappes équipées de processeurs graphiques GPUs sont devenues des outils...
IEEE Computer SocietyInternational audienceThe main objective of this work consists in analyzing sub...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
We consider a fast, robust and scalable solver using graphic processing units (GPU) as accelerators ...
International audienceGrid computing focuses on making use of a very large amount of resources from ...
International audienceIn this paper, we present and analyze parallel substructuring methods based on...
Hybrid nodes containing GPUs are rapidly becoming the norm in parallel machines. We have conducted s...
Abstract. We present a new sparse linear solver for GPUs. It is designed to work with structured spa...
Solving triangular systems is the building block for preconditioned GMRES algorithm. Inexact precond...
International audienceIn this paper, we aim at exploiting the power computing of a graphics processi...
Or the past few years, the clusters equipped with GPUs have become attractive tools for high perform...
International audienceScientific applications very often rely on solving one or more linear systems....
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
Depuis quelques années, les grappes équipées de processeurs graphiques GPUs sont devenues des outils...
IEEE Computer SocietyInternational audienceThe main objective of this work consists in analyzing sub...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
We consider a fast, robust and scalable solver using graphic processing units (GPU) as accelerators ...
International audienceGrid computing focuses on making use of a very large amount of resources from ...
International audienceIn this paper, we present and analyze parallel substructuring methods based on...
Hybrid nodes containing GPUs are rapidly becoming the norm in parallel machines. We have conducted s...
Abstract. We present a new sparse linear solver for GPUs. It is designed to work with structured spa...
Solving triangular systems is the building block for preconditioned GMRES algorithm. Inexact precond...