Hybrid nodes containing GPUs are rapidly becoming the norm in parallel machines. We have conducted some experiments regarding how to plug GPU-enabled computational kernels into PSBLAS, a MPI-based library specifically geared towards sparse matrix computations. In this paper, we present our findings on which strategies are more promising in the quest for the optimal compromise among raw performance, speedup, software maintainability, and extensibility. We consider several solutions to implement the data exchange with the GPU focusing on the data access and transfer, and present an experimental evaluation for a cluster system with up to two GPUs per node. In particular, we compare the pinned memory and the Open-MPI approaches, which are the t...
Sparse times dense matrix multiplication (SpMM) finds its applications in well-established fields su...
Many important problems in science and engineering today deal with sparse data. Examples of sparse d...
International audienceThis book chapter proposes to draw several development methodologies to obtain...
Hybrid nodes containing GPUs are rapidly becoming the norm in parallel machines. We have conducted s...
Sparse matrix-vector multiplication (spMVM) is the dominant operation in many sparse solvers. We in...
Sparse matrix computations are ubiquitous in scientific computing; General-Purpose computing on Grap...
Sparse matrix computations are ubiquitous in scientific computing; General-Purpose computing on Grap...
Hybrid GPU/CPU clusters are becoming very popular in the scientific computing community, as attested...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
We apply object-oriented software design patterns to develop code for scientific software involving ...
International audienceIn this paper, we aim at exploiting the power computing of a graphics processi...
HipMCL is a high-performance distributed memory implementation of the popular Markov Cluster Algorit...
We apply object-oriented software design patterns to develop code for scientific software involving ...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
Sparse times dense matrix multiplication (SpMM) finds its applications in well-established fields su...
Many important problems in science and engineering today deal with sparse data. Examples of sparse d...
International audienceThis book chapter proposes to draw several development methodologies to obtain...
Hybrid nodes containing GPUs are rapidly becoming the norm in parallel machines. We have conducted s...
Sparse matrix-vector multiplication (spMVM) is the dominant operation in many sparse solvers. We in...
Sparse matrix computations are ubiquitous in scientific computing; General-Purpose computing on Grap...
Sparse matrix computations are ubiquitous in scientific computing; General-Purpose computing on Grap...
Hybrid GPU/CPU clusters are becoming very popular in the scientific computing community, as attested...
This repository contains the code and scripts for verifying the claims in the paper "Design Principl...
We apply object-oriented software design patterns to develop code for scientific software involving ...
International audienceIn this paper, we aim at exploiting the power computing of a graphics processi...
HipMCL is a high-performance distributed memory implementation of the popular Markov Cluster Algorit...
We apply object-oriented software design patterns to develop code for scientific software involving ...
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on the GPU. O...
Sparse times dense matrix multiplication (SpMM) finds its applications in well-established fields su...
Many important problems in science and engineering today deal with sparse data. Examples of sparse d...
International audienceThis book chapter proposes to draw several development methodologies to obtain...