International audienceLarge clusters and supercomputers are rapidly evolving and may be subject to regular hardware updates that increase the chances of becoming heterogeneous. Homogeneous clusters may also have variable performance capabilities due to processor manufacturing, or even partitions equipped with different types of accelerators. Data distribution over heterogeneous nodes is very challenging but essential to exploit all resources efficiently. In this article, we build upon task-based runtimes' flexibility to study the interplay between static communication-aware data distribution strategies and dynamic scheduling of the linear algebra LU factorization over heterogeneous sets of hybrid nodes. We propose two techniques derived fro...
We present a novel approach of distributing matrix multiplications among GPU-equipped nodes in a clu...
In this paper, an adaptive matrix multiplication algorithm for dynamic heterogeneous environments is...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...
International audienceLarge clusters and supercomputers are rapidly evolving and may be subject to r...
(eng) In this paper, we deal with algorithmic issues on heterogeneous platforms. We concentrate on d...
Most supercomputers are shipped with both a CPU and a GPU. With the powerful parallel computing capa...
This paper presents and analyzes two different strategies of heterogeneous distribution of computati...
In this paper, we consider task-based dense linear algebra applications on a single heterogeneous no...
National audienceIn this paper, we deal with algorithmic issues on heterogeneous platforms. We conce...
GPU-based heterogeneous clusters continue to draw atten-tion from vendors and HPC users due to their...
We present the use of a hybrid static/dynamic scheduling strategy of the task dependency graph for d...
Abstract—Dense LU factorization is a prominent benchmark used to rank the performance of supercomput...
This paper discusses some algorithmic issues when computing with a heterogeneous network of workstat...
International audienceIn this paper, we deal with algorithmic issues on heterogeneous platforms. We ...
(eng) Future computing platforms will be distributed and heterogeneous. Such platforms range from he...
We present a novel approach of distributing matrix multiplications among GPU-equipped nodes in a clu...
In this paper, an adaptive matrix multiplication algorithm for dynamic heterogeneous environments is...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...
International audienceLarge clusters and supercomputers are rapidly evolving and may be subject to r...
(eng) In this paper, we deal with algorithmic issues on heterogeneous platforms. We concentrate on d...
Most supercomputers are shipped with both a CPU and a GPU. With the powerful parallel computing capa...
This paper presents and analyzes two different strategies of heterogeneous distribution of computati...
In this paper, we consider task-based dense linear algebra applications on a single heterogeneous no...
National audienceIn this paper, we deal with algorithmic issues on heterogeneous platforms. We conce...
GPU-based heterogeneous clusters continue to draw atten-tion from vendors and HPC users due to their...
We present the use of a hybrid static/dynamic scheduling strategy of the task dependency graph for d...
Abstract—Dense LU factorization is a prominent benchmark used to rank the performance of supercomput...
This paper discusses some algorithmic issues when computing with a heterogeneous network of workstat...
International audienceIn this paper, we deal with algorithmic issues on heterogeneous platforms. We ...
(eng) Future computing platforms will be distributed and heterogeneous. Such platforms range from he...
We present a novel approach of distributing matrix multiplications among GPU-equipped nodes in a clu...
In this paper, an adaptive matrix multiplication algorithm for dynamic heterogeneous environments is...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...