This paper presents a self-optimization methodology for parallel linear algebra rou-tines on heterogeneous systems. For each routine, a series of decisions is taken automatically in order to obtain an execution time close to the optimum (without rewriting the routine’s code). Some of these decisions are: the number of processes to generate, the heterogeneous distribution of these processes over the network of processors, the logical topology of the generated processes,... To reduce the search-ing space of such decisions, different heuristics have been used. The experiments have been performed with a parallel LU factorization routine similar to the ScaLAPACK one, and good results have been obtained on different heterogeneous platforms
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
This paper discusses some algorithmic issues when computing with a heterogeneous network of workstat...
International audienceThis paper discusses some algorithmic issues when computing with a heterogeneo...
This paper presents and analyzes two different strategies of heterogeneous distribution of computati...
The aim of data and task parallel scheduling for dense linear algebra kernels is to minimize the pro...
In this document, we describe two strategies of distribution of computations that can be used to imp...
This paper describes the design and the implementation of parallel routines in the Heterogeneous Sca...
Abstract—Two strategies of distribution of computations can be used to implement parallel solvers fo...
In this document, we describe two strategies of distribution of computations that can be used to imp...
. In this paper we study the design of installation routines for linear algebra routines on network...
This paper presents a package, called Heterogeneous PBLAS (HeteroPBLAS), which is built on top of PB...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...
We present a package, called Heterogeneous PBLAS (HeteroPBLAS), which is built on top of PBLAS and p...
In this paper, we consider task-based dense linear algebra applications on a single heterogeneous no...
Parallel performance optimization is being applied and further improvements are studied for parallel...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
This paper discusses some algorithmic issues when computing with a heterogeneous network of workstat...
International audienceThis paper discusses some algorithmic issues when computing with a heterogeneo...
This paper presents and analyzes two different strategies of heterogeneous distribution of computati...
The aim of data and task parallel scheduling for dense linear algebra kernels is to minimize the pro...
In this document, we describe two strategies of distribution of computations that can be used to imp...
This paper describes the design and the implementation of parallel routines in the Heterogeneous Sca...
Abstract—Two strategies of distribution of computations can be used to implement parallel solvers fo...
In this document, we describe two strategies of distribution of computations that can be used to imp...
. In this paper we study the design of installation routines for linear algebra routines on network...
This paper presents a package, called Heterogeneous PBLAS (HeteroPBLAS), which is built on top of PB...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...
We present a package, called Heterogeneous PBLAS (HeteroPBLAS), which is built on top of PBLAS and p...
In this paper, we consider task-based dense linear algebra applications on a single heterogeneous no...
Parallel performance optimization is being applied and further improvements are studied for parallel...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
This paper discusses some algorithmic issues when computing with a heterogeneous network of workstat...
International audienceThis paper discusses some algorithmic issues when computing with a heterogeneo...