This paper introduces general techniques for optimizing the parallel execution time of sorting on distributed architectures whose processors run at different speeds. Such an application requires a partitioning step. For uniformly related processors (processor speeds related by a constant factor), we develop a constant-time technique for controlling processor load and execution time in a heterogeneous environment, as well as a technique for dealing with unknown cost functions. For non-uniformly related processors, we use a technique based on dynamic programming. Most of the time, the solutions are in O(p) (where p is the number of processors), independent of the problem size n. Consequently, there is a small over...
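As a rough illustration of the partitioning step for uniformly related processors, the sketch below splits n items into chunks proportional to integer relative speeds. It is not the paper's method: the function name `proportional_partition`, the integer-speed representation, and the linear-cost assumption (work proportional to chunk size, ignoring the n log n cost of sorting) are all assumptions made for this example. The computation takes O(p log p) time in the number of processors p and does not depend on n.

```python
def proportional_partition(n, speeds):
    """Return chunk sizes summing to n, proportional to integer relative speeds.

    Hypothetical helper for illustration only; assumes a linear cost model.
    """
    if n < 0 or not speeds or any(s <= 0 for s in speeds):
        raise ValueError("n must be >= 0 and speeds must be positive integers")
    total = sum(speeds)
    # Floor of each processor's exact share n * s / total.
    sizes = [n * s // total for s in speeds]
    # Flooring loses fewer than one item per processor; hand the leftover
    # items to the processors with the largest fractional remainders.
    leftover = n - sum(sizes)
    order = sorted(range(len(speeds)),
                   key=lambda i: (n * speeds[i]) % total,
                   reverse=True)
    for i in order[:leftover]:
        sizes[i] += 1
    return sizes


if __name__ == "__main__":
    # Three processors; the last two are twice as fast as the first.
    print(proportional_partition(100, [1, 2, 2]))
```

A faster processor receives a proportionally larger chunk, so under the assumed linear cost model all processors finish at about the same time.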
The task-based approach is a parallelization paradigm in which an algorithm is...
This paper analyzes the dynamic and static balancing of non-homogenous cluster architectures, simult...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
The aim of the paper is to introduce general techniques in order to optimize t...
The current state and foreseeable future of high performance scientific computing (HPC) can be descr...
The paper presents a new data partitioning algorithm for parallel computing on heterogeneo...
The paper deals with the problem of parallel external integer sorting in the context of a class of h...
In this paper, we address the problem of optimal distribution of computational tasks on a network o...
The paper presents a performance model that can be used to optimally distribute computation...
The paper presents a performance model that can be used to optimally distribute computation...
A parallel sorting algorithm for sorting n elements evenly di...
This paper is devoted to static load balancing techniques for mapping iterative algorithms onto hete...
We study the problem of one-dimensional partitioning of nonuni...
The growing importance and interest in parallel processing within Computer Sciences are undeniable, ...
Recently we proposed algorithms for concurrent execution on multiple clusters [11]. In this case, da...