Abstract. The paper presents a new data partitioning algorithm for parallel computing on heterogeneous processors. Like traditional functional partitioning algorithms, the algorithm assumes that the speed of the processors is character-ized by speed functions rather than speed constants. Unlike the traditional algo-rithms, it does not assume the speed functions to be given. Instead, it uses a computational kernel to estimate the speed functions of the processors for dif-ferent problem sizes during its execution. This makes the algorithm distributed as its execution involves all the heterogeneous processors. The algorithm does not construct the complete speed function for each processor but rather builds and uses their partial estimates suff...
Recently we proposed algorithms for concurrent execution on multiple clusters [11]. In this case, da...
In this document, we describe two strategies of distribution of computations that can be used to imp...
Abstract. Traditional load balancing algorithms for data-intensive iterative routines can successful...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
In this paper, we address the problem of optimal distribu-tion of computational tasks on a network o...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
The paper presents a performance model that can be used to optimally distribute computations over he...
International audienceThe aim of the paper is to introduce general techniques in order to optimize t...
Abstract. High performance of data-parallel applications on heterogeneous platforms can be achieved ...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
Abstract—The paper presents a performance model that can be used to optimally distribute computation...
Heterogeneous distributed computing systems are an economical and efficient architecture for process...
The current state and foreseeable future of high performance scientific computing (HPC) can be descr...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
Recently we proposed algorithms for concurrent execution on multiple clusters [11]. In this case, da...
In this document, we describe two strategies of distribution of computations that can be used to imp...
Abstract. Traditional load balancing algorithms for data-intensive iterative routines can successful...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
In this paper, we address the problem of optimal distribu-tion of computational tasks on a network o...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
The paper presents a performance model that can be used to optimally distribute computations over he...
International audienceThe aim of the paper is to introduce general techniques in order to optimize t...
Abstract. High performance of data-parallel applications on heterogeneous platforms can be achieved ...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
Abstract—The paper presents a performance model that can be used to optimally distribute computation...
Heterogeneous distributed computing systems are an economical and efficient architecture for process...
The current state and foreseeable future of high performance scientific computing (HPC) can be descr...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
Recently we proposed algorithms for concurrent execution on multiple clusters [11]. In this case, da...
In this document, we describe two strategies of distribution of computations that can be used to imp...
Abstract. Traditional load balancing algorithms for data-intensive iterative routines can successful...