Abstract. High performance of data-parallel applications on heterogeneous platforms can be achieved by partitioning the data in proportion to the speeds of processors. It has been proven that the speed functions built from a history of time measurements better reflect different aspects of heterogeneity of processors. However, existing data partitioning algorithms based on functional performance models impose some restrictions on the shape of speed functions, which are not always satisfied if we try to approximate the real-life measurements accurately enough. This paper presents a new data partitioning algorithm that applies multidimensional solvers to numerical solution of the system of non-linear equations formalizing the problem of optima...
This dissertation presents optimization techniques for efficient data parallel formulation/implement...
(eng) Future computing platforms will be distributed and heterogeneous. Such platforms range from he...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
The paper presents a performance model that can be used to optimally distribute computations over he...
Abstract. The paper presents a new data partitioning algorithm for parallel computing on heterogeneo...
Abstract—The paper presents a performance model that can be used to optimally distribute computation...
In this paper, we address the problem of optimal distribu-tion of computational tasks on a network o...
The current state and foreseeable future of high performance scientific computing (HPC) can be descr...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
International audienceThe aim of the paper is to introduce general techniques in order to optimize t...
In multiprocessor systems, data parallelism is the execution of the same task on data distributed ac...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
[[abstract]]©1988 Springer Verlag-Designing efficient parallel algorithms in a message-based paralle...
Parallel computing hardware is affordable and accessible, yet parallel programming is not as widespr...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
This dissertation presents optimization techniques for efficient data parallel formulation/implement...
(eng) Future computing platforms will be distributed and heterogeneous. Such platforms range from he...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
The paper presents a performance model that can be used to optimally distribute computations over he...
Abstract. The paper presents a new data partitioning algorithm for parallel computing on heterogeneo...
Abstract—The paper presents a performance model that can be used to optimally distribute computation...
In this paper, we address the problem of optimal distribu-tion of computational tasks on a network o...
The current state and foreseeable future of high performance scientific computing (HPC) can be descr...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
International audienceThe aim of the paper is to introduce general techniques in order to optimize t...
In multiprocessor systems, data parallelism is the execution of the same task on data distributed ac...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
[[abstract]]©1988 Springer Verlag-Designing efficient parallel algorithms in a message-based paralle...
Parallel computing hardware is affordable and accessible, yet parallel programming is not as widespr...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
This dissertation presents optimization techniques for efficient data parallel formulation/implement...
(eng) Future computing platforms will be distributed and heterogeneous. Such platforms range from he...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...