One-dimensional decomposition of nonuniform workload arrays for optimal load balancing is investigated. The problem has been studied in the literature as the "chains-on-chains partitioning" problem. Despite extensive research efforts, heuristics are still used in the parallel computing community with the "hope" of good decompositions and the "myth" that exact algorithms are hard to implement and not runtime efficient. The main objective of this paper is to show that using exact algorithms instead of heuristics yields significant load balance improvements with a negligible increase in preprocessing time. We provide detailed pseudocodes of our algorithms so that our results can be easily reproduced. We start with a review of the literature on cha...
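To make the contrast between heuristic and exact decompositions concrete, the following is a minimal Python sketch and not the paper's pseudocode. It assumes positive integer task weights and identical processors. The function names (`probe`, `exact_bottleneck`, `heuristic_bottleneck`) are hypothetical, the exact method shown is a plain bisection over integer bottleneck values with a greedy feasibility probe, and the heuristic simply places each separator at the first prefix sum that reaches a multiple of the average load.

```python
from bisect import bisect_left, bisect_right
from itertools import accumulate


def probe(prefix, parts, bottleneck):
    """Greedy feasibility test: can the chain be cut into at most `parts`
    consecutive pieces, each with load <= bottleneck?"""
    n = len(prefix) - 1
    start = 0
    for _ in range(parts):
        # Farthest index `end` with prefix[end] - prefix[start] <= bottleneck.
        end = bisect_right(prefix, prefix[start] + bottleneck) - 1
        if end == start:
            return False  # the next single task already exceeds the bottleneck
        start = end
        if start == n:
            return True
    return False


def exact_bottleneck(weights, parts):
    """Smallest achievable maximum part load (assumes positive integer weights)."""
    prefix = [0, *accumulate(weights)]
    total = prefix[-1]
    # Lower bound: heaviest task or the ceiling of the average load.
    lo = max(max(weights), -(-total // parts))
    hi = total
    while lo < hi:
        mid = (lo + hi) // 2
        if probe(prefix, parts, mid):
            hi = mid
        else:
            lo = mid + 1
    return lo


def heuristic_bottleneck(weights, parts):
    """Illustrative heuristic: cut at the first prefix sum reaching k * average."""
    prefix = [0, *accumulate(weights)]
    total = prefix[-1]
    cuts = [0]
    for k in range(1, parts):
        cuts.append(max(cuts[-1], bisect_left(prefix, k * total / parts)))
    cuts.append(len(weights))
    return max(prefix[b] - prefix[a] for a, b in zip(cuts, cuts[1:]))


if __name__ == "__main__":
    workload = [1, 1, 1, 1, 8]
    print("heuristic:", heuristic_bottleneck(workload, 2))
    print("exact:    ", exact_bottleneck(workload, 2))
```

On the small workload in the `__main__` block, the heuristic assigns every task to the first part, while the exact search isolates the heavy task. Each probe costs one binary search per part on the prefix-sum array, so under these assumptions the exact search adds only a logarithmic number of cheap probes.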
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
In this paper, we present a new load balancing technique, called panel scattering, which is generall...
Parallel iterative applications often suffer from load imbalance, one of the most critical performan...
The one-dimensional decomposition of nonuniform workload arrays with optimal load balancing is inves...
We study the problem of one-dimensional partitioning of nonuniform workload arrays with optimal load...
We study the problem of one-dimensional partitioning of nonuniform workload arrays, with optimal loa...
Load balancing in the decomposition of sparse matrices without disturbing the row/column ordering i...
We investigate one-dimensional partitioning of sparse matrices under a given o...
Given a set of 1D intervals and a desired partition number, this paper studies how to make an opt...
Distributing spatially located heterogeneous workloads is an important problem in parallel scientifi...
A method is outlined for optimising graph partitions which arise in mapping unstructured mesh calcul...
In parallel computing, obtaining maximal performance is often mandatory to solve large and complex p...
Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines ...
This chapter describes a parallel optimization technique that incorporates a distributed load-balanc...