In this paper, we consider the problem of partitioning a square into a set of zones of prescribed areas, while minimizing the overall size of their projections onto horizontal and vertical axes. This problem typically arises when considering the amount of communications induced when partitioning matrices for dense linear algebra kernels onto a set of heterogeneous processors. It has been first introduced for matrix multiplication in the 2000's, with a best known approximation ratio was 1.75. Since then, two main new ingredients have been introduced. First, Lastovetsky et al. proposed a special partitioning in the case of 2 or 3 strongly heterogeneous processors, as in the case of a platform made of CPUs and GPUs, relaxing the constraint of ...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...
We consider the problem of data allocation when performing matrix multiplication on a heterogeneous ...
International audienceIn this paper, we consider the problem of partitioning a square into a set of ...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
The problem of partitioning a square into zones of prescribed areas arises when partitioning matrice...
In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplicati...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
In this paper, we deal with two geometric problems arising from heterogeneous parallel computing: ho...
International audienceIn this paper, we deal with two geometric problems arising from heterogeneous ...
In this pape6 we deal with n ~ o geometric problems arising froin heterogeneous parallel computing: ...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
(eng) We study the implementation of dense linear algebra computations, such as matrix multiplicatio...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...
We consider the problem of data allocation when performing matrix multiplication on a heterogeneous ...
International audienceIn this paper, we consider the problem of partitioning a square into a set of ...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
The problem of partitioning a square into zones of prescribed areas arises when partitioning matrice...
In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplicati...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
In this paper, we deal with two geometric problems arising from heterogeneous parallel computing: ho...
International audienceIn this paper, we deal with two geometric problems arising from heterogeneous ...
In this pape6 we deal with n ~ o geometric problems arising froin heterogeneous parallel computing: ...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
(eng) We study the implementation of dense linear algebra computations, such as matrix multiplicatio...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...
We consider the problem of data allocation when performing matrix multiplication on a heterogeneous ...