In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplication. Building multi-core processors based on heterogeneous cores is an important current trend. In this context, it is of great interest to study optimal matrix partitioning algorithms for small cases (i.e. small number of cores). Indeed, the general case, with relatively high numbers of heterogeneous resources is now well understood, however the problem is in general NP-Complete when one aims at balancing the load while minimizing the communications. Nonetheless several approximation algorithms have been successfully designed. Nevertheless, negative complexity results do not apply for very few heterogeneous cores. Additionally, the case of a ...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...
In this document, we describe two strategies of distribution of computations that can be used to imp...
In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplicati...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
In this paper, we consider the problem of partitioning a square into a set of zones of prescribed ar...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
(eng) In this paper, we address the issue of implementing matrix-matrix multiplication on heterogene...
AbstractÐIn this paper, we address the issue of implementing matrix multiplication on heterogeneous ...
Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines ...
In this document, we describe two strategies of distribution of computations that can be used to imp...
Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines ...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...
In this document, we describe two strategies of distribution of computations that can be used to imp...
In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplicati...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
In this paper, we consider the problem of partitioning a square into a set of zones of prescribed ar...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more ...
(eng) In this paper, we address the issue of implementing matrix-matrix multiplication on heterogene...
AbstractÐIn this paper, we address the issue of implementing matrix multiplication on heterogeneous ...
Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines ...
In this document, we describe two strategies of distribution of computations that can be used to imp...
Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines ...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...
In this document, we describe two strategies of distribution of computations that can be used to imp...