Abstract. The functional performance model (FPM) of heterogeneous proces-sors has proven to be more realistic than the traditional models because it integrates many important features of heterogeneous processors such as the processor heterogeneity, the heterogeneity of memory structure, and the effects of paging. Optimal 1D matrix partitioning algorithms employing FPMs of het-erogeneous processors are already being used in solving complicated linear al-gebra kernel such as dense factorizations. However, 2D matrix partitioning algorithms for parallel computing on heterogeneous processors based on their FPMs are unavailable. In this paper, we address this deficiency by presenting a novel iterative algorithm for partitioning a dense matrix ove...
In this document, we describe two strategies of distribution of computations that can be used to imp...
In this document, we describe two strategies of distribution of computations that can be used to imp...
We consider two-dimensional partitioning of general sparse matrices for parallel sparse matrix-vecto...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
Abstract. The paper presents a new data partitioning algorithm for parallel computing on heterogeneo...
In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplicati...
(eng) We study the implementation of dense linear algebra computations, such as matrix multiplicatio...
International audienceWe study the implementation of dense linear algebra computations, such as matr...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
In this paper, we consider the problem of partitioning a square into a set of zones of prescribed ar...
In this paper, we address the problem of optimal distribu-tion of computational tasks on a network o...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
Matrix Factorization (MF) has been widely applied in machine learning and data mining. Due to the la...
In this document, we describe two strategies of distribution of computations that can be used to imp...
In this document, we describe two strategies of distribution of computations that can be used to imp...
We consider two-dimensional partitioning of general sparse matrices for parallel sparse matrix-vecto...
Abstract. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel de...
Abstract. The paper presents a new data partitioning algorithm for parallel computing on heterogeneo...
In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplicati...
(eng) We study the implementation of dense linear algebra computations, such as matrix multiplicatio...
International audienceWe study the implementation of dense linear algebra computations, such as matr...
2012 IEEE 26th Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shang...
Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), October, 2...
In this paper, we consider the problem of partitioning a square into a set of zones of prescribed ar...
In this paper, we address the problem of optimal distribu-tion of computational tasks on a network o...
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous multicore and mul...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
Matrix Factorization (MF) has been widely applied in machine learning and data mining. Due to the la...
In this document, we describe two strategies of distribution of computations that can be used to imp...
In this document, we describe two strategies of distribution of computations that can be used to imp...
We consider two-dimensional partitioning of general sparse matrices for parallel sparse matrix-vecto...