Integrating polyalgorithm library with optimized linear algebra libraries on HPC platforms, leveraging 2.5D version of matrix-matrix multiplication of large non-square dense matrices, based on varying algorithm as a function of problem, concurrency
We identify the challenges that are special to parallel sparse matrix-matrix multiplication (PSpGEMM...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...
Architecture for dense matrix multiplication on a high-performance reconfigurable syste
The polyalgorithm library, originally designed in 1991-1993 by Robert Falgout, Jin Li, and Anthony S...
Abstract. Sparse matrix-matrix multiplication (or SpGEMM) is a key primitive for many high-performan...
AbstractÐIn this paper, we address the issue of implementing matrix multiplication on heterogeneous ...
The Multicomputer Toolbox includes sparse, dense, and iterative scalable linear algebra libraries. D...
We have repurposed Google Tensor Processing Units (TPUs), application-specific chips developed for m...
In this document, we describe two strategies of distribution of computations that can be used to imp...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
International audienceWe study the implementation of dense linear algebra computations, such as matr...
International audienceSparse matrix-matrix multiplication (or SpGEMM) is a key primitive for many hi...
Abstract: Few realize that, for large matrices, many dense matrix computations achieve nearly the sa...
This paper talks about different types of algorithms fro matrix multiplication when applied to paral...
(eng) We study the implementation of dense linear algebra computations, such as matrix multiplicatio...
We identify the challenges that are special to parallel sparse matrix-matrix multiplication (PSpGEMM...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...
Architecture for dense matrix multiplication on a high-performance reconfigurable syste
The polyalgorithm library, originally designed in 1991-1993 by Robert Falgout, Jin Li, and Anthony S...
Abstract. Sparse matrix-matrix multiplication (or SpGEMM) is a key primitive for many high-performan...
AbstractÐIn this paper, we address the issue of implementing matrix multiplication on heterogeneous ...
The Multicomputer Toolbox includes sparse, dense, and iterative scalable linear algebra libraries. D...
We have repurposed Google Tensor Processing Units (TPUs), application-specific chips developed for m...
In this document, we describe two strategies of distribution of computations that can be used to imp...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
International audienceWe study the implementation of dense linear algebra computations, such as matr...
International audienceSparse matrix-matrix multiplication (or SpGEMM) is a key primitive for many hi...
Abstract: Few realize that, for large matrices, many dense matrix computations achieve nearly the sa...
This paper talks about different types of algorithms fro matrix multiplication when applied to paral...
(eng) We study the implementation of dense linear algebra computations, such as matrix multiplicatio...
We identify the challenges that are special to parallel sparse matrix-matrix multiplication (PSpGEMM...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...
Architecture for dense matrix multiplication on a high-performance reconfigurable syste