This report presents two new ASSEMBLER-subroutines MULR8 and MULC16 for fast multiplication of especially large and not necessarily quadratic matrices with real and complex entries, respectively, developed for the IBM 370/168. Matrix-multiplication is performed by P.C. FISCHER's improvement of the well-known V. STRASSEN-algorithm combined with the method of S. WINOGRAD. By a convenient addition of zero rows and columns, the original matrix dimensions are modified such that anoptimal application of STRASSEN's block method is guaranteed, whereas these zeros are only treated as fictitious, non-arithmetic constants. With respect to execution time, numerical results demonstrate that these new subroutines are superior to other well-known methods ...
AbstractPerformance characteristics of dense and structured blocked linear system solvers are studie...
International audienceWe propose several new schedules for Strassen-Winograd's matrix multiplication...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
AbstractThe main purpose of this paper is to present a fast matrix multiplication algorithm taken fr...
Strassen's algorithm is a divide and conquer matrix multiplication method that is mostly of theoreti...
Strassen’s matrix multiplication reduces the computational cost of multiplying matrices of size n × ...
The Fortran--90 standard requires an intrinsic function matmul which multiplies two matrices togethe...
Abstract. Strassen's algorithm for fast matrix-matrix multiplication has been implemented for m...
This paper examines how to write code to gain high performance on modern computers as well as the im...
The paper presents analysis of matrix multiplication algorithms from the point of view of their effi...
Abstract: Strassen’s algorithm to multiply two n×n matrices reduces the asymptotic operation count f...
Matrix multiplication is significant in a lot of scientific fields, such as mathematics, physics and...
Today current era of scientific computing and computational theory involves high exhaustive data com...
We present a parallel method for matrix multiplication on distributedmemory MIMD architectures based...
Strassen's algorithm for matrix multiplication gains its lower arithmetic complexityatthe expe...
AbstractPerformance characteristics of dense and structured blocked linear system solvers are studie...
International audienceWe propose several new schedules for Strassen-Winograd's matrix multiplication...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
AbstractThe main purpose of this paper is to present a fast matrix multiplication algorithm taken fr...
Strassen's algorithm is a divide and conquer matrix multiplication method that is mostly of theoreti...
Strassen’s matrix multiplication reduces the computational cost of multiplying matrices of size n × ...
The Fortran--90 standard requires an intrinsic function matmul which multiplies two matrices togethe...
Abstract. Strassen's algorithm for fast matrix-matrix multiplication has been implemented for m...
This paper examines how to write code to gain high performance on modern computers as well as the im...
The paper presents analysis of matrix multiplication algorithms from the point of view of their effi...
Abstract: Strassen’s algorithm to multiply two n×n matrices reduces the asymptotic operation count f...
Matrix multiplication is significant in a lot of scientific fields, such as mathematics, physics and...
Today current era of scientific computing and computational theory involves high exhaustive data com...
We present a parallel method for matrix multiplication on distributedmemory MIMD architectures based...
Strassen's algorithm for matrix multiplication gains its lower arithmetic complexityatthe expe...
AbstractPerformance characteristics of dense and structured blocked linear system solvers are studie...
International audienceWe propose several new schedules for Strassen-Winograd's matrix multiplication...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...