Basic Linear Algebra Subroutines (BLAS-3) [1] are building blocks for solving many numerical problems (Cholesky factorization, Gram-Schmidt orthonormalization, LU decomposition, ...). Their efficient implementation on a given parallel machine is a key issue for the maximal exploitation of the system's computational power. In this work we refer to a massively parallel SIMD machine (the APE100/Quadrics [2]) and to the adoption of the hyper-systolic method [3, 6, 4] to implement BLAS-3 efficiently on such a machine. The results we achieved (nearly 60-70% of peak performance for large matrices) demonstrate the validity of the proposed approach. The work is structured as follows: section 1 is devoted to a review of BLAS-3, in se...
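The central BLAS-3 operation is GEMM, the matrix-matrix update C <- alpha*A*B + beta*C. As a point of reference for what the routines above implement in parallel (this is only a naive sequential sketch, not the hyper-systolic scheme itself, and the function name `gemm` is illustrative, not the BLAS binding):

```python
def gemm(alpha, A, B, beta, C):
    """Naive reference GEMM: C <- alpha*A*B + beta*C.

    A is m x k, B is k x n, C is m x n, all as lists of lists.
    Sequential reference only; a tuned or parallel BLAS-3
    implementation blocks and distributes these loops.
    """
    m, k, n = len(A), len(B), len(B[0])
    for i in range(m):
        for j in range(n):
            acc = 0.0
            for p in range(k):          # inner product of row i of A and column j of B
                acc += A[i][p] * B[p][j]
            C[i][j] = alpha * acc + beta * C[i][j]
    return C
```

Higher-level kernels such as Cholesky or LU factorization are then organized so that most of their arithmetic is funneled through this one routine.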
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...
One of the key areas for enabling users to efficiently use an HPC system is providing optimized BLAS...
A current trend in high-performance computing is to decompose a large linear algebra problem into ba...
A scalable parallel algorithm for matrix multiplication on SIMD computers is presented. Our method...
Hyper-systolic algorithms represent a new class of parallel computing structures. Because of their r...
This paper is devoted to a new systolic parallelization scheme for matrix-matrix multiplication that...
A novel parallel algorithm for matrix multiplication is presented. It is based on a 1-D hyper-systol...
A profile is given of current research, as it pertains to computational mathematics, on Very...
We investigate the performance gains from hyper-systolic implementations of n2-loop problems on the ...
The Level 3 BLAS (BLAS3) are a set of specifications of FORTRAN 77 subprograms for carrying out matr...
Massively parallel computer systems, having thousands of identical processors operating in SIMD mode...
This paper gives a report on various results of the Linear Algebra Project on the Fujitsu AP1000 in ...