Abstract The Basic Linear Algebra Subprograms, BLAS, are the basic computa-tional kernels in most applications. BLAS 1 and BLAS 2, the vector-vector and matrix-vector routines, require memory accesses in the same order ascomputations and thus cannot achieve performance close to peak performance on modern computer architectures. BLAS 3 matrix-matrix operationson n * n-matrices on the other side can do order n3 operations with only or-der n2 memory accesses. This much better ratio of computation to memoryaccess allows for much higher performance. To show which performance can be expected using the BLAS routines from IBM's ESSL on an IBM p690 weinvestigated the performance of one routine of each BLAS level and compared it to that of the c...
Achieving high-performance while reducing power consumption is the key question as tech-nology scali...
A technique for optimizing software is proposed that involves the use of a standardized set of compu...
Basic Linear Algebra Subprograms (BLAS) and Linear Algebra Package (LAPACK) form basic building bloc...
A current trend in high-performance computing is to decompose a large linear algebra problem into ba...
The purpose of this paper is to explore issues related to the computation and communication performa...
Basic Linear Algebra Subprograms (BLAS) are building blocks for many other matrix computations BLAS ...
The level 3 Basic Linear Algebra Subprograms (BLAS) are designed to perform various matrix multiply ...
This working note examines different Fortran implementations of the Basic Linear Algebra Subprograms...
The performance of a parallel matrix-matrix-multiplication routine with the same functionality as DG...
We provide timing results for common linear algebra subroutines across BLAS (Basic Lin-ear Algebra S...
This paper summarizes the BLAS Technical Forum Standard, a speci- #cation of a set of kernel routine...
We consider the efficient implementation of matrix computations in interpreted Basic on a microcompu...
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...
The Level 3 BLAS (BLAS3) are a set of specifications of FORTRAN 77 subprograms for carrying out matr...
Achieving high-performance while reducing power consumption is the key question as tech-nology scali...
A technique for optimizing software is proposed that involves the use of a standardized set of compu...
Basic Linear Algebra Subprograms (BLAS) and Linear Algebra Package (LAPACK) form basic building bloc...
A current trend in high-performance computing is to decompose a large linear algebra problem into ba...
The purpose of this paper is to explore issues related to the computation and communication performa...
Basic Linear Algebra Subprograms (BLAS) are building blocks for many other matrix computations BLAS ...
The level 3 Basic Linear Algebra Subprograms (BLAS) are designed to perform various matrix multiply ...
This working note examines different Fortran implementations of the Basic Linear Algebra Subprograms...
The performance of a parallel matrix-matrix-multiplication routine with the same functionality as DG...
We provide timing results for common linear algebra subroutines across BLAS (Basic Lin-ear Algebra S...
This paper summarizes the BLAS Technical Forum Standard, a speci- #cation of a set of kernel routine...
We consider the efficient implementation of matrix computations in interpreted Basic on a microcompu...
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...
The Level 3 BLAS (BLAS3) are a set of specifications of FORTRAN 77 subprograms for carrying out matr...
Achieving high-performance while reducing power consumption is the key question as tech-nology scali...
A technique for optimizing software is proposed that involves the use of a standardized set of compu...
Basic Linear Algebra Subprograms (BLAS) and Linear Algebra Package (LAPACK) form basic building bloc...