Basic Linear Algebra Subprograms (BLAS) are building blocks for many other matrix computations. BLAS encapsulate basic operations on vectors and matrices so that they can be optimized for a particular computer architecture while the high-level routines that call them remain portable. BLAS offer good opportunities for optimizing use of the memory hierarchy. Generic BLAS are available from netlib, and many computer vendors provide custom versions optimized for their particular systems.
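To make the idea of "basic operations on vectors and matrices" concrete, the sketch below gives an unoptimized pure-Python version of one representative operation from each of the three conventional BLAS levels (vector-vector, matrix-vector, matrix-matrix). The function names `axpy`, `gemv`, and `gemm` follow the usual BLAS naming, but the signatures here are simplified assumptions for illustration: real BLAS routines take additional arguments such as strides, leading dimensions, and transpose flags, and they overwrite an output operand in place rather than returning a new list.

```python
# Illustrative reference versions only, not an actual BLAS interface.
# Matrices are assumed to be lists of rows (row-major nested lists).

def axpy(alpha, x, y):
    """Level 1 (vector-vector): return alpha*x + y."""
    return [alpha * xi + yi for xi, yi in zip(x, y)]

def gemv(alpha, a, x, beta, y):
    """Level 2 (matrix-vector): return alpha*A*x + beta*y."""
    return [alpha * sum(aij * xj for aij, xj in zip(row, x)) + beta * yi
            for row, yi in zip(a, y)]

def gemm(alpha, a, b, beta, c):
    """Level 3 (matrix-matrix): return alpha*A*B + beta*C."""
    cols_b = list(zip(*b))  # columns of B, so each entry is a dot product
    return [[alpha * sum(aik * bkj for aik, bkj in zip(row, col)) + beta * cij
             for col, cij in zip(cols_b, crow)]
            for row, crow in zip(a, c)]
```

A high-level routine written against these three names stays the same whether they are backed by this reference code or by a tuned library, which is the portability argument made above.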
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
This paper proposes a set of Level 3 Basic Linear Algebra Subprograms and associated kernels for sp...
Abstract. Matrix-matrix multiplication is normally computed using one of the BLAS or a reinvention o...
This paper summarizes the BLAS Technical Forum Standard, a specification of a set of kernel routine...
Abstract The Basic Linear Algebra Subprograms, BLAS, are the basic computational kernels in most ap...
This working note examines different Fortran implementations of the Basic Linear Algebra Subprograms...
A technique for optimizing software is proposed that involves the use of a standardized set of compu...
We discuss the interface design for the Sparse Basic Linear Algebra Subprograms (BLAS), the kernels ...
A package of 38 low-level subprograms for many of the basic operations of numerical linear algebra i...
This paper proposes an API for Batched Basic Linear Algebra Subprograms (Batched BLAS). We focus on...
The level 3 Basic Linear Algebra Subprograms (BLAS) are designed to perform various matrix multiply ...
This paper describes an approach for the automatic generation and optimization of numerical softwar...
Abstract—Scientific programmers often turn to vendor-tuned Basic Linear Algebra Subprograms (BLAS) t...
We describe a subset of the level-1, level-2, and level-3 BLAS implemented for each node of the Conn...
The functions library, called Basic Linear Algebra Subprograms (BLAS-1), is considered the programmi...