BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general purpose, low-power, and special purpose architectures. We show how, with very little effort, the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. While most of this paper focuses on single core imp...
Dense linear algebra libraries, such as BLAS and LAPACK, provide a relevant collection of numerical ...
A simple but highly effective approach for transforming high-performance implementations on cachebas...
Combinatorial algorithms such as those that arise in graph analysis, modeling of discrete systems, b...
The BLAS-like Library Instantiation Software (BLIS) is a framework for the rapid instantiation of ba...
The BLAS-like Library Instantiation Software (BLIS) is a framework for the rapid instantiation of ba...
BLIS is a new framework for rapid instantiation of the BLAS. We describe how BLIS extends the “GotoB...
The level 3 Basic Linear Algebra Subprograms (BLAS) are designed to perform various matrix multiply ...
In the last ten years, GPUs have dominated the market considering the computin...
We provide timing results for common linear algebra subroutines across BLAS (Basic Linear Algebra S...
Scientific applications are some of the most computationally demanding software pieces. Their core i...
Nowadays GPUs have dominated the market considering the computing/power metric...
A current trend in high-performance computing is to decompose a large linear algebra problem into ba...
One of the key areas for enabling users to efficiently use an HPC system is providing optimized BLAS...
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...