The function library known as the Basic Linear Algebra Subprograms (BLAS-1) is considered the programming standard in scientific computing. In this work, we analyze various code optimization techniques for increasing the computational performance of BLAS-1. In particular, we take a combinatorial approach to exploring possible coding methods, using loop unrolling at different depths and vector data programming with MMX and SSE on Intel processors. For the main BLAS-1 functions, we measured a performance increase, expressed in mega-ops, of up to 52% over the optimized BLAS-1 routines of the ATLAS library.
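As an illustration of the unrolling technique this abstract mentions, the following is a minimal C sketch of a BLAS-1 style dot product, once as a plain loop and once unrolled to depth 4 with independent accumulators. It is not the authors' code: the function names dot_ref and dot_unroll4 are hypothetical, the depth of 4 is chosen only for illustration, and the MMX/SSE variants are not shown.

```c
#include <stddef.h>

/* Reference dot product: one multiply-add per loop iteration. */
double dot_ref(size_t n, const double *x, const double *y)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; ++i)
        sum += x[i] * y[i];
    return sum;
}

/* Hypothetical depth-4 unrolled variant (illustrative, not from the paper).
 * Four independent accumulators break the dependency chain on a single sum,
 * which may let the processor overlap the multiply-adds. */
double dot_unroll4(size_t n, const double *x, const double *y)
{
    double s0 = 0.0, s1 = 0.0, s2 = 0.0, s3 = 0.0;
    size_t i = 0;

    /* Main loop: process four elements per iteration. */
    for (; i + 4 <= n; i += 4) {
        s0 += x[i]     * y[i];
        s1 += x[i + 1] * y[i + 1];
        s2 += x[i + 2] * y[i + 2];
        s3 += x[i + 3] * y[i + 3];
    }

    double sum = (s0 + s1) + (s2 + s3);

    /* Remainder loop: handle the last n % 4 elements. */
    for (; i < n; ++i)
        sum += x[i] * y[i];

    return sum;
}
```

Because the unrolled version adds the products in a different order, its result can differ from the reference in the last bits for general floating-point input. Whether it is actually faster depends on the compiler, the optimization flags, and the processor.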
The Basic Linear Algebra Subprograms, BLAS, are the basic computational kernels in most ap...
The increase in performance of the last generations of graphics processors (GPUs) has made this clas...
This paper proposes an API for Batched Basic Linear Algebra Subprograms (Batched BLAS). We focus on...
The function library known as the Basic Linear Algebra Subprograms (BLAS-1) is consider...
The Basic Linear Algebra Subprograms or BLAS provide the foundation for much of the software used i...
One of the key areas for enabling users to efficiently use an HPC system is providing optimized BLAS...
A technique for optimizing software is proposed that involves the use of a standardized set of compu...
Scientific programmers often turn to vendor-tuned Basic Linear Algebra Subprograms (BLAS) t...
In this paper we propose a set of optimizations for the BLAS-3 routines of LASs library (Linear Alge...
This paper describes an implementation of Level 3 of the Basic Linear Algebra Subprogram (BLAS-3) li...
Scientific applications are some of the most computationally demanding software pieces. Their core i...
Basic Linear Algebra Subprograms (BLAS) and Linear Algebra Package (LAPACK) form basic building bloc...
A current trend in high-performance computing is to decompose a large linear algebra problem into ba...
We combine two high-level application programming interfaces to solve large-scale linear systems wi...