It is rare for a programmer to solve a numerical problem with a single library call; most problems require a sequence of calls. In linear algebra, programmers chain a series of Basic Linear Algebra Subprograms (BLAS) library calls to achieve the desired result. When a sequence of BLAS calls is memory bound, a great deal of performance is lost because no optimization occurs across library routine boundaries. It is not practical to create a library containing every required sequence of linear algebra operations, yet it is difficult for programmers to write their own high-performance implementations. One solution is for programmers to use an auto-tuning tool capable of optimizing the sequence of operations that exactl...
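To make the cost of unfused BLAS sequences concrete, here is a minimal sketch in plain Python. The kernel chosen (q = A p together with s = Aᵀ r) and the function names `bicgk_unfused` and `bicgk_fused` are illustrative assumptions, not taken from the abstract. The unfused version mimics two separate matrix-vector library calls and therefore traverses A twice; the fused version computes both results in a single traversal, which is the kind of cross-routine optimization the abstract says is missed.

```python
def bicgk_unfused(A, p, r):
    """Two independent matrix-vector products, as two separate library calls would do."""
    m, n = len(A), len(A[0])
    # First pass over A: q = A p
    q = [sum(A[i][j] * p[j] for j in range(n)) for i in range(m)]
    # Second pass over A: s = A^T r
    s = [sum(A[i][j] * r[i] for i in range(m)) for j in range(n)]
    return q, s


def bicgk_fused(A, p, r):
    """Same results, but each element of A is loaded once and used for both products."""
    m, n = len(A), len(A[0])
    q = [0.0] * m
    s = [0.0] * n
    for i in range(m):
        row = A[i]
        ri = r[i]
        acc = 0.0
        for j in range(n):
            a = row[j]        # single read of A[i][j]
            acc += a * p[j]   # contributes to q[i]
            s[j] += a * ri    # contributes to s[j]
        q[i] = acc
    return q, s


if __name__ == "__main__":
    A = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    p = [1.0, 1.0]
    r = [1.0, 0.0, 1.0]
    print(bicgk_unfused(A, p, r))
    print(bicgk_fused(A, p, r))
```

In a memory-bound setting the matrix traffic dominates, so halving the number of passes over A is where the potential gain comes from. Pure Python will not show that gain; the sketch only illustrates the loop structure that a tuned implementation would use.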
Design by Transformation (DxT) is an approach to software development that encodes domain-sp...
Dense linear algebra (DLA) is one of the seven most important kernels in high performance computing. ...
The introduction of auto-tuning techniques in linear algebra routines using hybrid combinati...
In this article we look at the generation of libraries for dense linear algebra operations...
Scientific programmers often turn to vendor-tuned Basic Linear Algebra Subprograms (BLAS) t...
This paper describes an approach for the automatic generation and optimization of numerical softwar...
The final publication is available at Springer via http://dx.doi.org/10.1007/s10766-013-0249-6 The in...
One of the main obstacles to the efficient solution of scientific problems is the problem of tuning ...
The goal of the LAPACK project is to provide efficient and portable software for dense numerical lin...
In this document we present a new approach to developing sequential and parallel dense line...
This dissertation focuses on the design and the implementation of domain-specific compilers for line...
Over the last two decades, much progress has been made in the area of the high-performance sequ...
To implement dense linear algebra algorithms for distributed-memory computers, an expert a...
Design by Transformation (DxT) is an approach to software development that encodes domain-specific p...
Efficient implementation of matrix algebra is important to the performance of many large and...