AbstractDesign by Transformation (DxT) is an approach to software development that encodes domain-specific programs as graphs and expert design knowledge as graph transformations. The goal of DxT is to mechanize the generation of highly-optimized code. This paper demonstrates how DxT can be used to transform sequential specifications of an important set of Dense Linear Algebra (DLA) kernels, the level-3 Basic Linear Algebra Subprograms (BLAS3), into high-performing library routines targeting distributed-memory (cluster) architectures. Getting good BLAS3 performance for such platforms requires deep domain knowledge, so their implementations are manually coded by experts. Unfortunately, there are few such experts and developing the full varie...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
Design by Transformation (DxT) is an approach to software development that encodes domain-specific p...
AbstractDesign by Transformation (DxT) is an approach to software development that encodes domain-sp...
Abstract. To implement dense linear algebra algorithms for distributed-memory computers, an expert a...
Abstract In this document we present a new approach to developing sequential and parallel dense line...
textExpert design knowledge is essential to develop a library of high-performance software. This inc...
This paper describes an approach for the automatic generation and optimization of numerical softwar...
The complexity of hardware platforms available today is increasing. Com-plex memory hierarchies, mul...
textOver the last two decades, much progress has been made in the area of the high-performance sequ...
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
<p>NSF's Software Infrastructure for Sustained Innovation funds the development of community softwar...
One of the main obstacles to the efficient solution of scientific problems is the problem of tuning ...
Abstract. In this article we look at the generation of libraries for dense linear algebra operations...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
Design by Transformation (DxT) is an approach to software development that encodes domain-specific p...
AbstractDesign by Transformation (DxT) is an approach to software development that encodes domain-sp...
Abstract. To implement dense linear algebra algorithms for distributed-memory computers, an expert a...
Abstract In this document we present a new approach to developing sequential and parallel dense line...
textExpert design knowledge is essential to develop a library of high-performance software. This inc...
This paper describes an approach for the automatic generation and optimization of numerical softwar...
The complexity of hardware platforms available today is increasing. Com-plex memory hierarchies, mul...
textOver the last two decades, much progress has been made in the area of the high-performance sequ...
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
<p>NSF's Software Infrastructure for Sustained Innovation funds the development of community softwar...
One of the main obstacles to the efficient solution of scientific problems is the problem of tuning ...
Abstract. In this article we look at the generation of libraries for dense linear algebra operations...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...