This paper presents the design and implementation of several fundamental dense linear algebra (DLA) algorithms in OpenCL. In particular, these are linear system solvers and eigenvalue problem solvers. Further, we give an overview of the clMAGMA library, an open source, high performance OpenCL library that incorporates various optimizations, and in general provides the DLA functionality of the popular LAPACK library on heterogeneous architectures. The LAPACK-compliance and use of OpenCL simplify the use of clMAGMA in applications, while providing them with portable performance. High performance is obtained through the use of the high-performance OpenCL BLAS, hardware- and OpenCL-specific tuning, and a hybridization methodology, where we sp...
Direct and iterative methods are often used to solve linear systems in enginee...
We describe the design of ScaLAPACK++, an object oriented C++ library for implementing linear algebr...
Parallel accelerators are playing an increasingly important role in scientific computing. However, i...
We address some key issues in designing dense linear algebra (DLA) algorithms that are common for bo...
We address some key issues in designing dense linear algebra (DLA) algorithms that are co...
Dense linear algebra (DLA) is one of the seven most important kernels in high performance computing. ...
In this document we present a new approach to developing sequential and parallel dense line...
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
This paper presents an overview of the LAPACK library, a portable, public-domain library to solve th...
Researchers from the Formal Linear Algebra Method Environment (Flame) project have developed new met...
The Multicomputer Toolbox includes sparse, dense, and iterative scalable linear algebra libraries. D...
Sparse linear algebra is a cornerstone of modern computational science. These algorithms ig...
Over the last two decades, much progress has been made in the area of the high-performance sequ...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...