Scientific applications are some of the most computationally demanding software pieces. Their core is usually a set of linear algebra operations, which may represent a significant part of the overall run-time of the application. BLAS libraries aim to solve this problem by exposing a set of highly optimized, reusable routines. There are several implementations specifically tuned for different types of computing platforms, including coprocessors. Some examples include the one bundled with the Intel MKL library, which targets Intel CPUs or Xeon Phi coprocessors, or the cuBLAS library, which is specifically designed for NVIDIA GPUs. Nowadays, computing nodes in many supercomputing clusters include one or more different coprocessor types. To ful...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...
We provide timing results for common linear algebra subroutines across BLAS (Basic Lin-ear Algebra S...
Because of tight power and energy constraints, industry is progressively shifting toward heterogeneo...
International audienceIn the last ten years, GPUs have dominated the market considering the computin...
Linear algebra kernels are in the core of many scientific applications. We propose a unified, perfor...
The increase in performance of the last generations of graphics processors (GPUs) has made this clas...
Rights to individual papers remain with the author or the author's employer. Permission is gran...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
The BLAS-like Library Instantiation Software (BLIS) is a framework for the rapid instantiation of ba...
BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra l...
The BLAS-like Library Instantiation Software (BLIS) is a framework for the rapid instantiation of ba...
International audienceNowadays GPUs have dominated the market considering the computing/power metric...
This work reviews the experience of implementing different versions of the SSPR rank-one update oper...
Combinatorial algorithms such as those that arise in graph analysis, modeling of discrete systems, b...
A current trend in high-performance computing is to decompose a large linear algebra prob- lem into ...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...
We provide timing results for common linear algebra subroutines across BLAS (Basic Lin-ear Algebra S...
Because of tight power and energy constraints, industry is progressively shifting toward heterogeneo...
International audienceIn the last ten years, GPUs have dominated the market considering the computin...
Linear algebra kernels are in the core of many scientific applications. We propose a unified, perfor...
The increase in performance of the last generations of graphics processors (GPUs) has made this clas...
Rights to individual papers remain with the author or the author's employer. Permission is gran...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
The BLAS-like Library Instantiation Software (BLIS) is a framework for the rapid instantiation of ba...
BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra l...
The BLAS-like Library Instantiation Software (BLIS) is a framework for the rapid instantiation of ba...
International audienceNowadays GPUs have dominated the market considering the computing/power metric...
This work reviews the experience of implementing different versions of the SSPR rank-one update oper...
Combinatorial algorithms such as those that arise in graph analysis, modeling of discrete systems, b...
A current trend in high-performance computing is to decompose a large linear algebra prob- lem into ...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...
We provide timing results for common linear algebra subroutines across BLAS (Basic Lin-ear Algebra S...
Because of tight power and energy constraints, industry is progressively shifting toward heterogeneo...