AbstractThe use of an OpenMP compiler optimized for the corresponding multicore system is a good option, but it is possible in a system to have access to more than one compiler and different compilers can appropriately optimize different parts of the code. In this paper we present a proposal for an autotuning system for linear algebra routines that decides the best compiler for each situation, as well as other parameter values, as, for example, the number of threads to generate
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
Abstract. Autotuning technology has emerged recently as a systematic process for evaluating alternat...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...
AbstractThe use of an OpenMP compiler optimized for the corresponding multicore system is a good opt...
The final publication is available at Springer via http://dx.doi.org/10.1007/s10766-013-0249-6The in...
AbstractThe introduction of auto-tuning techniques in linear algebra routines using hybrid combinati...
AbstractIn this work the behavior of the multithreaded implementation of some LAPACK routines on PLA...
Abstract. In this paper we describe an autotuning tool for optimiza-tion of OpenMP applications on h...
It is rare for a programmer to solve a numerical problem with a single library call; most problems r...
This paper presents a dynamic task scheduling approach to executing dense linear algebra algorithms ...
This paper presents a self-optimization methodology for parallel linear algebra rou-tines on heterog...
Abstract In this document we present a new approach to developing sequential and parallel dense line...
Multicore architectures have found their way into many areas of application by now. While this allow...
With the emergence of thread-level parallelism as the primary means for continued improvement of per...
This dissertation details contributions made by the author to the field of computer science while wo...
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
Abstract. Autotuning technology has emerged recently as a systematic process for evaluating alternat...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...
AbstractThe use of an OpenMP compiler optimized for the corresponding multicore system is a good opt...
The final publication is available at Springer via http://dx.doi.org/10.1007/s10766-013-0249-6The in...
AbstractThe introduction of auto-tuning techniques in linear algebra routines using hybrid combinati...
AbstractIn this work the behavior of the multithreaded implementation of some LAPACK routines on PLA...
Abstract. In this paper we describe an autotuning tool for optimiza-tion of OpenMP applications on h...
It is rare for a programmer to solve a numerical problem with a single library call; most problems r...
This paper presents a dynamic task scheduling approach to executing dense linear algebra algorithms ...
This paper presents a self-optimization methodology for parallel linear algebra rou-tines on heterog...
Abstract In this document we present a new approach to developing sequential and parallel dense line...
Multicore architectures have found their way into many areas of application by now. While this allow...
With the emergence of thread-level parallelism as the primary means for continued improvement of per...
This dissertation details contributions made by the author to the field of computer science while wo...
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
Abstract. Autotuning technology has emerged recently as a systematic process for evaluating alternat...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...