This article proposes an online auto-tuning approach for computing kernels. Unlike existing online auto-tuners, which regenerate code through long compilation chains from source to binary code, our approach deploys auto-tuning directly at the level of machine code generation. This allows auto-tuning to pay off even in very short-running applications. As a proof of concept, the approach is demonstrated on two benchmarks that run for only hundreds of milliseconds to a few seconds. In a CPU-bound kernel, the average speedups range from 1.10 to 1.58 depending on the target micro-architecture, and reach 2.53 in the most favourable conditions (all run-time overheads included). In a memory-bo...
Modern high performance libraries, such as ATLAS and FFTW, and programming languages, such as PetaBr...
Computing systems rarely deliver best possible performance due to ever increas...
In computing systems, energy consumption is limiting the performance growth experienced in the last ...
In high-performance computing, excellent node-level performance is required for the efficient use of...
Recent years have witnessed phenomenal growth in the application and capabilities of Graphical Proc...
The recent transformation from an environment where gains in computational performance came from inc...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
This paper presents an automated performance tuning solution, which partitions a program into a numb...
Magister Scientiae - MSc. In this chapter, we provide the motivation and background behind the automa...
Automatic performance tuning of computationally intensive kernels in scientific applications...
Graphics Processing Units (GPUs) have revolutionized the computing landscape in the past decade and ...
Autotuning is an established technique for optimizing the performance of parallel applications. Howe...