Automatic performance tuning (auto-tuning) has been used in parallel numerical applications for adapting per-formance-relevant parameters. We extend auto-tuning to general-purpose parallel applications on multicores. This paper concentrates on Atune-IL, an instrumentation language for specifying a wide range of tunable parame-ters for a generic auto-tuner. Tunable parameters include the number of threads and other size parameters, but also choice of algorithms, numbers of pipeline stages, etc. A case study of Atune-IL’s usage in a real-world application with 13 parameters and over 24 million possible value combinations is discussed. With Atune-IL, the search space was reduced to 1,600 combinations, and the lines of code needed for instrumen...
This paper presents an automated performance tuning solution, which partitions a program into a numb...
Automatic tuning (auto-tuning) of software has emerged in recent years as a promising method that tr...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
Autotuning is an established technique for optimizing the performance of parallel applications. Howe...
Auto-tuning has recently received significant attention from the High Performance Computing communi...
Application auto-tuning has produced excellent results in a wide range of computing domains. Yet ada...
The tuning of parallel programs on large distributed-memory machines today is usually a costly, and ...
The recent transformation from an environment where gains in computational performance came from inc...
Auto-tuning has become increasingly popular for optimizing non-functional parameters of parallel pro...
This paper describes a new parallel program tuning framework, with a new approach for tuning. The ap...
There are proposed software tools for automatic generating autotuners – special kind of applications...
In today’s multicore era, parallelization of serial code is essential in order to exploit the archit...
AbstractWe present a dynamic method for tuning algorithmic parameters of parallel scientific program...
In high-performance computing, excellent node-level performance is required for the efficient use of...
International audienceCurrent architecture complexity requires fine tuning of compiler and runtime p...
This paper presents an automated performance tuning solution, which partitions a program into a numb...
Automatic tuning (auto-tuning) of software has emerged in recent years as a promising method that tr...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
Autotuning is an established technique for optimizing the performance of parallel applications. Howe...
Auto-tuning has recently received significant attention from the High Performance Computing communi...
Application auto-tuning has produced excellent results in a wide range of computing domains. Yet ada...
The tuning of parallel programs on large distributed-memory machines today is usually a costly, and ...
The recent transformation from an environment where gains in computational performance came from inc...
Auto-tuning has become increasingly popular for optimizing non-functional parameters of parallel pro...
This paper describes a new parallel program tuning framework, with a new approach for tuning. The ap...
There are proposed software tools for automatic generating autotuners – special kind of applications...
In today’s multicore era, parallelization of serial code is essential in order to exploit the archit...
AbstractWe present a dynamic method for tuning algorithmic parameters of parallel scientific program...
In high-performance computing, excellent node-level performance is required for the efficient use of...
International audienceCurrent architecture complexity requires fine tuning of compiler and runtime p...
This paper presents an automated performance tuning solution, which partitions a program into a numb...
Automatic tuning (auto-tuning) of software has emerged in recent years as a promising method that tr...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...