Auto-tuning has become increasingly popular for optimizing non-functional parameters of parallel programs. The typically large search space requires sophisticated techniques to find well performing parameter values in a reasonable amount of time. Different parts of a program often perform best with different parameter values. We therefore subdivide programs into several regions, and try to optimize the parameter values for each of those regions separately as opposed to setting the parameter values globally for the entire program. As this enlarges the search space even further, we have to extend existing auto-tuning techniques in order to obtain good results. In this paper we introduce a novel enhancement to the RS-GDE3 algorithm which is us...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
The contemporary parallel I/O software stack is complex due to a large number of configurations for ...
Over the last several decades we have witnessed tremendous change in the landscape of computer archi...
Auto-tuning has become increasingly popular for optimizing non-functional parameters of parallel pro...
Automatic tuning (auto-tuning) of software has emerged in recent years as a promising method that tr...
For large scale systems, such as data centers, energy efficiency has proven to be key for reducing c...
Auto-tuning has recently received significant attention from the High Performance Computing communi...
This paper describes a new parallel program tuning framework, with a new approach for tuning. The ap...
Automatic performance tuning (auto-tuning) has been used in parallel numerical applications for adap...
The tuning of parallel programs on large distributed-memory machines today is usually a costly, and ...
This paper presents an automated performance tuning solution, which partitions a program into a numb...
There are proposed software tools for automatic generating autotuners – special kind of applications...
In high-performance computing, excellent node-level performance is required for the efficient use of...
The recent transformation from an environment where gains in computational performance came from inc...
Abstract. The increasing complexities of modern architectures require compilers to extensively apply...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
The contemporary parallel I/O software stack is complex due to a large number of configurations for ...
Over the last several decades we have witnessed tremendous change in the landscape of computer archi...
Auto-tuning has become increasingly popular for optimizing non-functional parameters of parallel pro...
Automatic tuning (auto-tuning) of software has emerged in recent years as a promising method that tr...
For large scale systems, such as data centers, energy efficiency has proven to be key for reducing c...
Auto-tuning has recently received significant attention from the High Performance Computing communi...
This paper describes a new parallel program tuning framework, with a new approach for tuning. The ap...
Automatic performance tuning (auto-tuning) has been used in parallel numerical applications for adap...
The tuning of parallel programs on large distributed-memory machines today is usually a costly, and ...
This paper presents an automated performance tuning solution, which partitions a program into a numb...
There are proposed software tools for automatic generating autotuners – special kind of applications...
In high-performance computing, excellent node-level performance is required for the efficient use of...
The recent transformation from an environment where gains in computational performance came from inc...
Abstract. The increasing complexities of modern architectures require compilers to extensively apply...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
The contemporary parallel I/O software stack is complex due to a large number of configurations for ...
Over the last several decades we have witnessed tremendous change in the landscape of computer archi...