The tuning of parallel programs on large distributed-memory machines today is usually a costly, and often extensive, manual process. Automatic tuning techniques can help reduce this manual burden. This dissertation investigates the utility of a new class of automatic tuning methods for large-scale parallel programs whereby each program exposes information about its behavior to the runtime system. This behavioral information enables a tuning framework to quickly find appropriate ways to reconfigure or steer the application towards better performance. This dissertation describes both new automatic tuning mechanisms within a parallel runtime system, and a new framework that automatically reconfigures the behavior or structure of the program...
Abstract — Performance of parallel programs is one of the reasons of their development. The process ...
Compared to traditional compile-time optimization, run-time optimization could offer signi@...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
The tuning of parallel programs on large distributed-memory machines today is usually a costly, and ...
This paper describes a new parallel program tuning framework, with a new approach for tuning. The ap...
Auto-tuning has recently received significant attention from the High Performance Computing communi...
The thesis addresses the problem of automatic and dynamic tuning of parallel and distributed applica...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
Performance of parallel programs is one of the reasons of their development. The process of designin...
Big data processing systems (e.g., Hadoop, Spark, Storm) contain a vast number of configuration para...
. The evident difficulties in creating efficient parallel programs have encouraged the development o...
Parallel programming has always been difficult due to the complexity of hardware and the diversity o...
AbstractWe present a dynamic method for tuning algorithmic parameters of parallel scientific program...
Improving program performance through the use of multiple homogeneous processing elements, or cores...
Parallel and distributed programming constitutes a highly promising approach to improving the perfor...
Abstract — Performance of parallel programs is one of the reasons of their development. The process ...
Compared to traditional compile-time optimization, run-time optimization could offer signi@...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
The tuning of parallel programs on large distributed-memory machines today is usually a costly, and ...
This paper describes a new parallel program tuning framework, with a new approach for tuning. The ap...
Auto-tuning has recently received significant attention from the High Performance Computing communi...
The thesis addresses the problem of automatic and dynamic tuning of parallel and distributed applica...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
Performance of parallel programs is one of the reasons of their development. The process of designin...
Big data processing systems (e.g., Hadoop, Spark, Storm) contain a vast number of configuration para...
. The evident difficulties in creating efficient parallel programs have encouraged the development o...
Parallel programming has always been difficult due to the complexity of hardware and the diversity o...
AbstractWe present a dynamic method for tuning algorithmic parameters of parallel scientific program...
Improving program performance through the use of multiple homogeneous processing elements, or cores...
Parallel and distributed programming constitutes a highly promising approach to improving the perfor...
Abstract — Performance of parallel programs is one of the reasons of their development. The process ...
Compared to traditional compile-time optimization, run-time optimization could offer signi@...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...