Achieving high performance in task-parallel runtime systems, especially with high degrees of parallelism and fine-grained tasks, requires tuning a large variety of behavioral parameters according to program characteristics. In the current state of the art, this tuning is generally performed in one of two ways: either by a group of experts who derive a single setup which achieves good -- but not optimal -- performance across a wide variety of use cases, or by monitoring a system's behavior at runtime and responding to it. The former approach invariably fails to achieve optimal performance for programs with highly distinct execution patterns, while the latter induces some overhead and cannot affect parameters which need to be fixed at compile...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
Making computer systems more energy efficient while obtaining the maximum performance possible is ke...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...
Developing programs that fully utilize the available computing capabilities of the underlying hardwa...
Parallelizing compiler technology has improved in recent years. One area in which compilers have ma...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
For a wide variety of applications, both task and data parallelism must be exploited to achieve the ...
Runtime compilation, due to its online nature, presents unique challenges and opportunities to compi...
As systems become more complex, there are increasing demands for improvement with respect to attribu...
Maximizing the performance of computer systems while making them more energy efficient is vital for ...
Fully utilizing the potential of parallel architectures is known to be a challenging task. In the pa...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Current static parallel optimization techniques rarely try to account for either code block run-time...