International audienceToday's multi-core era places significant demands on an optimizing compiler, which must parallelize programs, exploit memory hierarchy, and leverage the ever-increasing SIMD capabilities of modern processors. Existing model-based heuristics for performance optimization used in compilers are limited in their ability to identify profitable parallelism/locality trade-offs and usually lead to sub-optimal performance. To address this problem, we distinguish optimizations for which effective model-based heuristics and profitability estimates exist, from optimizations that require empirical search to achieve good performance in a portable fashion. We have developed a completely automatic framework in which we focus the empiri...
University of Minnesota Ph.D. dissertation. September 2014. Major: Computer Science. Advisor: Pen-Ch...
International audiencePortability of performance has been the underlying assumption for the vast maj...
International audienceScientific computation requires more and more performance in its algorithms. N...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
Today's multi-core era places significant demands on an optimizing compiler, which must parallelize ...
On modern architectures, a missed optimization can translate into performance degradations reaching ...
International audienceHigh-level program optimizations, such as loop transformations, are critical f...
The goal of this dissertation is to give programmers the ability to achieve high performance by focu...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
In high-performance computing, one primary objective is to exploit the performance that the given ta...
UnrestrictedWe are facing an increasing performance gap between processor and memory speed on today'...
International audienceThe quality of compiler-optimized code for high-performance applications lags ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Although code optimizations have been applied by compilers for over 40 years, much of the research h...
Massively parallel and heterogeneous systems together with their APIs have been used for various app...
University of Minnesota Ph.D. dissertation. September 2014. Major: Computer Science. Advisor: Pen-Ch...
International audiencePortability of performance has been the underlying assumption for the vast maj...
International audienceScientific computation requires more and more performance in its algorithms. N...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
Today's multi-core era places significant demands on an optimizing compiler, which must parallelize ...
On modern architectures, a missed optimization can translate into performance degradations reaching ...
International audienceHigh-level program optimizations, such as loop transformations, are critical f...
The goal of this dissertation is to give programmers the ability to achieve high performance by focu...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
In high-performance computing, one primary objective is to exploit the performance that the given ta...
UnrestrictedWe are facing an increasing performance gap between processor and memory speed on today'...
International audienceThe quality of compiler-optimized code for high-performance applications lags ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Although code optimizations have been applied by compilers for over 40 years, much of the research h...
Massively parallel and heterogeneous systems together with their APIs have been used for various app...
University of Minnesota Ph.D. dissertation. September 2014. Major: Computer Science. Advisor: Pen-Ch...
International audiencePortability of performance has been the underlying assumption for the vast maj...
International audienceScientific computation requires more and more performance in its algorithms. N...