Profile-based optimizations can be used for instruction scheduling, loop scheduling, data preloading, function in-lining, and instruction cache performance enhancement. However, these techniques have not been embraced by software vendors because programs instrumented for pro ling run significantly slower, an awkward compile-run-recompile sequence is required, and a test input suite must be collected and validated for each program. This paper introduces hardware-based profiling that uses traditional branch handling hardware to generate profile information in real time. Techniques are presented for both one-level and two-level branch hardware organizations. The approach produces high accuracy with small slowdown in execution (0.4%-4.6%). This...
This paper describes the DIGITAL Continuous Profiling Infrastructure, a sampling-based profiling sys...
Reconfigurable systems map the computational intensive parts of the code in hardware while less comp...
This paper investigates a possible architecture to a dynamically adaptable processor. In this archit...
Abstract Profile-based optimizations can be used for instruction scheduling, loop scheduling, data p...
Runtime profile gives considerable information that can be reused, to optimize the executable for fa...
Pro le-based optimizations can be used for instruction scheduling, loop scheduling, data preloading,...
For advanced profile-guided optimizations to be e ective in online environments, fine-grained and ac...
Application profiling is an important step in the design and optimization of embedded systems. Accur...
For aggressive path-based program optimizations to be profitable in cost-sensitive environments, acc...
For aggressive path-based program optimizations to be profitable in cost-sensitive environments, acc...
Traditional compilers rely on static information about programs to perform optimizations. While such...
For aggressive path-based optimizations to be profitable in cost-senstive environments, accurate pat...
Microsoft ResearchAlthough runtime systems and the dynamic compilation model have revolutionized the...
For aggressive path-based optimizations to be profitable in cost-senstive environments, accurate pat...
A program profile attributes run-time costs to portions of a program's execution. Most profiling sys...
This paper describes the DIGITAL Continuous Profiling Infrastructure, a sampling-based profiling sys...
Reconfigurable systems map the computational intensive parts of the code in hardware while less comp...
This paper investigates a possible architecture to a dynamically adaptable processor. In this archit...
Abstract Profile-based optimizations can be used for instruction scheduling, loop scheduling, data p...
Runtime profile gives considerable information that can be reused, to optimize the executable for fa...
Pro le-based optimizations can be used for instruction scheduling, loop scheduling, data preloading,...
For advanced profile-guided optimizations to be e ective in online environments, fine-grained and ac...
Application profiling is an important step in the design and optimization of embedded systems. Accur...
For aggressive path-based program optimizations to be profitable in cost-sensitive environments, acc...
For aggressive path-based program optimizations to be profitable in cost-sensitive environments, acc...
Traditional compilers rely on static information about programs to perform optimizations. While such...
For aggressive path-based optimizations to be profitable in cost-senstive environments, accurate pat...
Microsoft ResearchAlthough runtime systems and the dynamic compilation model have revolutionized the...
For aggressive path-based optimizations to be profitable in cost-senstive environments, accurate pat...
A program profile attributes run-time costs to portions of a program's execution. Most profiling sys...
This paper describes the DIGITAL Continuous Profiling Infrastructure, a sampling-based profiling sys...
Reconfigurable systems map the computational intensive parts of the code in hardware while less comp...
This paper investigates a possible architecture to a dynamically adaptable processor. In this archit...