The increasing demand for high performance forces computer architects to employ a plethora of hardware optimizations. These optimizations and new architecture features are often examined with past generation compilers that do not take into account the new architecture optimizations. The use of a complier that is unaware of a set of architecture optimizations may lead to an incorrect estimation of the impact of these new optimizations. This problem is exacerbated for in-order architectures which rely on the complier to assist in scheduling. Our research focuses on efficient techniques for generating a highly optimized and scheduled binary for VLIW and in-order architectures. We propose to use a modified out-of-order simulator to generate ...
Dynamically optimizing programs is worthwhile only if the overhead created by the dynamic optimizer ...
VLIW processors are statically scheduled processors and their performance depends on the quality of ...
The Software Trace Cache is a compiler transformation, or a postcompilation binary optimization, tha...
The increasing demand for high performance forces computer architects to employ a plethora of hardwa...
In trace processors, a sequential program is partitioned at run time into "traces." A tra...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
Wide-issue processors continue to achieve higher performance by exploiting greater instruction-level...
International audienceTo meet the high demand for powerful embedded processors, VLIW architectures a...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
This work examines the interaction of compiler scheduling techniques with processor features such as...
instruction-level parallelism, compilers, VLIW, superscalar, code generation Trace Scheduling-2 is a...
Developing an optimizing compiler for a newly proposed architecture is ex-tremely difficult when the...
VLIW processors are statically scheduled processors and their performance depends on the quality of ...
Developing an optimizing compiler for a newly proposed architecture is extremely difficult when ther...
Abstract Profile-based optimizations can be used for instruction scheduling, loop scheduling, data p...
Dynamically optimizing programs is worthwhile only if the overhead created by the dynamic optimizer ...
VLIW processors are statically scheduled processors and their performance depends on the quality of ...
The Software Trace Cache is a compiler transformation, or a postcompilation binary optimization, tha...
The increasing demand for high performance forces computer architects to employ a plethora of hardwa...
In trace processors, a sequential program is partitioned at run time into "traces." A tra...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
Wide-issue processors continue to achieve higher performance by exploiting greater instruction-level...
International audienceTo meet the high demand for powerful embedded processors, VLIW architectures a...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
This work examines the interaction of compiler scheduling techniques with processor features such as...
instruction-level parallelism, compilers, VLIW, superscalar, code generation Trace Scheduling-2 is a...
Developing an optimizing compiler for a newly proposed architecture is ex-tremely difficult when the...
VLIW processors are statically scheduled processors and their performance depends on the quality of ...
Developing an optimizing compiler for a newly proposed architecture is extremely difficult when ther...
Abstract Profile-based optimizations can be used for instruction scheduling, loop scheduling, data p...
Dynamically optimizing programs is worthwhile only if the overhead created by the dynamic optimizer ...
VLIW processors are statically scheduled processors and their performance depends on the quality of ...
The Software Trace Cache is a compiler transformation, or a postcompilation binary optimization, tha...