In the past, instruction fetch speeds have been improved by using cache schemes that capture the actual program flow. In this paper, we elaborate on the architecture and operation of an instruction cache named Variable-Sized Block Cache (VSBC) that also makes use of the dynamic behavior of a program. Current trace-based cache schemes usually have some instructions stored repeatedly; this redundancy is eliminated in VSBC. Our cache also allows storage of basic blocks of arbitrary sizes, in multiple-way cache structure. An overall comparison of trace miss rate and average trace length shows VSBC to be a better performing cache scheme than TC, using SPECint2000 integer benchmarks.Facultad de Informátic
The design of higher performance processors has been following two major trends: increasing the pipe...
Future processors combining out-of-order execution with aggressive speculation techniques will need ...
The Software Trace Cache is a compiler transformation, or a postcompilation binary optimization, tha...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buf...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
In order to meet the demands of wider issue processors, fetch mechanisms will need to fetch multiple...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
Fetch performance is a very important factor because it effectively limits the overall processor per...
The design of higher performance processors has been following two major trends: increasing the pipe...
The design of higher performance processors has been following two major trends: increasing the pipe...
Future processors combining out-of-order execution with aggressive speculation techniques will need ...
The Software Trace Cache is a compiler transformation, or a postcompilation binary optimization, tha...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buf...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
In order to meet the demands of wider issue processors, fetch mechanisms will need to fetch multiple...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
Fetch performance is a very important factor because it effectively limits the overall processor per...
The design of higher performance processors has been following two major trends: increasing the pipe...
The design of higher performance processors has been following two major trends: increasing the pipe...
Future processors combining out-of-order execution with aggressive speculation techniques will need ...
The Software Trace Cache is a compiler transformation, or a postcompilation binary optimization, tha...