Abstract—An increasing cache latency in next-generation pro-cessors incurs profound performance impacts in spite of advanced out-of-order execution techniques. One way to circumvent this cache latency problem is to predict load values at the onset of pipeline execution by exploiting either the load value locality or the address correlation of stores and loads. In this paper, we describe a new load value speculation mechanism based on the program syntax correlation of stores and loads. We establish a symbolic cache (SC), which is accessed in early pipeline stages to achieve a zero-cycle load. Instead of using memory addresses, the SC is accessed by the encoding bits of base register ID plus the displacement directly from the instruction code...
The execution time of programs that have large working sets is substantially increased by the overhe...
One major restriction to the performance of out-of-order superscalar processors is the latency of lo...
Value specialization is a technique which can improve a program’s performance when its code frequent...
For many programs, especially integer codes, untolerated load instruction latencies account for a si...
With the increasing performance gap between the processor and the memory, the importance of caches i...
An increasing cache latency in future processors incurs profound performance impacts in spite of adv...
In this correspondence, we propose design techniques that may significantly simplify the cache acces...
While runahead execution is effective at parallelizing independent long-latency cache misses, it is ...
. Data speculation refers to the execution of an instruction before some logically preceding instruc...
Untolerated load instruction latencies often have a significant impact on overall program performanc...
While runahead execution is effective at parallelizing independent long-latency cache misses, it is ...
Two orthogonal hardware techniques, table-based address prediction and early address calculation, fo...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
Memory operations remain a significant bottleneck in dynamically scheduled pipelined processors, due...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
The execution time of programs that have large working sets is substantially increased by the overhe...
One major restriction to the performance of out-of-order superscalar processors is the latency of lo...
Value specialization is a technique which can improve a program’s performance when its code frequent...
For many programs, especially integer codes, untolerated load instruction latencies account for a si...
With the increasing performance gap between the processor and the memory, the importance of caches i...
An increasing cache latency in future processors incurs profound performance impacts in spite of adv...
In this correspondence, we propose design techniques that may significantly simplify the cache acces...
While runahead execution is effective at parallelizing independent long-latency cache misses, it is ...
. Data speculation refers to the execution of an instruction before some logically preceding instruc...
Untolerated load instruction latencies often have a significant impact on overall program performanc...
While runahead execution is effective at parallelizing independent long-latency cache misses, it is ...
Two orthogonal hardware techniques, table-based address prediction and early address calculation, fo...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
Memory operations remain a significant bottleneck in dynamically scheduled pipelined processors, due...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
The execution time of programs that have large working sets is substantially increased by the overhe...
One major restriction to the performance of out-of-order superscalar processors is the latency of lo...
Value specialization is a technique which can improve a program’s performance when its code frequent...