Instruction fetch bandwidth is feared to be a major limiting factor to the performance of future wide-issue aggressive superscalars. In this paper, we focus on database applications running decision support workloads. We characterize the locality patterns of ia database kernel and find frequently executed paths. Using this information, we propose an algorithm to lay out the basic blocks for improved I-fetch. Our results show a miss reduction of 60-98% for realistic I-cache sizes and a doubling of the number of instructions executed between taken branches. As a consequence, we increase the fetch bandwith provided by an aggressive sequential fetch unit from 5.8 for the original code to 10.6 using our proposed layout. Our software scheme combi...
Commercial applications such as databases and Web servers constitute the most important market segme...
L1 instruction-cache misses pose a critical performance bottleneck in commercial server workloads. C...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...
Instruction fetch bandwidth is feared to be a major limiting factor to the performance of future wid...
The design of higher performance processors has been following two major trends: increasing the pipe...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
Fetch performance is a very important factor because it effectively limits the overall processor per...
As more and more query processing work can be done in main memory, memory access is becoming a signi...
The design of higher performance processors has been following two major trends: increasing the pipe...
Recent supers calar processors issue four tnstructzons per cycle. These processors are also powered ...
This paper examines the behavior of current and next generation microprocessors’ fetch engines while...
Recent studies highlight that traditional transaction processing systems utilize the micro-architect...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
The effective performance of wide-issue superscalar processors depends on many parameters, such as b...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
Commercial applications such as databases and Web servers constitute the most important market segme...
L1 instruction-cache misses pose a critical performance bottleneck in commercial server workloads. C...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...
Instruction fetch bandwidth is feared to be a major limiting factor to the performance of future wid...
The design of higher performance processors has been following two major trends: increasing the pipe...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
Fetch performance is a very important factor because it effectively limits the overall processor per...
As more and more query processing work can be done in main memory, memory access is becoming a signi...
The design of higher performance processors has been following two major trends: increasing the pipe...
Recent supers calar processors issue four tnstructzons per cycle. These processors are also powered ...
This paper examines the behavior of current and next generation microprocessors’ fetch engines while...
Recent studies highlight that traditional transaction processing systems utilize the micro-architect...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
The effective performance of wide-issue superscalar processors depends on many parameters, such as b...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
Commercial applications such as databases and Web servers constitute the most important market segme...
L1 instruction-cache misses pose a critical performance bottleneck in commercial server workloads. C...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...