Latency hiding techniques such as multilevel cache hierarchies yield high performance when applications map well onto hierarchy implementations, but performance can suffer drastically when they do not. Identifying and reduc-ing mismatches between an application and the memory hierarchy is dificult without insight into the actual behav-ior of the hardware implementation. We advocate the use of hardware event counters, as a cheap, effective and practi-cal way to tune applications for a given hardware plat-form. We take a case study approach, focussing on the counters available on the SPARCcenter 2000, a 20 proces-sor, shared-bus based multiprocessor We describe the tools we built to relate hardware event counts to user applica-tions and give ...
Hardware performance counters are CPU registers that count data loads and stores, cache misses, and ...
Journal PaperCurrent microprocessors incorporate techniques to aggressively exploit instruction-leve...
When creating architectural tools, it is essential to know whether the generated results make sense....
Modern processors incorporate several performance monitoring units, which can be used to count event...
We introduce the usage of hardware performance counters (HPCs) as a new method that allows very prec...
International audienceHardware performance monitoring counters have recently received a lot of atten...
In this paper, the authors characterize application performance with a memory-centric view. Using a ...
As useful as performance counters are, the meaning of reported aggregate event counts is sometimes q...
Applications may have unintended performance problems in spite of compiler optimizations, because of...
A common way of representing processor performance is to use Cycles per Instruction (CPI) `stacks' w...
Cycles per Instruction (CPI) stacks break down processor execution time into a baseline CPI plus a n...
Memory contention is one of the largest sources of inter-core interference in statically partitioned...
One of the major architectural design considerations for any computer system is that of the memory s...
Over the past several de ades, mi ropro essors have evolved to assist system software in implementin...
As the performance gap between the processor cores and the memory subsystem increases, designers are...
Hardware performance counters are CPU registers that count data loads and stores, cache misses, and ...
Journal PaperCurrent microprocessors incorporate techniques to aggressively exploit instruction-leve...
When creating architectural tools, it is essential to know whether the generated results make sense....
Modern processors incorporate several performance monitoring units, which can be used to count event...
We introduce the usage of hardware performance counters (HPCs) as a new method that allows very prec...
International audienceHardware performance monitoring counters have recently received a lot of atten...
In this paper, the authors characterize application performance with a memory-centric view. Using a ...
As useful as performance counters are, the meaning of reported aggregate event counts is sometimes q...
Applications may have unintended performance problems in spite of compiler optimizations, because of...
A common way of representing processor performance is to use Cycles per Instruction (CPI) `stacks' w...
Cycles per Instruction (CPI) stacks break down processor execution time into a baseline CPI plus a n...
Memory contention is one of the largest sources of inter-core interference in statically partitioned...
One of the major architectural design considerations for any computer system is that of the memory s...
Over the past several de ades, mi ropro essors have evolved to assist system software in implementin...
As the performance gap between the processor cores and the memory subsystem increases, designers are...
Hardware performance counters are CPU registers that count data loads and stores, cache misses, and ...
Journal PaperCurrent microprocessors incorporate techniques to aggressively exploit instruction-leve...
When creating architectural tools, it is essential to know whether the generated results make sense....