We study the issue of performance prediction on the SGI-Power Challenge, a typical SMP. On such a platform, the cost of memory accesses depends on their locality and on contention among processors. By running a carefully designed suite of microbenchmarks, we provide quantitative evidence that memory hierarchy effects impact performance far more substantially than other phenomena related to contention. We also fit three cost functions based on variants of the BSP model, which do not account for the hierarchy, and a newly defined function F expressed in terms of hardware counters, which captures both memory hierarchy and contention effects. We test the accuracy of all the functions on both synthetic and application benchmarks showing that, un...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Application performance on modern microprocessors depends heavily on performance related characteris...
Benchmarking high performance computing systems is crucial to optimize memory consumption and maximi...
We study the issue of performance prediction on the SGI-Power Challenge, a typical SMP. On such a pl...
Modern processors incorporate several performance monitoring units, which can be used to count event...
In this paper, the authors characterize application performance with a memory-centric view. Using a ...
International audienceMemory interferences may introduce important slowdowns in applications running...
Abstract—We present a study on estimating the dynamic power consumption of a processor based on perf...
As computers with tens of thousands of processors successfully deliver high performance power for so...
Hierarchical memory is a cornerstone of modern hardware design because it provides high memory perfo...
As the speed gap widens between CPU and memory, memory hierarchy performance has become the bottlene...
Many situations call for an estimation of the execution time of applications, e.g., during design or...
While computers with tens of thousands of processors have successfully delivered high performance po...
Hierarchical memory is a cornerstone of modern hardware design because it provides high memory perfo...
Historically, energy constrained devices (ECDs) have favored simple in-order pipelines over out-of-o...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Application performance on modern microprocessors depends heavily on performance related characteris...
Benchmarking high performance computing systems is crucial to optimize memory consumption and maximi...
We study the issue of performance prediction on the SGI-Power Challenge, a typical SMP. On such a pl...
Modern processors incorporate several performance monitoring units, which can be used to count event...
In this paper, the authors characterize application performance with a memory-centric view. Using a ...
International audienceMemory interferences may introduce important slowdowns in applications running...
Abstract—We present a study on estimating the dynamic power consumption of a processor based on perf...
As computers with tens of thousands of processors successfully deliver high performance power for so...
Hierarchical memory is a cornerstone of modern hardware design because it provides high memory perfo...
As the speed gap widens between CPU and memory, memory hierarchy performance has become the bottlene...
Many situations call for an estimation of the execution time of applications, e.g., during design or...
While computers with tens of thousands of processors have successfully delivered high performance po...
Hierarchical memory is a cornerstone of modern hardware design because it provides high memory perfo...
Historically, energy constrained devices (ECDs) have favored simple in-order pipelines over out-of-o...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Application performance on modern microprocessors depends heavily on performance related characteris...
Benchmarking high performance computing systems is crucial to optimize memory consumption and maximi...