Memory contention is one of the largest sources of inter-core interference in statically partitioned multicore systems: it reduces overall application performance and makes execution times unpredictable. A first step toward predictable execution is to accurately measure the memory bandwidth each application consumes. Such measurements can be used to track down bottlenecks, to improve partitioning among cores, and ultimately to arbitrate and police access to the memory bus. We propose using hardware performance counters to continuously track the memory bandwidth consumed by different applications executing in parallel. In this paper we describe ongoing efforts exploring suitable perfo...
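As an illustration of the measurement idea only, the following minimal sketch converts two readings of a cumulative counter into a bandwidth estimate. It is not the method of the paper: the `CounterSample` type, the `mem_transactions` field, the 64-byte transfer size, and the 48-bit counter width are all assumptions chosen for the example, and the counter that actually corresponds to memory-bus traffic depends on the processor.

```python
from dataclasses import dataclass

# Assumption: each counted event moves one 64-byte cache line over the memory bus.
CACHE_LINE_BYTES = 64
# Assumption: the hardware counter is 48 bits wide and wraps around silently.
COUNTER_BITS = 48


@dataclass
class CounterSample:
    """One reading of a (hypothetical) cumulative memory-transaction counter."""
    timestamp_s: float       # time of the reading, in seconds
    mem_transactions: int    # raw cumulative counter value


def bandwidth_bytes_per_s(prev: CounterSample,
                          curr: CounterSample,
                          line_bytes: int = CACHE_LINE_BYTES,
                          counter_bits: int = COUNTER_BITS) -> float:
    """Estimate consumed memory bandwidth between two counter readings."""
    elapsed = curr.timestamp_s - prev.timestamp_s
    if elapsed <= 0:
        raise ValueError("samples must be in increasing time order")
    # The modulo handles a single wraparound of the counter between readings.
    delta = (curr.mem_transactions - prev.mem_transactions) % (1 << counter_bits)
    return delta * line_bytes / elapsed


if __name__ == "__main__":
    before = CounterSample(timestamp_s=0.0, mem_transactions=1000)
    after = CounterSample(timestamp_s=0.5, mem_transactions=2000)
    print(bandwidth_bytes_per_s(before, after))
```

Sampling one such counter per core at a fixed interval would give a per-application bandwidth trace, provided each application is pinned to its own core as in a statically partitioned system.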
We present a study on estimating the dynamic power consumption of a processor based on perf...
Application-level monitoring is required for continuously operating software systems to maintain the...
We present a low-overhead method for accurately measuring application performance (CPI) and off-chip...
On multicore processors, co-executing applications compete for shared resources, such as cache capac...
Modern processors incorporate several performance monitoring units, which can be used to count event...
Memory interferences may introduce important slowdowns in applications running...
As the speed gap widens between CPU and memory, memory hierarchy performance has become the bottlene...
To reduce latency and increase bandwidth to memory, modern microprocessors are designed with deep me...
The aim of this paper is to present a high precision and event-versatile MBPTA...
Hierarchical memory is a cornerstone of modern hardware design because it provides high memory perfo...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
We introduce the usage of hardware performance counters (HPCs) as a new method that allows very prec...
Multiprocessors are often quoted as being capable of a ‘peak performance,’ but in practise it is dif...