There exists a divide between the ever-increasing demand for high-performance embedded systems and the availability of practical methodologies to understand the interplay of complex data-intensive applications with hardware memory resources. On the one hand, traditional static analysis approaches are seldomly applicable to latest-generation multi-core platforms due to a lack of accurate micro-architectural models. On the other hand, measurement-based methods only provide coarse-grained information about the end-to-end execution of a given real-time application. In this paper, we describe a novel methodology, namely Black-Box Profiling (BBProf), to gather fine-grained insights on the usage of cache resources in applications of realistic comp...
The vast majority of high-performance embedded systems implement multi-level CPU cache hierarchies. ...
Modern computing systems are constructed using commodity multi-core processors, on which part of the...
The proliferation of multi-core, accelerator-enabled embedded systems has introduced new opportuniti...
There exists a divide between the ever-increasing demand for high-performance embedded systems and t...
© ACM, 2015. This is the author's version of the work. It is posted here by permission of ACM for yo...
Contention for shared cache resources has been recognized as a major bottleneck for multicores—espec...
To reduce latency and increase bandwidth to memory, modern microprocessors are often designed with d...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
There is a constant need to improve processor performance on any system. It is vital to be able to v...
The ever-increasing gap between processor and memory speed is an issue also in embedded systems, bec...
AbstractApplication analysis is facilitated through a number of program profiling tools. The tools v...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
The vast majority of high-performance embedded systems implement multi-level CPU cache hierarchies. ...
Modern computing systems are constructed using commodity multi-core processors, on which part of the...
The proliferation of multi-core, accelerator-enabled embedded systems has introduced new opportuniti...
There exists a divide between the ever-increasing demand for high-performance embedded systems and t...
© ACM, 2015. This is the author's version of the work. It is posted here by permission of ACM for yo...
Contention for shared cache resources has been recognized as a major bottleneck for multicores—espec...
To reduce latency and increase bandwidth to memory, modern microprocessors are often designed with d...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
There is a constant need to improve processor performance on any system. It is vital to be able to v...
The ever-increasing gap between processor and memory speed is an issue also in embedded systems, bec...
AbstractApplication analysis is facilitated through a number of program profiling tools. The tools v...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
The vast majority of high-performance embedded systems implement multi-level CPU cache hierarchies. ...
Modern computing systems are constructed using commodity multi-core processors, on which part of the...
The proliferation of multi-core, accelerator-enabled embedded systems has introduced new opportuniti...