Dead blocks are handled inefficiently in the multi-level cache hierarchies of many-core architectures because the decision whether a block is dead has to be made locally at each level. This paper introduces runtime-assisted global cache management to quickly deem blocks dead at all levels. The scheme is based on a cooperative hardware/software approach that leverages dynamic information about future region accesses. We show that our proposed runtime-assisted global cache management approach outperforms previously proposed local dead-block management schemes
Efficient cache hierarchy management is of a paramount importance when designing high performance pr...
As the technology continuous to shrink, power consumption appears to be the main design parameter. O...
Hiding memory latency is critical in modern machines. Typically, machines have used cache and addres...
Dead blocks are handled inefficiently in the multi-level cache hierarchies of many-core architecture...
Dead blocks are handled inefficiently in multi-level cache hierarchies because the decision as to wh...
Task-parallel programs inefficiently utilize the cache hierarchy due to the presence of dead blocks ...
Last-level caches bridge the speed gap between processors and the off-chip memory hierarchy and redu...
Last-level caches (LLCs) bridge the processor/memory speed gap and reduce energy consumed per access...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
Cache memories currently treat all blocks as if they were equally important. This assumption of equa...
Multilevel caching is common in many storage config-urations, introducing new challenges to cache ma...
One of the key requirements to obtaining high performance from chip multiprocessors (CMPs) is to eff...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
[EN] Multi-level buffer cache hierarchies are now commonly seen in most client/server cluster config...
This dissertation analyzes a way to improve cache performance via active management of a target cach...
Efficient cache hierarchy management is of a paramount importance when designing high performance pr...
As the technology continuous to shrink, power consumption appears to be the main design parameter. O...
Hiding memory latency is critical in modern machines. Typically, machines have used cache and addres...
Dead blocks are handled inefficiently in the multi-level cache hierarchies of many-core architecture...
Dead blocks are handled inefficiently in multi-level cache hierarchies because the decision as to wh...
Task-parallel programs inefficiently utilize the cache hierarchy due to the presence of dead blocks ...
Last-level caches bridge the speed gap between processors and the off-chip memory hierarchy and redu...
Last-level caches (LLCs) bridge the processor/memory speed gap and reduce energy consumed per access...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
Cache memories currently treat all blocks as if they were equally important. This assumption of equa...
Multilevel caching is common in many storage config-urations, introducing new challenges to cache ma...
One of the key requirements to obtaining high performance from chip multiprocessors (CMPs) is to eff...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
[EN] Multi-level buffer cache hierarchies are now commonly seen in most client/server cluster config...
This dissertation analyzes a way to improve cache performance via active management of a target cach...
Efficient cache hierarchy management is of a paramount importance when designing high performance pr...
As the technology continuous to shrink, power consumption appears to be the main design parameter. O...
Hiding memory latency is critical in modern machines. Typically, machines have used cache and addres...