Instruction set extensions (ISEs) improve the performance and energy consumption of application-specific processors. ISEs can use architecturally visible storage (AVS), localized compiler-controlled memories, to provide higher I/O bandwidth than reading data from the processor pipeline. AVS creates coherence and consistence problems with the data cache. Although a hardware coherence protocol could solve the problem, this approach is costly for a single-processor system. As a low-cost alternative, we introduce Virtual Ways, which ensures coherence through a reduced form of inclusion between the data cache and AVS. VirtualWays achieve higher performance and lower energy consumption than using a hardware coherence protocol. © 2014 ACM
Traditionally, cache coherence in multiprocessors has been maintained in hardware. However, the cost...
Present application specific embedded systems tend to choose instruction set extensions (ISEs) based...
Caches have the potential to provide multiprocessors with an automatic mechanism for reducing both n...
Customizable processors augmented with application-specific Instruction Set Extensions (ISEs) have b...
Way Stealing is a simple architectural modification to a cache-based processor that increases the da...
Cache coherence protocols limit the scalability of multicore and manycore architectures and are resp...
This paper introduces Way Stealing, a simple architectural modification to a cache-based processor t...
: Virtual memory based cache coherence is a mechanism that relies only on hardware that already exi...
With the advancement of design and fabrication of high-performance integrated circuits technology, i...
Memory is hardware that is used by computer to load the operating system and run programs. It is bui...
Modern out-of-order processor architectures focus significantly on the high performance execution of...
We present ALFRED: a virtual memory abstraction that resolves the dichotomy between volatile and non...
Virtual memory is a classic computer science abstraction and is ubiquitous in all scales of computin...
Directory-based cache coherence is the de-facto standard for scalable shared-memory multi/many-cores...
Virtual memory offers a simple hardware abstraction to programmers freeing them from the tedious pro...
Traditionally, cache coherence in multiprocessors has been maintained in hardware. However, the cost...
Present application specific embedded systems tend to choose instruction set extensions (ISEs) based...
Caches have the potential to provide multiprocessors with an automatic mechanism for reducing both n...
Customizable processors augmented with application-specific Instruction Set Extensions (ISEs) have b...
Way Stealing is a simple architectural modification to a cache-based processor that increases the da...
Cache coherence protocols limit the scalability of multicore and manycore architectures and are resp...
This paper introduces Way Stealing, a simple architectural modification to a cache-based processor t...
: Virtual memory based cache coherence is a mechanism that relies only on hardware that already exi...
With the advancement of design and fabrication of high-performance integrated circuits technology, i...
Memory is hardware that is used by computer to load the operating system and run programs. It is bui...
Modern out-of-order processor architectures focus significantly on the high performance execution of...
We present ALFRED: a virtual memory abstraction that resolves the dichotomy between volatile and non...
Virtual memory is a classic computer science abstraction and is ubiquitous in all scales of computin...
Directory-based cache coherence is the de-facto standard for scalable shared-memory multi/many-cores...
Virtual memory offers a simple hardware abstraction to programmers freeing them from the tedious pro...
Traditionally, cache coherence in multiprocessors has been maintained in hardware. However, the cost...
Present application specific embedded systems tend to choose instruction set extensions (ISEs) based...
Caches have the potential to provide multiprocessors with an automatic mechanism for reducing both n...