Step caches are caches in which data entered to an cache array is kept valid only until the end of ongoing step of execution. Together with an advanced pipelined multithreaded architecture they can be used to implement concurrent read concurrent write (CRCW) memory access in shared memory multiprocessor systems on chip (MP-SOC) without cache coherency problems. Unfortunately obvious step cache architectures assume full associativity, which can become expensive since the size and thus associativity of caches equal the number of threads per processor being at least the square root of the number of processors. In this paper, we describe a technique to radically reduce the associativity and even size of step caches in CRCW operation. We give a ...
We introduce a new organization for multi-bank caches: the skewed-associative cache. A two-way skewe...
In this paper, we study a hardware-supported, compiler-directed (HSCD) cache coherence scheme, which...
In this paper, we study a hardware-supported, compilerdirected (HSCD) cache coherence scheme, which ...
In this paper we introduce a novel class of caches, named step caches, that can be used to implement...
Recent advances in shared memory multiprocessor system-on-a-chip (MP-SOC) architectures include usin...
A new cache memory organization called “Shared-Way Set Associative” (SWSA) is described in this pape...
Due to inherent non-scalability of superscalar processors, processor manufacturers have switched to ...
Recent research shows that the occupancy of the coherence controllers is a major performance bottlen...
The effects of various cache coherence strategies are analyzed for a multiported shared memory multi...
Microprocessor industry has converged on chip multiprocessor (CMP) as the architecture of choice to ...
To reduce the average time needed to perform a read or a write access in a multiprocessor, a cache i...
This paper describes Constrained Associative-Mapping-of-Tracking-Entries (C-AMTE), a scalable mechan...
We introduce a new organization for multi-bank caches: the skewed-associative cache. A two-way skewe...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Due to VLSI lithography problems and the limitation of additional architectural enhancements uniproc...
We introduce a new organization for multi-bank caches: the skewed-associative cache. A two-way skewe...
In this paper, we study a hardware-supported, compiler-directed (HSCD) cache coherence scheme, which...
In this paper, we study a hardware-supported, compilerdirected (HSCD) cache coherence scheme, which ...
In this paper we introduce a novel class of caches, named step caches, that can be used to implement...
Recent advances in shared memory multiprocessor system-on-a-chip (MP-SOC) architectures include usin...
A new cache memory organization called “Shared-Way Set Associative” (SWSA) is described in this pape...
Due to inherent non-scalability of superscalar processors, processor manufacturers have switched to ...
Recent research shows that the occupancy of the coherence controllers is a major performance bottlen...
The effects of various cache coherence strategies are analyzed for a multiported shared memory multi...
Microprocessor industry has converged on chip multiprocessor (CMP) as the architecture of choice to ...
To reduce the average time needed to perform a read or a write access in a multiprocessor, a cache i...
This paper describes Constrained Associative-Mapping-of-Tracking-Entries (C-AMTE), a scalable mechan...
We introduce a new organization for multi-bank caches: the skewed-associative cache. A two-way skewe...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Due to VLSI lithography problems and the limitation of additional architectural enhancements uniproc...
We introduce a new organization for multi-bank caches: the skewed-associative cache. A two-way skewe...
In this paper, we study a hardware-supported, compiler-directed (HSCD) cache coherence scheme, which...
In this paper, we study a hardware-supported, compilerdirected (HSCD) cache coherence scheme, which ...