This paper proposes a novel L1 data cache design with dual-versioning SRAM cells (dvSRAM) for chip multi-processors (CMP) that implement optimistic concurrency proposals. In this new cache architecture, each dvSRAM cell has two cells, a main cell and a secondary cell, which keep two versions of the same data. These values can be accessed, modified, moved back and forth between the main and secondary cells within the access time of the cache. We design and simulate a 32-KB dual-versioning L1 data cache with 45nm CMOS technology at 2GHz processor frequency and 1V supply voltage, which we describe in detail. We also introduce three well-known use cases that make use of optimistic concurrency execution and that can benefit from our proposed des...
SRAM and DRAM cells have been the predominant technologies used to implement memory cells in compute...
Weak memory consistency models can maximize system performance by enabling hardware and compiler opt...
A shared-L1 cache architecture is proposed for tightly coupled processor clusters. Sharing an L1 tig...
This paper proposes a novel L1 data cache design with dual-versioning SRAM cells (dvSRAM) for chip m...
This paper proposes a novel adaptable and reliable L1 data cache design (Adapcache) with the unique ...
(c) 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for...
Cache memories have been usually implemented with Static Random-Access Memory (SRAM) technology sin...
Caches are known to consume a large part of total microprocessor power. Traditionally, voltage scali...
Computing technology has witnessed an inimitable progress in the last decades which is the result of...
High-performance processors use a large set–associative L1 data cache with multiple ports. As clock ...
High Energy efficiency and high performance are the key regiments for Internet of Things (IoT) edge ...
With continued technology scaling, process variations will be especially detrimental to six-transist...
Caches are known to consume a large part of total microprocessor energy. Traditionally, voltage scal...
“©2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Die-stacking is a new technology that allows multiple integrated circuits to be stacked on top of ea...
SRAM and DRAM cells have been the predominant technologies used to implement memory cells in compute...
Weak memory consistency models can maximize system performance by enabling hardware and compiler opt...
A shared-L1 cache architecture is proposed for tightly coupled processor clusters. Sharing an L1 tig...
This paper proposes a novel L1 data cache design with dual-versioning SRAM cells (dvSRAM) for chip m...
This paper proposes a novel adaptable and reliable L1 data cache design (Adapcache) with the unique ...
(c) 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for...
Cache memories have been usually implemented with Static Random-Access Memory (SRAM) technology sin...
Caches are known to consume a large part of total microprocessor power. Traditionally, voltage scali...
Computing technology has witnessed an inimitable progress in the last decades which is the result of...
High-performance processors use a large set–associative L1 data cache with multiple ports. As clock ...
High Energy efficiency and high performance are the key regiments for Internet of Things (IoT) edge ...
With continued technology scaling, process variations will be especially detrimental to six-transist...
Caches are known to consume a large part of total microprocessor energy. Traditionally, voltage scal...
“©2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Die-stacking is a new technology that allows multiple integrated circuits to be stacked on top of ea...
SRAM and DRAM cells have been the predominant technologies used to implement memory cells in compute...
Weak memory consistency models can maximize system performance by enabling hardware and compiler opt...
A shared-L1 cache architecture is proposed for tightly coupled processor clusters. Sharing an L1 tig...