Abstract: Stacked memory modules are likely to be tightly integrated with the processor. These modules must operate reliably, because a failure can require replacing the entire socket. To make matters worse, stacked memory designs are susceptible to new failure modes (for example, faulty through-silicon vias, or TSVs) that can cause large portions of memory, such as a bank, to become faulty. To avoid data loss from such large-granularity failures, the memory system may use symbol-based codes that stripe the data for a cache line across several banks (or channels). Unfortunately, such data striping reduces memory-level parallelism, causing significant slowdown and higher memory power consumption. This paper proposes Citadel, a ...
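The trade-off described in the abstract, between tolerating a bank failure and preserving memory-level parallelism, can be illustrated with a small sketch. This is not the paper's mechanism. `NUM_BANKS`, `LINE_BYTES`, the round-robin mapping, and all function names are assumptions chosen for illustration only.

```python
# Illustrative sketch only: the constants and the address mapping below are
# assumed values, not parameters taken from the paper.
NUM_BANKS = 8     # assumed number of banks a line could be striped across
LINE_BYTES = 64   # assumed cache-line size in bytes


def banks_touched(line_addr, stripe_width):
    """Return the set of banks activated to read one cache line.

    stripe_width == 1 keeps the whole line in a single bank;
    stripe_width == NUM_BANKS spreads it over every bank.
    """
    first = line_addr % NUM_BANKS
    return {(first + i) % NUM_BANKS for i in range(stripe_width)}


def max_concurrent_lines(stripe_width):
    """Upper bound on cache-line accesses that can use disjoint banks at once."""
    return NUM_BANKS // stripe_width


def bytes_lost_per_line(stripe_width):
    """Bytes of each line stored in any one bank, i.e. lost if that bank fails."""
    return LINE_BYTES // stripe_width


if __name__ == "__main__":
    for width in (1, NUM_BANKS):
        print(
            f"stripe_width={width}: "
            f"banks per access={len(banks_touched(3, width))}, "
            f"parallel lines={max_concurrent_lines(width)}, "
            f"bytes lost on a bank failure={bytes_lost_per_line(width)}"
        )
```

Under these assumptions, full striping leaves only one bank's share of each line exposed to a bank failure, which a symbol-based code could then correct, but every access occupies all banks. Keeping a line in one bank allows more independent accesses, but a bank failure then takes out the whole line.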
With memories continuing to dominate the area, power, cost and performance of a design, there is a c...
Continued scaling of DRAM technologies induces more faulty DRAM cells than before. These inherent fa...
Improving energy efficiency is critical to increasing computing capability, from mobile devices oper...
Future computing platforms will increasingly demand more stringent memory resiliency mechanisms ...
Die-stacked DRAM can provide large amounts of in-package, high-bandwidth cache storage. For server a...
Adaptive-granularity memory architectures have been considered mainly because of main memory bottlen...
As device density grows, each transistor gets smaller and more fragile leading to an overall higher ...
Memory reliability has been a major design constraint for mission-critical and large-scale systems f...
Abstract—Recent studies of DRAM failures in data centers and supercomputer environments have highlig...
As we look toward exascale it is clear that high-capacity HPC storage systems will incorporate the l...
Servers and HPC systems often use a strong memory error correction code, or ECC, to meet their relia...
DRAM scaling has been the prime driver for increasing the capacity of main memory system over the p...
Memory system reliability is a serious and growing concern in modern servers. Existing chip...