Today’s computers have gigabytes of main memory due to improved DRAM density. As density increases, smaller bit cells become more susceptible to errors. With an increase in error susceptibility, the need for memory resiliency also increases. Self-testing of memory health can proactively check for errors to improve resiliency. Developing a memory diagnostic is challenging due to requirements for transparency, scalability and low performance overheads. In my thesis, I developed a software-only self-test to continuously test memory. I present the challenges and the design for two approaches, called COMeT and Asteroid, that are built on a common software framework for memory diagnostic and target chip multiprocessors. COMeT tests memory health ...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Persistent memory (PM) technologies offer performance close to DRAM with persistence. Persistent mem...
Modern memory systems play a critical role in the performance of applications, but a detailed unders...
Today’s computers have gigabytes of main memory due to improved DRAM density. As density increases, ...
Abstract—Memory errors are a major source of reliability problems in current computers. Undetected e...
[[abstract]]Hundreds of memory cores can be found on a typical system-on-chip (SOC) today. Diagnosin...
Leveraging Storage Class Memory (SCM) as a universal memory--i.e. as memory and storage at the same ...
Almost all functional safety standards that regulate safety-critical domains impose to periodically ...
<p>Memory devices represent a key component of datacenter total cost of ownership (TCO), and techniq...
textFuture computing platforms will increasingly demand more stringent memory resiliency mechanisms ...
As technology scales, the increased vulnerability of modern systems due to unreliable components bec...
Modern memory consistency models are complex, and it is difficult to reason about the relaxed behavi...
Modern software systems are deeply embedded into our daily lives; the failures of these systems can ...
Although today’s computers provide huge amounts of main memory, the ever-increasing load of large da...
Thesis (Ph. D.)--University of Rochester. Dept. of Electrical and Computer Engineering, 2012In moder...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Persistent memory (PM) technologies offer performance close to DRAM with persistence. Persistent mem...
Modern memory systems play a critical role in the performance of applications, but a detailed unders...
Today’s computers have gigabytes of main memory due to improved DRAM density. As density increases, ...
Abstract—Memory errors are a major source of reliability problems in current computers. Undetected e...
[[abstract]]Hundreds of memory cores can be found on a typical system-on-chip (SOC) today. Diagnosin...
Leveraging Storage Class Memory (SCM) as a universal memory--i.e. as memory and storage at the same ...
Almost all functional safety standards that regulate safety-critical domains impose to periodically ...
<p>Memory devices represent a key component of datacenter total cost of ownership (TCO), and techniq...
textFuture computing platforms will increasingly demand more stringent memory resiliency mechanisms ...
As technology scales, the increased vulnerability of modern systems due to unreliable components bec...
Modern memory consistency models are complex, and it is difficult to reason about the relaxed behavi...
Modern software systems are deeply embedded into our daily lives; the failures of these systems can ...
Although today’s computers provide huge amounts of main memory, the ever-increasing load of large da...
Thesis (Ph. D.)--University of Rochester. Dept. of Electrical and Computer Engineering, 2012In moder...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Persistent memory (PM) technologies offer performance close to DRAM with persistence. Persistent mem...
Modern memory systems play a critical role in the performance of applications, but a detailed unders...