Decreasing voltage levels and continued transistor scaling have drastically increased the chance of a processor bit encountering a soft error. We find that the microarchitecture state in an out-of-order core is vulnerable to soft errors especially while waiting for data to return from memory. The severity of the problem is further aggravated by the increasingly large size of microarchitecture state with every new processor generation. Prior solutions are ineffective as they incur too high overhead in terms of chip area, energy consumption and/or performance.In this paper, we make the observation that runahead execution, which was originally conceived to improve performance, also improves soft-error reliability as an unintended side effect. ...
Soft errors induced by energetic particle strikes in on-chip memory structures, such as L1 data/inst...
In this paper, we show that the vulnerability of memory components due to data retention in the pres...
As device density grows, each transistor gets smaller and more fragile leading to an overall higher ...
textHigh-performance processors tolerate latency using out-of-order execution. Unfortunately, today...
Runahead execution improves processor performance by accurately prefetching long-latency memory acce...
The memory wall places a significant limit on performance for many modern workloads. These applicati...
Today’s high-performance processors face main-memory latencies on the order of hundreds of processor...
Soft error tolerance is a hot research topic for modern microprocessors. We have been investigating ...
Soft errors (or Transient faults) are temporary faults that arise in a circuit due to a variety of i...
Runahead execution is a technique that improves processor performance by pre-executing the running a...
Microprocessors are increasingly used in a variety of applications from small handheld calculators t...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Submitted for publication. Please do not distribute. Although device scaling has been providing stea...
With the scaling of technology, transient errors caused by external particle strikes have become a c...
The exponentially increasing gap between processors and off-chip memory, as measured in processor cy...
Soft errors induced by energetic particle strikes in on-chip memory structures, such as L1 data/inst...
In this paper, we show that the vulnerability of memory components due to data retention in the pres...
As device density grows, each transistor gets smaller and more fragile leading to an overall higher ...
textHigh-performance processors tolerate latency using out-of-order execution. Unfortunately, today...
Runahead execution improves processor performance by accurately prefetching long-latency memory acce...
The memory wall places a significant limit on performance for many modern workloads. These applicati...
Today’s high-performance processors face main-memory latencies on the order of hundreds of processor...
Soft error tolerance is a hot research topic for modern microprocessors. We have been investigating ...
Soft errors (or Transient faults) are temporary faults that arise in a circuit due to a variety of i...
Runahead execution is a technique that improves processor performance by pre-executing the running a...
Microprocessors are increasingly used in a variety of applications from small handheld calculators t...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Submitted for publication. Please do not distribute. Although device scaling has been providing stea...
With the scaling of technology, transient errors caused by external particle strikes have become a c...
The exponentially increasing gap between processors and off-chip memory, as measured in processor cy...
Soft errors induced by energetic particle strikes in on-chip memory structures, such as L1 data/inst...
In this paper, we show that the vulnerability of memory components due to data retention in the pres...
As device density grows, each transistor gets smaller and more fragile leading to an overall higher ...