Duplication with Comparison (DWC) is an effective software-level solution to improve the reliability of computing devices. However, it introduces performance and energy consumption overheads that could be unsuitable for high-performance computing or real-time safety-critical applications. In this article, we present Reduced-Precision Duplication with Comparison (RP-DWC) as a means to lower the overhead of DWC by executing the redundant copy in reduced precision. RP-DWC is particularly suitable for modern mixed-precision architectures, such as NVIDIA GPUs, that feature dedicated functional units for computing with programmable accuracy. We discuss the benefits and challenges associated with RP-DWC and show that the intrinsic difference betwe...
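A minimal sketch of the RP-DWC idea described above, under stated assumptions: the primary copy of a dot product runs in binary64, the redundant copy is emulated in binary32 by rounding every intermediate value, and the two results are compared against a relative tolerance. The function names, the `rel_tol` value, and the `fault` hook are hypothetical and chosen only for illustration; they are not taken from the article, which targets dedicated mixed-precision hardware units rather than software emulation.

```python
import struct


def to_f32(x: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def dot_full(a, b):
    """Primary copy: dot product accumulated in binary64."""
    acc = 0.0
    for x, y in zip(a, b):
        acc += x * y
    return acc


def dot_reduced(a, b):
    """Redundant copy: same dot product with every step rounded to binary32."""
    acc = 0.0
    for x, y in zip(a, b):
        prod = to_f32(to_f32(x) * to_f32(y))
        acc = to_f32(acc + prod)
    return acc


def rp_dwc(a, b, rel_tol=1e-4, fault=None):
    """Run both copies and flag a mismatch larger than rel_tol.

    rel_tol is an assumed threshold: it must be wide enough to absorb the
    rounding difference between the two precisions.
    fault is a hypothetical hook that corrupts the primary result.
    """
    full = dot_full(a, b)
    if fault is not None:
        full = fault(full)
    reduced = dot_reduced(a, b)
    scale = max(abs(full), abs(reduced), 1e-30)
    detected = abs(full - reduced) > rel_tol * scale
    return full, detected


a = [0.1 * i for i in range(1, 101)]
b = [1.0 / i for i in range(1, 101)]

_, clean_flag = rp_dwc(a, b)
_, large_flag = rp_dwc(a, b, fault=lambda v: v * 2.0)
_, small_flag = rp_dwc(a, b, fault=lambda v: v * (1.0 + 1e-7))
print(clean_flag, large_flag, small_flag)
```

In this sketch a fault-free run and a corruption smaller than the tolerance both pass the comparison, while a large corruption is flagged. That illustrates the trade-off a reduced-precision check has to manage: the tolerance hides rounding differences, and with them any fault of similar magnitude.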
As device dimensions continue to be aggressively scaled, microprocessors are becoming increasingly v...
Increased power densities (and resultant temperatures) and other effects of device scaling are predi...
With the ever-increasing volume of data and the ability to integrate different data sourc...
Graphics Processing Units (GPUs) offer the possibility to execute floating-poi...
Duplication With Comparison (DWC) is a traditional and accepted method for improving systems’ reliab...
Customisable data formats provide an opportunity for exploring trade-offs in accuracy and performanc...
Reduced-precision redundancy (RPR) has been shown to be a viable alternative to triple modular redun...
General-purpose graphics processing units (GPGPUs) are extensively used in high-performance ...
Applications in various fields, such as machine learning, scientific computing and signal/image proc...
Due to many factors, such as high transistor density, high frequency, and low ...
Even though graphics processors (GPUs) are becoming increasingly popular for general purpose computi...
Variation in performance and power across manufactured parts and their operating conditions is an ac...
Selecting the ideal trade-off between reliability improvement and cost (i.e., ...
As the semiconductor roadmap reaches smaller feature sizes and the end of Dennard Scaling, design go...
Full-precision Floating-Point Units (FPUs) can be a source of extensive hardwa...