Heterogeneous computing system with both CPUs and GPUs has become a class of widely used hardware architecture in supercomputers. As heterogeneous systems delivering higher computational performance, they are being built with an increasing number of complex components. This is anticipated that these systems will be more susceptible to hardware faults with higher power consumption. Numerical linear algebra libraries are used in a wide spectrum of high-performance scientific applications. Among numerical linear algebra operations, one-sided matrix decompositions can sometimes take a large portion of execution time or even dominate the whole scientific application execution. Due to the computational characteristic of one-sided matrix decomposi...
The mean time between failure (MTBF) of large supercomputers is decreasing, and future exascale comp...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
Heterogeneous computing system with both CPUs and GPUs has become a class of widely used hardware ar...
Current algorithm-based fault tolerance (ABFT) approach for one-sided matrix decomposition on hetero...
Extensive researches have been done on developing and optimizing algorithm-based fault tolerance (AB...
AbstractOne-sided dense matrix factorizations are important computational kernels in many scientific...
As the exascale supercomputers are expected to embark around 2020, supercomputers nowadays expand ra...
Deep learning technology has enabled the development of increasingly complex safety-related autonomo...
Near Threshold Voltage (NTV) computing has been recently proposed as a technique to save energy, at ...
Matrix Factorization (MF) has been widely applied in machine learning and data mining. Due to the la...
As large-scale linear equation systems are pervasive in many scientific fields, great efforts have b...
Abstract- The rapid progress in VLSI technology has reduced the cost of hardware, allowing multiple ...
Dense matrix factorizations, like LU, Cholesky and QR, are widely used for scientific applications t...
This paper presents an algorithm based fault tolerance method to harden three two-sided matrix facto...
The mean time between failure (MTBF) of large supercomputers is decreasing, and future exascale comp...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
Heterogeneous computing system with both CPUs and GPUs has become a class of widely used hardware ar...
Current algorithm-based fault tolerance (ABFT) approach for one-sided matrix decomposition on hetero...
Extensive researches have been done on developing and optimizing algorithm-based fault tolerance (AB...
AbstractOne-sided dense matrix factorizations are important computational kernels in many scientific...
As the exascale supercomputers are expected to embark around 2020, supercomputers nowadays expand ra...
Deep learning technology has enabled the development of increasingly complex safety-related autonomo...
Near Threshold Voltage (NTV) computing has been recently proposed as a technique to save energy, at ...
Matrix Factorization (MF) has been widely applied in machine learning and data mining. Due to the la...
As large-scale linear equation systems are pervasive in many scientific fields, great efforts have b...
Abstract- The rapid progress in VLSI technology has reduced the cost of hardware, allowing multiple ...
Dense matrix factorizations, like LU, Cholesky and QR, are widely used for scientific applications t...
This paper presents an algorithm based fault tolerance method to harden three two-sided matrix facto...
The mean time between failure (MTBF) of large supercomputers is decreasing, and future exascale comp...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...