Abstract The exponential growth in user and application data entails new means for providing fault tolerance and protection against data loss. High Performance Com-puting (HPC) storage systems, which are at the forefront of handling the data del-uge, typically employ hardware RAID at the backend. However, such solutions are costly, do not ensure end-to-end data integrity, and can become a bottleneck during data reconstruction. In this paper, we design an innovative solution to achieve a flex-ible, fault-tolerant, and high-performance RAID-6 solution for a parallel file system (PFS). Our system utilizes low-cost, strategically placed GPUs — both on the client and server sides — to accelerate parity computation. In contrast to hardware-based ...