The vulnerability of computer nodes to component failures is a critical issue for cluster-based file systems. This paper studies the development and deployment of mirroring in cluster-based parallel virtual file systems to provide fault tolerance, and analyzes the tradeoffs between performance and reliability in the mirroring scheme. It presents the design and implementation of CEFT, a scalable RAID-10 style file system based on PVFS, and proposes four novel mirroring protocols, distinguished by whether the mirroring operations are server-driven or client-driven and whether they are asynchronous or synchronous. The comparisons of their write performance, measured in a real cluster, and their reliability and availability, obtained throug...
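The four protocols follow from two independent design choices, which can be illustrated with a minimal sketch. The names `DRIVERS`, `TIMINGS`, `mirroring_protocols`, and `raid10_placement` are hypothetical and do not come from CEFT. The placement function assumes plain round-robin striping with an identically indexed mirror group, which is the generic RAID-10 idea and not necessarily the layout CEFT uses.

```python
from itertools import product

# Hypothetical labels: the abstract names only the two design axes.
DRIVERS = ("server-driven", "client-driven")
TIMINGS = ("asynchronous", "synchronous")


def mirroring_protocols():
    """Return every (driver, timing) combination: 2 x 2 = 4 variants."""
    return list(product(DRIVERS, TIMINGS))


def raid10_placement(block_index, servers_per_group):
    """Map a stripe unit to a (primary, mirror) server pair, RAID-10 style.

    Assumption: blocks are striped round-robin across the primary group,
    and each primary server has a mirror with the same index.
    """
    slot = block_index % servers_per_group
    return ("primary", slot), ("mirror", slot)


if __name__ == "__main__":
    for driver, timing in mirroring_protocols():
        print(f"{driver}, {timing}")
    print(raid10_placement(5, 4))
```

Under these assumptions, each written block has two copies on different servers. The protocol variant determines which party sends the second copy and whether the write returns before that copy completes.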
As Linux clusters have matured as platforms for low-cost, high-performance parallel computing, softw...
The exponential growth in user and application data entails new means for providing fault t...
Providing data availability in a high performance computing environment is very importan...
Without any additional hardware, CEFT-PVFS utilizes the existing disks on each cluster nod...
© 2005 Springer Verlag. Providing data availability in a high performance computing envir...
As parallel file systems span larger and larger numbers of nodes in order to provide the perfo...
Modern cluster file systems such as PVFS that stripe files across multiple nodes have show...
While aggregating the throughput of existing disks on cluster nodes is a cost-effective approach to ...
Modern cluster file systems such as PVFS that stripe files across multiple nodes have been shown to provi...
The introduction of Exascale storage into production systems will lead to an increase in the number ...
The exponential growth in user and application data entails new means for providing fault t...
As we move towards the Exascale era of supercomputing, node-level failures are becoming more common...