In this paper we present a recovery-conscious framework for improving the fault resiliency and recovery efficiency of highly concurrent embedded storage software systems. Our frame-work consists of a three-tier architecture and a suite of recovery conscious techniques. In the top tier, we promote fine-grained recovery at the task level by introducing recovery groups to model recovery dependencies between tasks. At the middle tier we develop highly effective mappings of dependent tasks to pro-cessor resources through careful tuning of recovery efficiency sensitive parameters. At the bottom tier, we advocate the use of recovery-conscious scheduling by careful serialization of de-pendent tasks, which provides high recovery efficiency without s...
A system architecture called the recovery metaprogram (RMP) is proposed. It separates the applicatio...
User applications and data in volatile memory are usually lost when an operating system crashes beca...
Due to the use of commodity software and hardware, crash-stop and Byzantine failures are likely to b...
In this paper we present a recovery-conscious framework for improving the fault resiliency and reco...
This report aims to describe and improve a system recovery process in large-scale storage systems. I...
Fault-tolerant distributed applications require mechanisms to recover data lost via a process failur...
Enterprises today are dealing with extremely large amounts of critical digital information that cont...
Data availability is critical in distributed storage systems, especially when node failures are prev...
This paper proposes a novel methodology and an architectural framework for handling multiple classes...
Abstract—This paper presents a recovery architecture for in-memory data management systems. Recovery...
Gracefully recovering from software and hardware faults is important to ensuring highly reliable an...
Memory system design is important for providing high reliability and availability. This dissertation...
Much research has gone into making operating systems more amenable to recovery and more resilient to...
Separation logic has been successful at verifying that programs do not crash due to illegal use of r...
The data contents of an information system may be corrupted due to human errors, malicious attacks o...
A system architecture called the recovery metaprogram (RMP) is proposed. It separates the applicatio...
User applications and data in volatile memory are usually lost when an operating system crashes beca...
Due to the use of commodity software and hardware, crash-stop and Byzantine failures are likely to b...
In this paper we present a recovery-conscious framework for improving the fault resiliency and reco...
This report aims to describe and improve a system recovery process in large-scale storage systems. I...
Fault-tolerant distributed applications require mechanisms to recover data lost via a process failur...
Enterprises today are dealing with extremely large amounts of critical digital information that cont...
Data availability is critical in distributed storage systems, especially when node failures are prev...
This paper proposes a novel methodology and an architectural framework for handling multiple classes...
Abstract—This paper presents a recovery architecture for in-memory data management systems. Recovery...
Gracefully recovering from software and hardware faults is important to ensuring highly reliable an...
Memory system design is important for providing high reliability and availability. This dissertation...
Much research has gone into making operating systems more amenable to recovery and more resilient to...
Separation logic has been successful at verifying that programs do not crash due to illegal use of r...
The data contents of an information system may be corrupted due to human errors, malicious attacks o...
A system architecture called the recovery metaprogram (RMP) is proposed. It separates the applicatio...
User applications and data in volatile memory are usually lost when an operating system crashes beca...
Due to the use of commodity software and hardware, crash-stop and Byzantine failures are likely to b...