Advances in hardware have enabled many long-running applications to execute entirely in main memory. With the emergence of cloud computing, thousands of machines could be made available to deploy such applications with lowered operational and maintenance costs. While achieving substantially better performance, these applications have encountered new challenges in achieving fault tolerance; i.e., to ensure durability in the event of a crash. In addition, many of these applications, such as massively multiplayer online games, main-memory OLTP systems, main-memory search engine and deterministic transaction processing systems, must sustain extremely high update rates - often hundreds of thousands of updates per second. They also demand extreme...
In this paper we present recovery techniques for distributed main-memory databases, specically for c...
Fault tolerance in cloud computing is considered as one of the most vital issues to deliver reliable...
The next generation of capability-class, massively parallel processing (MPP) systems is expected to ...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Multicore in-memory databases for modern machines can support extraordinarily high transaction rates...
: We propose a method to incorporate coordinated checkpointing and rollback in high performance comp...
Checkpointing has been widely adopted in support of fault-tolerance and job migration essential for ...
In this paper we present recovery techniques for distributed main-memory databases, specifically for...
As the size of supercomputers increases, the probability of system failure grows substantially, posi...
Clouds are powerful computer centers that provide computing and storage facilities that can be remot...
By leveraging the enormous amount of computational capabilities, scientists today are being able to ...
Multicore in-memory databases for modern machines can support extraordinarily high transaction rates...
In this paper, we aim at optimizing fault-tolerance techniques based on a checkpointing/restart mech...
International audienceAs High Performance platforms (Clusters, Grids, etc.) continue to grow in size...
Memory devices represent a key component of datacenter total cost of ownership (TCO), and techniques...
In this paper we present recovery techniques for distributed main-memory databases, specically for c...
Fault tolerance in cloud computing is considered as one of the most vital issues to deliver reliable...
The next generation of capability-class, massively parallel processing (MPP) systems is expected to ...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Multicore in-memory databases for modern machines can support extraordinarily high transaction rates...
: We propose a method to incorporate coordinated checkpointing and rollback in high performance comp...
Checkpointing has been widely adopted in support of fault-tolerance and job migration essential for ...
In this paper we present recovery techniques for distributed main-memory databases, specifically for...
As the size of supercomputers increases, the probability of system failure grows substantially, posi...
Clouds are powerful computer centers that provide computing and storage facilities that can be remot...
By leveraging the enormous amount of computational capabilities, scientists today are being able to ...
Multicore in-memory databases for modern machines can support extraordinarily high transaction rates...
In this paper, we aim at optimizing fault-tolerance techniques based on a checkpointing/restart mech...
International audienceAs High Performance platforms (Clusters, Grids, etc.) continue to grow in size...
Memory devices represent a key component of datacenter total cost of ownership (TCO), and techniques...
In this paper we present recovery techniques for distributed main-memory databases, specically for c...
Fault tolerance in cloud computing is considered as one of the most vital issues to deliver reliable...
The next generation of capability-class, massively parallel processing (MPP) systems is expected to ...