Robustness is a fundamental and timeless issue that remains vital to all aspects of computing systems, regardless of the specific platform, architecture, or algorithm design. The issue is also timely: modern computing systems are increasingly built on unreliable substrates. This thesis designs reliable computing techniques for distributed systems, circuits, and networks. We primarily study techniques inspired by coding theory to address robustness issues such as system elasticity, stragglers (slow workers), machine failures, and soft errors, by carefully weaving redundancy into the data and the design of the algorithm. We focus on three aspects of coding-based computation techniques. The first aspect is to design ...
The author describes analogous coding theorems for the more general, interactive, communications req...
For many types of integrated circuits, accepting larger failure rates in compu...
We present an overview of massively parallel deterministic algorithms which combine high fault-toler...
Modern data centers have been providing exponentially increasing computing and storage resources, wh...
The advent of the information age has bestowed upon us three challenges related to the way we deal w...
In this dissertation, the constructions and schemes for flexible coding in distributed systems are i...
We investigate the coded model of fault-tolerant computations introduc...
As an increasing number of modern big data systems utilize horizontal scaling, the general trend in t...
Coded computation techniques provide robustness against straggling workers in distributed computing....
In traditional information processing systems, inference algorithms are designed to collect and proc...
A ubiquitous problem in computer science research is the optimization of computation on large data s...
Coded computation techniques provide robustness against straggling workers in distributed computing....
When a computational task tolerates a relaxation of its specification or when an algorithm tolerates...
Coded Computing presents a novel method of computing that uses coding theory to overcome major bottl...
Distributed systems are rapidly increasing in importance due to the need for scalable computatio...