Seeing the forest and the trees: Tackling Distributed Systems Problems by Querying Observations of Executions

Ramasubramanian, Kamala

Publication date

January 2022

Publisher

eScholarship, University of California

Abstract

Distributed systems are ubiquitous but continue to be challenging to understand, build, and troubleshoot. Fundamentally, reasoning about distributed system behaviors is hard due to the effects of partial failures and nondeterminism in system executions. For example, we expect systems to remain available even if some number of replicas fail. These problems are exacerbated by the dynamic nature and scale of production systems today. Tooling support has lagged behind the pace at which systems are being deployed, urgently requiring more research in this space.Our overarching claim is that many common distributed systems problems such as improving fault tolerance or debugging failures can be addressed by querying observations of executions. Sinc...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Seeing the forest and the trees: Tackling Distributed Systems Problems by Querying Observations of Executions

Abstract

Extracted data

Seeing the forest and the trees: Tackling Distributed Systems Problems by Querying Observations of Executions

Abstract

Extracted data

Related items

Related items