A Gossip-Style Failure Detection Service

Van Renesse, Robbert
Minsky, Yaron
Hayden, Mark

Publication date

May 1998

Publisher

SAGE Publications

Abstract

Failure Detection is valuable for system management, replication, load balancing, and other distributed services. To date, Failure Detection Services scale badly in the number of members that are being monitored. This paper describes a new protocol based on gossiping that does scale well and provides timely detection. We analyze the protocol, and then extend it to discover and leverage the underlying network topology for much improved resource utilization. We then combine it with another protocol, based on broadcast, that is used to handle partition failures

Extracted data

We use cookies to provide a better user experience.

Data Protection

A Gossip-Style Failure Detection Service

Abstract

Extracted data

A Gossip-Style Failure Detection Service

Abstract

Extracted data

Related items

Related items