Tales of the Tail: Hardware, OS, and Application-level Sources of Tail Latency

Jialin Li
Naveen Kr. Sharma
Dan R. K. Ports
Steven D. Gribble

Publication date

October 2015

Abstract

Abstract. Interactive services often have large-scale par-allel implementations. To deliver fast responses, the me-dian and tail latencies of a service’s components must be low. In this paper, we explore the hardware, OS, and application-level sources of poor tail latency in high throughput servers executing on multi-core machines. We first review the basic queuing theory that governs service latency. Using fine-grained measurements of three different servers (a null RPC service, Memcached, and Nginx) on Linux, we then explore why these servers ex-hibit significantly worse tail latencies than queuing mod-els alone predict. The underlying causes include inter-ference from background processes, request re-ordering caused by poor scheduling or...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Tales of the Tail: Hardware, OS, and Application-level Sources of Tail Latency

Abstract

Extracted data

Tales of the Tail: Hardware, OS, and Application-level Sources of Tail Latency

Abstract

Extracted data

Related items

Related items