Abstract. Interactive services often have large-scale par-allel implementations. To deliver fast responses, the me-dian and tail latencies of a service’s components must be low. In this paper, we explore the hardware, OS, and application-level sources of poor tail latency in high throughput servers executing on multi-core machines. We first review the basic queuing theory that governs service latency. Using fine-grained measurements of three different servers (a null RPC service, Memcached, and Nginx) on Linux, we then explore why these servers ex-hibit significantly worse tail latencies than queuing mod-els alone predict. The underlying causes include inter-ference from background processes, request re-ordering caused by poor scheduling or...
Latency requirements for Linux software can be ex-treme. One example is the financial industry: Who-...
Datacenters are the heart of our digital lives. Online applications, such as social-networking and e...
Low latency is critical for interactive networked applications. But while we know how to scale syste...
Interactive services often have large-scale parallel implemen-tations. To deliver fast responses, th...
Interactive services such as Web search, recommendations, games, and finance must respond quickly ...
A major theme of IT in the past decade has been the shift from on-premise hardware to cloud computin...
Interactive services, such as Web search, recommendations, games, and finance, must respond quickly ...
This study aims to examine the effects of transforming a monolithic server system into a microservic...
Low latency is critical for interactive networked appli-cations. But while we know how to scale syst...
Multithreaded multiprocessor systems (MMS) have been proposed to tolerate long latencies for communi...
Time matters. In a networked world, we would like mobile devices to provide a crisp user experience ...
Energy proportionality and workload consolidation are im-portant objectives towards increasing effic...
This article presents some performance evaluating results obtained by measuring computer clusters us...
Abstract – We found that interactive services at Bing have highly variable datacenter-side processin...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Latency requirements for Linux software can be ex-treme. One example is the financial industry: Who-...
Datacenters are the heart of our digital lives. Online applications, such as social-networking and e...
Low latency is critical for interactive networked applications. But while we know how to scale syste...
Interactive services often have large-scale parallel implemen-tations. To deliver fast responses, th...
Interactive services such as Web search, recommendations, games, and finance must respond quickly ...
A major theme of IT in the past decade has been the shift from on-premise hardware to cloud computin...
Interactive services, such as Web search, recommendations, games, and finance, must respond quickly ...
This study aims to examine the effects of transforming a monolithic server system into a microservic...
Low latency is critical for interactive networked appli-cations. But while we know how to scale syst...
Multithreaded multiprocessor systems (MMS) have been proposed to tolerate long latencies for communi...
Time matters. In a networked world, we would like mobile devices to provide a crisp user experience ...
Energy proportionality and workload consolidation are im-portant objectives towards increasing effic...
This article presents some performance evaluating results obtained by measuring computer clusters us...
Abstract – We found that interactive services at Bing have highly variable datacenter-side processin...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Latency requirements for Linux software can be ex-treme. One example is the financial industry: Who-...
Datacenters are the heart of our digital lives. Online applications, such as social-networking and e...
Low latency is critical for interactive networked applications. But while we know how to scale syste...