Interactive services often have large-scale parallel implementations. To deliver fast responses, the median and tail latencies of a service’s components must be low. In this paper, we explore the hardware, OS, and application-level sources of poor tail latency in high throughput servers executing on multi-core machines. We model these network services as a queuing system in order to establish the best-achievable latency distribution. Using fine-grained measurements of three different servers (a null RPC service, Memcached, and Nginx) on Linux, we then explore why these servers exhibit significantly worse tail latencies than queuing models alone predict. The underlying causes include interference from background processes, request re-ord...
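The queuing-model baseline mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which queuing model is used, so this example assumes the simplest one, a single-server M/M/1 FIFO queue, where response time is exponentially distributed with rate (service rate minus arrival rate). The function name `mm1_latency_percentile` and all numeric values are hypothetical and chosen only for illustration.

```python
import math


def mm1_latency_percentile(arrival_rate, service_rate, p):
    """Return the p-quantile (0 <= p < 1) of response time, in seconds,
    for a stable M/M/1 FIFO queue (an assumed model, not the paper's)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    if arrival_rate >= service_rate:
        raise ValueError("unstable queue: arrival_rate must be < service_rate")
    # In M/M/1 FIFO, time in system is exponential with rate (mu - lambda),
    # so the quantile is the inverse of the exponential CDF.
    return -math.log(1.0 - p) / (service_rate - arrival_rate)


# Hypothetical server that can process 100,000 requests per second.
service_rate = 100_000.0
for utilization in (0.5, 0.9):
    arrival_rate = utilization * service_rate
    median = mm1_latency_percentile(arrival_rate, service_rate, 0.50)
    p99 = mm1_latency_percentile(arrival_rate, service_rate, 0.99)
    print(f"utilization={utilization:.1f} "
          f"median={median * 1e6:.1f}us p99={p99 * 1e6:.1f}us")
```

Under this assumed model, the ratio of the 99th percentile to the median is fixed at ln(100)/ln(2) regardless of load, which gives one reference point against which measured tail latencies could be compared.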
Concerns about propagation delay have dominated the discussion of latency, bandwidth and their effec...
Datacenters are the heart of our digital lives. Online applications, such as social-networking and e...
In this thesis, we begin by analyzing the increasing trend of running large-scale services on Wareho...
Interactive services such as Web search, recommendations, games, and finance must respond quickly ...
A major theme of IT in the past decade has been the shift from on-premise hardware to cloud computin...
Energy proportionality and workload consolidation are important objectives towards increasing effic...
Low latency is critical for interactive networked applications. But while we know how to scale syst...
Interactive services, such as Web search, recommendations, games, and finance, must respond quickly ...
This article presents some performance evaluation results obtained by measuring computer clusters us...
Multithreaded multiprocessor systems (MMS) have been proposed to tolerate long latencies for communi...
This study aims to examine the effects of transforming a monolithic server system into a microservic...
Energy proportionality and workload consolidation are important objectives towards increasing effici...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We present the results of a parametric study of the buffer size needed to prevent overflow or loss i...