Distributed key-value systems have been widely used as elemental components of many Internet-scale services at sites such as Amazon, Facebook and Twitter. This thesis examines a system design approach to scale existing key-value systems, both horizontally and vertically, by carefully engineering and integrating techniques that are grounded in recent theory but also informed by underlying architectures and expected workloads in practice. As a case study, we re-design FAWN-KV—a distributed key-value cluster consisting of “wimpy” key-value nodes—to use less memory but achieve higher throughput even in the worst case. First, to improve the worst-case throughput of a FAWN-KV system, we propose a randomized load balancing scheme that can fully ut...
Key-value stores are ubiquitous at all layers of the computational stack, offering constant average...
A dictionary (or map) is a key-value store that requires all keys be uniq...
This paper introduces the FAWN—Fast Array of Wimpy Nodes—cluster architecture for providing fast, sc...
Social communication features on most of today’s largest websites require propagating the data insid...
Common web infrastructure relies on distributed main memory key-value stores to reduce access load o...
As one of the most popular cloud services, data storage has attracted great attention in re...
The sheer volumes of data handled by today's Internet services demand uncompromising scalability fro...
DHT systems are structured overlay networks capable of using P2P resources as a scalable platform fo...
This paper presents a set of architecturally and workload-inspired algorithmic and engineering impro...
Key-value store is an essential component with an increasing demand in many scale-out env...
Common web infrastructure relies on distributed main memory key-value stores to reduce access load ...
Contemporary web sites can store and process very large amounts of data. To provide timely service t...
Load balancing is a critical issue for the efficient operation of peer-to-peer networks. We give two...
Distributed Hash Table (DHT) overlay networks offer an efficient and robust technique for wide-area ...