We consider the load balancing problem in large-scale heterogeneous systems with multiple dispatchers. We introduce a general framework called Local-Estimation-Driven (LED). Under this framework, each dispatcher keeps local (possibly outdated) estimates of the queue lengths for all the servers, and the dispatching decision is made purely based on these local estimates. The local estimates are updated via infrequent communications between dispatchers and servers. We derive sufficient conditions for LED policies to achieve throughput optimality and delay optimality in heavy-traffic, respectively. These conditions directly imply delay optimality for many previous local-memory based policies in heavy traffic. Moreover, the results enable us to ...
We consider a system of N identical server pools and a single dispatcher in which tasks with unit-ex...
Abstract—In geographically-distributed systems, communi-cation latencies are non-negligible. The per...
We consider the following distributed service model: jobs with unit mean, general distribution, and ...
We present an overview of scalable load balancing algorithms which provide favorable delay performan...
This electronic version was submitted by the student author. The certified thesis is available in th...
A fundamental problem in large-scale data centers is to reduce the average response time of jobs. Th...
We consider a cluster of heterogeneous servers, modeled as M/G/1 queues with different processing sp...
Load balancing algorithms play a crucial role in delivering robust application performance in data c...
With the rapid increase in the size and volume of cloud services and data centers, architectures wit...
Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such...
[[abstract]]A dynamic load-balancing policy is proposed with a central job dispatcher called the LBC...
In this thesis we consider a system of two heterogeneous servers with a shared queue, and examine a ...
Abstract: In large-scale distributed systems, balancing the load in an efficient way is crucial in o...
The model is motivated by the problem of load distribution in large-scale cloud-based data processin...
Click on the DOI link to access the article (may not be free).Scheduling is one of the most importan...
We consider a system of N identical server pools and a single dispatcher in which tasks with unit-ex...
Abstract—In geographically-distributed systems, communi-cation latencies are non-negligible. The per...
We consider the following distributed service model: jobs with unit mean, general distribution, and ...
We present an overview of scalable load balancing algorithms which provide favorable delay performan...
This electronic version was submitted by the student author. The certified thesis is available in th...
A fundamental problem in large-scale data centers is to reduce the average response time of jobs. Th...
We consider a cluster of heterogeneous servers, modeled as M/G/1 queues with different processing sp...
Load balancing algorithms play a crucial role in delivering robust application performance in data c...
With the rapid increase in the size and volume of cloud services and data centers, architectures wit...
Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such...
[[abstract]]A dynamic load-balancing policy is proposed with a central job dispatcher called the LBC...
In this thesis we consider a system of two heterogeneous servers with a shared queue, and examine a ...
Abstract: In large-scale distributed systems, balancing the load in an efficient way is crucial in o...
The model is motivated by the problem of load distribution in large-scale cloud-based data processin...
Click on the DOI link to access the article (may not be free).Scheduling is one of the most importan...
We consider a system of N identical server pools and a single dispatcher in which tasks with unit-ex...
Abstract—In geographically-distributed systems, communi-cation latencies are non-negligible. The per...
We consider the following distributed service model: jobs with unit mean, general distribution, and ...