We present a new hybrid approach to performance management, combining disparate strengths of Reinforcment Learning (RL) with model-based (e.g. queuing-theoretic) approaches. Our method trains nonlinear function approximators using offline RL on data collected while a model-based policy controls the system. By training offline we avoid potentially poor performance in live online training, while function approximation allows generalization across both states and actions, so that the need for exploratory actions may be greatly reduced. Our results show that, in a prototype resource allocation scenario among multiple web applications, hybrid RL training can achieve significant performance improvements over a variety of initial queuing model-bas...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
This paper studies the use of Reinforcement Learning (RL) policies for optimizing the sequencing of...
This paper presents an online learning scheme based on reinforcement learning and adaptive dynamic p...
This paper considers a novel application domain for rein-forcement learning: that of “autonomic comp...
We consider a load balancing problem with task-server affinity and server-dependent task recurrence,...
Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is imprac...
In a web system, configuration is crucial to the perfor-mance and service availability. It is a chal...
Classical control theory requires a model to be derived for a system, before any control design can ...
A RL agent trained offline for reliability and able to refine its policies during online operation i...
Online Network Resource Allocation (ONRA) for service provisioning is a fundamental problem in commu...
Learning to act optimally in the complex world has long been a major goal in artificial intelligence...
With the rapid advance of information technology, network systems have become increasingly complex a...
Abstract---Reinforcement learning (RL) has become more popular due to promising results in applicati...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an ...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
This paper studies the use of Reinforcement Learning (RL) policies for optimizing the sequencing of...
This paper presents an online learning scheme based on reinforcement learning and adaptive dynamic p...
This paper considers a novel application domain for rein-forcement learning: that of “autonomic comp...
We consider a load balancing problem with task-server affinity and server-dependent task recurrence,...
Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is imprac...
In a web system, configuration is crucial to the perfor-mance and service availability. It is a chal...
Classical control theory requires a model to be derived for a system, before any control design can ...
A RL agent trained offline for reliability and able to refine its policies during online operation i...
Online Network Resource Allocation (ONRA) for service provisioning is a fundamental problem in commu...
Learning to act optimally in the complex world has long been a major goal in artificial intelligence...
With the rapid advance of information technology, network systems have become increasingly complex a...
Abstract---Reinforcement learning (RL) has become more popular due to promising results in applicati...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an ...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
This paper studies the use of Reinforcement Learning (RL) policies for optimizing the sequencing of...
This paper presents an online learning scheme based on reinforcement learning and adaptive dynamic p...