Reward design is a fundamental problem in reinforcement learning (RL). A misspecified or poorly designed reward can result in low sample efficiency and undesired behaviors. In this paper, we propose the idea of programmatic reward design, i.e. using programs to specify the reward functions in RL environments. Programs allow human engineers to express sub-goals and complex task scenarios in a structured and interpretable way. The challenge of programmatic reward design, however, is that while humans can provide the high-level structures, properly setting the low-level details, such as the right amount of reward for a specific sub-task, remains difficult. A major contribution of this paper is a probabilistic framework that can infer the best ...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...
Applying conventional reinforcement to complex domains requires the use of an overly simplified task...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...
In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance...
Reinforcement learning involves the study of how to solve sequential decision-making problems using ...
Reinforcement learning (RL) is a machine learning technique that has been increasingly used in robot...
The desire to build good systems in the face of complex societal effects requires a dynamic approach...
The task of learning a reward function from expert demonstrations suffers from high sample complexit...
This thesis studies algorithms for teaching autonomous agents to complete tasks through trial and er...
Reinforcement learning (RL) is a machine learning technique whereby the controller learns the contro...
In reinforcement learning, reward is used to guide the learning process. The reward is often designe...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
To convey desired behavior to a Reinforcement Learning (RL) agent, a designer must choose a reward f...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
The reward function is considered as the critical component for its effect of evaluating the action ...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...
Applying conventional reinforcement to complex domains requires the use of an overly simplified task...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...
In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance...
Reinforcement learning involves the study of how to solve sequential decision-making problems using ...
Reinforcement learning (RL) is a machine learning technique that has been increasingly used in robot...
The desire to build good systems in the face of complex societal effects requires a dynamic approach...
The task of learning a reward function from expert demonstrations suffers from high sample complexit...
This thesis studies algorithms for teaching autonomous agents to complete tasks through trial and er...
Reinforcement learning (RL) is a machine learning technique whereby the controller learns the contro...
In reinforcement learning, reward is used to guide the learning process. The reward is often designe...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
To convey desired behavior to a Reinforcement Learning (RL) agent, a designer must choose a reward f...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
The reward function is considered as the critical component for its effect of evaluating the action ...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...
Applying conventional reinforcement to complex domains requires the use of an overly simplified task...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...