Devising guidance on how to assign individuals to treatment is an important goal in empirical research. In practice, individuals often arrive sequentially, and the planner faces various constraints such as limited budget/capacity, or borrowing constraints, or the need to place people in a queue. For instance, a governmental body may receive a budget outlay at the beginning of a year, and it may need to decide how best to allocate resources within the year to individuals who arrive sequentially. In this and other examples involving inter-temporal trade-offs, previous work on devising optimal policy rules in a static context is either not applicable, or sub-optimal. Here we show how one can use offline observational data to estimate an optima...
We study a dynamic allocation problem in which $T$ sequentially arriving divisible resources need to...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
We characterize the incentive compatible, constrained e ¢ cient pol-icy ("second-best") in...
This paper studies statistical decisions for dynamic treatment assignment problems. Many policies in...
In a sequential decision-making problem, off-policy evaluation estimates the expected cumulative rew...
The quintessential model-based reinforcement-learning agent iteratively refines its estimates or pri...
I study the problem of a decision maker choosing a policy to allocate treatment to a heterogeneous p...
In reinforcement learning (RL), the goal is to obtain an optimal policy, for which the optimality cr...
AI methods are used in societally important settings, ranging from credit to employment to housing, ...
We study fair multi-objective reinforcement learning in which an agent must learn a policy that simu...
Decision makers often aim to learn a treatment assignment policy under a capacity constraint on the ...
Dynamic treatment regimes are treatment allocations tailored to heterogeneous individuals (e.g., via...
This paper proposes a novel method to estimate individualised treatment assignment rules. The method...
Reinforcement learning is a family of machine learning algorithms, in which the system learns to mak...
Consider a setting in which a policy maker assigns subjects to treatments, observing each outcome be...
We study a dynamic allocation problem in which $T$ sequentially arriving divisible resources need to...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
We characterize the incentive compatible, constrained e ¢ cient pol-icy ("second-best") in...
This paper studies statistical decisions for dynamic treatment assignment problems. Many policies in...
In a sequential decision-making problem, off-policy evaluation estimates the expected cumulative rew...
The quintessential model-based reinforcement-learning agent iteratively refines its estimates or pri...
I study the problem of a decision maker choosing a policy to allocate treatment to a heterogeneous p...
In reinforcement learning (RL), the goal is to obtain an optimal policy, for which the optimality cr...
AI methods are used in societally important settings, ranging from credit to employment to housing, ...
We study fair multi-objective reinforcement learning in which an agent must learn a policy that simu...
Decision makers often aim to learn a treatment assignment policy under a capacity constraint on the ...
Dynamic treatment regimes are treatment allocations tailored to heterogeneous individuals (e.g., via...
This paper proposes a novel method to estimate individualised treatment assignment rules. The method...
Reinforcement learning is a family of machine learning algorithms, in which the system learns to mak...
Consider a setting in which a policy maker assigns subjects to treatments, observing each outcome be...
We study a dynamic allocation problem in which $T$ sequentially arriving divisible resources need to...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
We characterize the incentive compatible, constrained e ¢ cient pol-icy ("second-best") in...