Finding an effective medical treatment often requires a search by trial and error. Making this search more efficient by minimizing the number of unnecessary trials could lower both costs and patient suffering. We formalize this problem as learning a policy for finding a near-optimal treatment in a minimum number of trials using a causal inference framework. We give a model-based dynamic programming algorithm which learns from observational data while being robust to unmeasured confounding. To reduce time complexity, we suggest a greedy algorithm which bounds the near-optimality constraint. The methods are evaluated on synthetic and real-world healthcare data and compared to model-free reinforcement learning. We find that our methods compare...
Many widely-used causal discovery methods such as Greedy Equivalent Search (GES), although with asym...
In recent years, we have investigated algorithmic methods for automatically discovering and optimizi...
We develop reinforcement learning trials for discovering individualized treatment regimens for life-...
Policy learning from observational data seeks to extract personalized interventions from passive int...
The proliferation of digitally-available medical data has enabled a new paradigm of decision-making ...
Off-policy reinforcement learning enables near-optimal policy from suboptimal experience, thereby pr...
From observational data alone, a causal DAG is only identifiable up to Markov equivalence. Intervent...
We propose a novel approach for constructing effective treatment policies when the observed data is ...
This paper studies the problem of learning diagnostic policies from training examples. A diagnostic ...
Estimation of individual treatment effects is commonly used as the basis for contextual decision mak...
• This paper reviews and develops methods for implementing in practice recent ideas in the field of ...
Precision medicine allows personalized treatment regime for patients with distinct clinical history ...
<p>The vision for precision medicine is to use individual patient characteristics to inform a person...
<div><p>Dynamic treatment regimes (DTRs) are sequential decision rules for individual patients that ...
Causal structure learning is a key problem in many domains. Causal structures can be learnt by perfo...
Many widely-used causal discovery methods such as Greedy Equivalent Search (GES), although with asym...
In recent years, we have investigated algorithmic methods for automatically discovering and optimizi...
We develop reinforcement learning trials for discovering individualized treatment regimens for life-...
Policy learning from observational data seeks to extract personalized interventions from passive int...
The proliferation of digitally-available medical data has enabled a new paradigm of decision-making ...
Off-policy reinforcement learning enables near-optimal policy from suboptimal experience, thereby pr...
From observational data alone, a causal DAG is only identifiable up to Markov equivalence. Intervent...
We propose a novel approach for constructing effective treatment policies when the observed data is ...
This paper studies the problem of learning diagnostic policies from training examples. A diagnostic ...
Estimation of individual treatment effects is commonly used as the basis for contextual decision mak...
• This paper reviews and develops methods for implementing in practice recent ideas in the field of ...
Precision medicine allows personalized treatment regime for patients with distinct clinical history ...
<p>The vision for precision medicine is to use individual patient characteristics to inform a person...
<div><p>Dynamic treatment regimes (DTRs) are sequential decision rules for individual patients that ...
Causal structure learning is a key problem in many domains. Causal structures can be learnt by perfo...
Many widely-used causal discovery methods such as Greedy Equivalent Search (GES), although with asym...
In recent years, we have investigated algorithmic methods for automatically discovering and optimizi...
We develop reinforcement learning trials for discovering individualized treatment regimens for life-...