We develop methodology for a multistage-decision problem with flexible number of stages in which the rewards are survival times that are subject to censoring. We present a novel Q-learning algorithm that is adjusted for censored data and allows a flexible number of stages. We provide finite sample bounds on the generalization error of the policy learned by the algorithm, and show that when the optimal Q-function belongs to the approximation space, the expected survival time for policies obtained by the algorithm converges to that of the optimal policy. We simulate a multistage clinical trial with flexible number of stages and apply the proposed censored-Q-learning algorithm to find individualized treatment regimens. The methodology presente...
Typical regimens for advanced metastatic stage IIIB/IV non-small cell lung cancer (NSCLC) consist of...
Dynamic treatment regimes (DTRs) are sequential decision rules for individual patients that can adap...
In clinical practice, physicians make a series of treatment decisions over the course of a patient’s...
We develop methodology for a multistage-decision problem with flexible number of stages in which the...
An optimal dynamic treatment regime (DTR) is a sequence of treatment decisions that yields the best ...
We develop reinforcement learning trials for discovering individualized treatment regimens for life-...
The focus of this work is to investigate a form of Q-learning using estimating equations for quality...
In a simulation of an advanced generic cancer trial, I use Q-learning, a reinforcement learning algo...
We propose a reinforcement learning method for estimating an optimal dynamic treatment regime for su...
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/115910/1/sim6558.pd
A dynamic treatment regimen incorporates both accrued information and long-term effects of treatment...
Dynamic treatment regimes are a set of decision rules and each treatment decision is tailored over t...
Dynamic treatment regimes are fast becoming an important part of medicine, with the corresponding ch...
A dynamic treatment regimen (DTR) is a set of decision rules to personalize treatments for an indivi...
University of Minnesota Ph.D. dissertation. 2021. Major: Biostatistics. Advisors: Thomas Murray, Dav...
Typical regimens for advanced metastatic stage IIIB/IV non-small cell lung cancer (NSCLC) consist of...
Dynamic treatment regimes (DTRs) are sequential decision rules for individual patients that can adap...
In clinical practice, physicians make a series of treatment decisions over the course of a patient’s...
We develop methodology for a multistage-decision problem with flexible number of stages in which the...
An optimal dynamic treatment regime (DTR) is a sequence of treatment decisions that yields the best ...
We develop reinforcement learning trials for discovering individualized treatment regimens for life-...
The focus of this work is to investigate a form of Q-learning using estimating equations for quality...
In a simulation of an advanced generic cancer trial, I use Q-learning, a reinforcement learning algo...
We propose a reinforcement learning method for estimating an optimal dynamic treatment regime for su...
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/115910/1/sim6558.pd
A dynamic treatment regimen incorporates both accrued information and long-term effects of treatment...
Dynamic treatment regimes are a set of decision rules and each treatment decision is tailored over t...
Dynamic treatment regimes are fast becoming an important part of medicine, with the corresponding ch...
A dynamic treatment regimen (DTR) is a set of decision rules to personalize treatments for an indivi...
University of Minnesota Ph.D. dissertation. 2021. Major: Biostatistics. Advisors: Thomas Murray, Dav...
Typical regimens for advanced metastatic stage IIIB/IV non-small cell lung cancer (NSCLC) consist of...
Dynamic treatment regimes (DTRs) are sequential decision rules for individual patients that can adap...
In clinical practice, physicians make a series of treatment decisions over the course of a patient’s...