University of Minnesota Ph.D. dissertation. May 2020. Major: Statistics. Advisor: Yuhong Yang. 1 computer file (PDF); ix, 154 pages.Contextual bandit problems are important for sequential learning in various practical settings that require balancing the exploration-exploitation trade-off to maximize total rewards. Motivated by applications in health care, we consider a multi-armed bandit setting with covariates and allow for delay in observing the rewards (treatment outcomes) as would most likely be the case in a medical setting. We focus on developing randomized allocation strategies that incorporate delayed rewards using nonparametric regression methods for estimating the mean reward functions. Although there has been substantial work on...
The stochastic generalised linear bandit is a well-understood model for sequential decision-making p...
Suppose two treatments with binary responses are available for patients with some disease. Sequentia...
The data explosion and development of artificial intelligence (AI) has fueled the demand for recomme...
Contextual multi-armed bandit (MAB) algorithms have been shown promising for maximizing cumulative r...
Multi-armed bandit problem is an important optimization game that requires an exploration-exploitati...
We study a multi-armed bandit problem in a setting where covariates are available. We take a nonpara...
We propose a new sequential decision-making setting, combining key aspects of two established online...
University of Minnesota Ph.D. dissertation. July 2014. Major: Statistics. Advisor: Yuhong Yang. 1 co...
The bandit problem models a sequential decision process between a player and an environment. In the ...
The stochastic generalised linear bandit is a well-understood model for sequential decision-making p...
Contextual bandits are canonical models for sequential decision-making under uncertainty in environm...
We consider a bandit problem which involves sequential sampling from two populations (arms). Each ar...
We study the problem of using causal models to improve the rate at which good interventions can be l...
In many applications, e.g. in healthcare and e-commerce, the goal of a contextual bandit may be to l...
Consider a Bayesian sequential allocation problem that incorporates a covariate. The goal is to maxi...
The stochastic generalised linear bandit is a well-understood model for sequential decision-making p...
Suppose two treatments with binary responses are available for patients with some disease. Sequentia...
The data explosion and development of artificial intelligence (AI) has fueled the demand for recomme...
Contextual multi-armed bandit (MAB) algorithms have been shown promising for maximizing cumulative r...
Multi-armed bandit problem is an important optimization game that requires an exploration-exploitati...
We study a multi-armed bandit problem in a setting where covariates are available. We take a nonpara...
We propose a new sequential decision-making setting, combining key aspects of two established online...
University of Minnesota Ph.D. dissertation. July 2014. Major: Statistics. Advisor: Yuhong Yang. 1 co...
The bandit problem models a sequential decision process between a player and an environment. In the ...
The stochastic generalised linear bandit is a well-understood model for sequential decision-making p...
Contextual bandits are canonical models for sequential decision-making under uncertainty in environm...
We consider a bandit problem which involves sequential sampling from two populations (arms). Each ar...
We study the problem of using causal models to improve the rate at which good interventions can be l...
In many applications, e.g. in healthcare and e-commerce, the goal of a contextual bandit may be to l...
Consider a Bayesian sequential allocation problem that incorporates a covariate. The goal is to maxi...
The stochastic generalised linear bandit is a well-understood model for sequential decision-making p...
Suppose two treatments with binary responses are available for patients with some disease. Sequentia...
The data explosion and development of artificial intelligence (AI) has fueled the demand for recomme...