© 2018 Society for Industrial and Applied Mathematics. We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters.Our goal is to maximize an objective function represented by a nonlinear parametric belief model,while simultaneously learning the unknown parameters, by guiding a sequential experimentationprocess which is expensive. We overcome the problem of computing the expected value of an experiment, which is computationally intractable, by using a sampled approximation, which helps toguide experiments but does not provide an accurate estimate of the unknown parameters. We thenintroduce a resampling process which allo...
The use of machine learning algorithms frequently involves careful tuning of learning parameters and...
We consider the Bayesian formulation of a number of learning problems, where we focus on sequential ...
Real world systems often have parameterized controllers which can be tuned to improve performance. B...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
Research on the implications of learning-by-doing has typically been restricted to specifications of...
Optimal experimental design (OED) seeks experiments expected to yield the most useful data for some ...
In this paper we propose a new framework for learning from large scale datasets based on iterative l...
We propose a general framework for machine learning based optimization under uncertainty. Our approa...
We consider the problem of selecting the best of a finite but very large set of alternatives. Each a...
Modern technological advances have prompted massive scale data collection in manymodern fields such ...
Using the expression for the unnormalized nonlinear filter for a hidden Markov model, we develop a d...
This paper introduces a set of algorithms for Monte-Carlo Bayesian reinforcement learning. Firstly, ...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
We consider the problem of a learning mechanism (for example, a robot) locating a point on a line wh...
The use of machine learning algorithms frequently involves careful tuning of learning parameters and...
The use of machine learning algorithms frequently involves careful tuning of learning parameters and...
We consider the Bayesian formulation of a number of learning problems, where we focus on sequential ...
Real world systems often have parameterized controllers which can be tuned to improve performance. B...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
Research on the implications of learning-by-doing has typically been restricted to specifications of...
Optimal experimental design (OED) seeks experiments expected to yield the most useful data for some ...
In this paper we propose a new framework for learning from large scale datasets based on iterative l...
We propose a general framework for machine learning based optimization under uncertainty. Our approa...
We consider the problem of selecting the best of a finite but very large set of alternatives. Each a...
Modern technological advances have prompted massive scale data collection in manymodern fields such ...
Using the expression for the unnormalized nonlinear filter for a hidden Markov model, we develop a d...
This paper introduces a set of algorithms for Monte-Carlo Bayesian reinforcement learning. Firstly, ...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
We consider the problem of a learning mechanism (for example, a robot) locating a point on a line wh...
The use of machine learning algorithms frequently involves careful tuning of learning parameters and...
The use of machine learning algorithms frequently involves careful tuning of learning parameters and...
We consider the Bayesian formulation of a number of learning problems, where we focus on sequential ...
Real world systems often have parameterized controllers which can be tuned to improve performance. B...