Many problems in control and signal processing can be formulated as sequential decision problems for general state space models. However, except for some simple models one cannot obtain analytical solutions and has to resort to approximation. In this thesis, we have investigated problems where Se-quential Monte Carlo (SMC) methods can be combined with a gradient based search to provide solutions to online optimisation problems. We summarise the main contributions of the thesis as follows. Chapter 4 focuses on solving the sensor scheduling problem when cast as a controlled HiddenMarkov Model. We consider the case in which the state, observation and action spaces are continuous. This general case is important as it is the natural framework fo...
Abstract: This paper proposes the use of Sequential Monte Carlo (SMC) as the computational engine fo...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
A framework capable of computing optimal control policies for a continuous system in the presence of...
The sensor scheduling problem can be formulated as a controlled hidden Markov model and this paper s...
Abstract — Sensor scheduling has been a topic of interest to the target tracking community for some ...
Abstract — This paper introduces a new approach to solve sensor management problems. Classically sen...
The operation of a variety of natural or man-made systems subject to uncertainty is maintained withi...
Abstract-The increasing use of smart sensors that can dynamically adapt their observations has creat...
The work in this thesis focuses on the development of a Stochastic Model Predictive Control (SMPC) a...
The search for finite-state controllers for partially observable Markov decision processes (POMDPs) ...
Discrete-time stochastic optimal control problems are stated over a finite number of decision stages...
We introduce a new stochastic gradient algorithm, SAAGA, and investigate its employment for solving ...
We consider the problem of "optimal learning" for Markov decision processes with uncertain...
Abstract. This paper proposes the use of Sequential Monte Carlo (SMC) as the computational engine fo...
International audienceThis paper introduces a new approach to solve sensor management problems. Clas...
Abstract: This paper proposes the use of Sequential Monte Carlo (SMC) as the computational engine fo...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
A framework capable of computing optimal control policies for a continuous system in the presence of...
The sensor scheduling problem can be formulated as a controlled hidden Markov model and this paper s...
Abstract — Sensor scheduling has been a topic of interest to the target tracking community for some ...
Abstract — This paper introduces a new approach to solve sensor management problems. Classically sen...
The operation of a variety of natural or man-made systems subject to uncertainty is maintained withi...
Abstract-The increasing use of smart sensors that can dynamically adapt their observations has creat...
The work in this thesis focuses on the development of a Stochastic Model Predictive Control (SMPC) a...
The search for finite-state controllers for partially observable Markov decision processes (POMDPs) ...
Discrete-time stochastic optimal control problems are stated over a finite number of decision stages...
We introduce a new stochastic gradient algorithm, SAAGA, and investigate its employment for solving ...
We consider the problem of "optimal learning" for Markov decision processes with uncertain...
Abstract. This paper proposes the use of Sequential Monte Carlo (SMC) as the computational engine fo...
International audienceThis paper introduces a new approach to solve sensor management problems. Clas...
Abstract: This paper proposes the use of Sequential Monte Carlo (SMC) as the computational engine fo...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
A framework capable of computing optimal control policies for a continuous system in the presence of...