Learning and optimization of stochastic systems is a multi-disciplinary area that attracts wide attentions from researchers in control systems, operations research and computer science. Areas such as perturbation analysis (PA), Markov decision process (MDP), and reinforcement learning (RL) share the common goal. In this paper, we offer an overview of the area of learning and optimization from a system theoretic perspective. We show how these seemly different disciplines are closely related, how one topic leads to the others, and how this perspective may lead to new research topics and new results, and how the performance sensitivity formulas can serve as the basis for learning and optimization
Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning introdu...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
Stochastic Recursive Algorithms for Optimization presents algorithms for constrained and unconstrain...
We introduce a sensitivity-based view to the area of learning and optimization of stochastic dynamic...
The goals of perturbation analysis (PA), Markov decision processes (MDPs), and reinforcement learnin...
Recent research indicates that perturbation analysis (PA), Markov decision process (MDP). and reinfo...
Performance optimization is vital in the design and operation of modern engineering systems. This bo...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
We study the structure of sample paths of Markov systems by using performance potentials as the fund...
The aim of the talk is to discuss the role of stochastic optimization techniques in designing learni...
In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stoc...
2014-10-14This dissertation addresses some problems in the area of learning, optimization and decisi...
Reinforcement learning is a learning paradigm concerned with learning to control a system so as to m...
We first illustrate the possible limitations of the widely-used Markov model and then introduce the ...
The topic of this thesis is stochastic optimal control and reinforcement learning. Our aim is to uni...
Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning introdu...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
Stochastic Recursive Algorithms for Optimization presents algorithms for constrained and unconstrain...
We introduce a sensitivity-based view to the area of learning and optimization of stochastic dynamic...
The goals of perturbation analysis (PA), Markov decision processes (MDPs), and reinforcement learnin...
Recent research indicates that perturbation analysis (PA), Markov decision process (MDP). and reinfo...
Performance optimization is vital in the design and operation of modern engineering systems. This bo...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
We study the structure of sample paths of Markov systems by using performance potentials as the fund...
The aim of the talk is to discuss the role of stochastic optimization techniques in designing learni...
In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stoc...
2014-10-14This dissertation addresses some problems in the area of learning, optimization and decisi...
Reinforcement learning is a learning paradigm concerned with learning to control a system so as to m...
We first illustrate the possible limitations of the widely-used Markov model and then introduce the ...
The topic of this thesis is stochastic optimal control and reinforcement learning. Our aim is to uni...
Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning introdu...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
Stochastic Recursive Algorithms for Optimization presents algorithms for constrained and unconstrain...