The optimal stopping problem is concerned with finding an optimal policy to stop a stochastic process to maximize the expected return. This problem is critical in stochastic control and can be found in many different fields, such as operations research and finance. In this thesis, we model the optimal stopping problem as a Markov decision process and propose to solve it via a model-free value-based reinforcement learning approach, named delta-V. We consider two types of optimal stopping problems: the standard optimal stopping and the regenerative optimal stopping, which differ in their transition dynamics once the stopping action is executed. Then, we incorporate the unique structural properties of the optimal stopping problem into our algo...
Abstract. Optimal stopping of stochastic processes having both absolutely continuous and singular be...
A large class of problems of sequential decision making under uncertainty, of which the underlying p...
Abstract We consider an optimal stopping problem with a discrete time stochastic process where a cri...
A linear programming formulation of the optimal stopping problem for Markov decision processes is ap...
In this paper, we study the optimal stopping problem in the so-called exploratory framework, in whic...
Accepted at the 2020 American Control Conference.The objective in this paper is to obtain fast conve...
We consider incorporating action elimination procedures in reinforcement learning algorithms. We sug...
We present a brief review of optimal stopping and dynamic programming using minimal technical tools ...
In this paper we develop a deep learning method for optimal stopping problems which directly learns ...
This paper presents new machine learning approaches to approximate the solutions of optimal stopping...
This thesis deals with the explicit solution of optimal stopping problems with infinite time horizon...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
Replaces Memorandum COSO 74-12. In this paper we study the problem of the optimal stopping of a Mark...
This paper considers the optimal stopping problem for continuous-time Markov processes. We describe ...
The framework of dynamic programming (DP) and reinforcement learning (RL) can be used to express imp...
Abstract. Optimal stopping of stochastic processes having both absolutely continuous and singular be...
A large class of problems of sequential decision making under uncertainty, of which the underlying p...
Abstract We consider an optimal stopping problem with a discrete time stochastic process where a cri...
A linear programming formulation of the optimal stopping problem for Markov decision processes is ap...
In this paper, we study the optimal stopping problem in the so-called exploratory framework, in whic...
Accepted at the 2020 American Control Conference.The objective in this paper is to obtain fast conve...
We consider incorporating action elimination procedures in reinforcement learning algorithms. We sug...
We present a brief review of optimal stopping and dynamic programming using minimal technical tools ...
In this paper we develop a deep learning method for optimal stopping problems which directly learns ...
This paper presents new machine learning approaches to approximate the solutions of optimal stopping...
This thesis deals with the explicit solution of optimal stopping problems with infinite time horizon...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
Replaces Memorandum COSO 74-12. In this paper we study the problem of the optimal stopping of a Mark...
This paper considers the optimal stopping problem for continuous-time Markov processes. We describe ...
The framework of dynamic programming (DP) and reinforcement learning (RL) can be used to express imp...
Abstract. Optimal stopping of stochastic processes having both absolutely continuous and singular be...
A large class of problems of sequential decision making under uncertainty, of which the underlying p...
Abstract We consider an optimal stopping problem with a discrete time stochastic process where a cri...