Information relaxation and duality in Markov decision processes have been studied recently by several researchers with the goal to derive dual bounds on the value function. In this paper we extend this dual formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose penalty to punish the access to the information in advance. We establish the weak duality, strong duality and complementary slackness results in a parallel way as those in Markov decision processes. We explore the structure of the optimal penalties and expose the connection between Markov decision processes and controlled Markov diffusions. We demonstrate the use of the dual ...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
This paper considers Markov decision processes (MDPs) with unbounded rates, as a function of state. ...
We consider a threshold probability optimization problem over controlled Markov chains. The problem ...
Information relaxation and duality in Markov decision processes have been studied recently by severa...
Dynamic programming is a principal method for analyzing stochastic optimal control problems. However...
We consider infinite horizon stochastic dynamic programs with discounted costs and study how to use ...
We consider the problem of producing lower bounds on the optimal cost-to-go function of a Markov dec...
We study two important generalizations of dynamic portfolio choice problems: a portfolio choice prob...
We consider the information relaxation approach for calculating performance bounds for stochastic dy...
We consider multistage decision processes where criterion function is an expectation of minimum func...
In this paper, we study the dual control approach for the optimal asset allocation problem in a cont...
International audienceIn this note, we propose two different approaches to rigorously justify a pseu...
A risk minimization problem is considered in a continuous-time Markovian regime-switching financial ...
In this work, we study a dynamic portfolio optimization problem related to pairs trading, which is a...
This dissertation consists of three main essays in which we study important problems in engineering ...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
This paper considers Markov decision processes (MDPs) with unbounded rates, as a function of state. ...
We consider a threshold probability optimization problem over controlled Markov chains. The problem ...
Information relaxation and duality in Markov decision processes have been studied recently by severa...
Dynamic programming is a principal method for analyzing stochastic optimal control problems. However...
We consider infinite horizon stochastic dynamic programs with discounted costs and study how to use ...
We consider the problem of producing lower bounds on the optimal cost-to-go function of a Markov dec...
We study two important generalizations of dynamic portfolio choice problems: a portfolio choice prob...
We consider the information relaxation approach for calculating performance bounds for stochastic dy...
We consider multistage decision processes where criterion function is an expectation of minimum func...
In this paper, we study the dual control approach for the optimal asset allocation problem in a cont...
International audienceIn this note, we propose two different approaches to rigorously justify a pseu...
A risk minimization problem is considered in a continuous-time Markovian regime-switching financial ...
In this work, we study a dynamic portfolio optimization problem related to pairs trading, which is a...
This dissertation consists of three main essays in which we study important problems in engineering ...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
This paper considers Markov decision processes (MDPs) with unbounded rates, as a function of state. ...
We consider a threshold probability optimization problem over controlled Markov chains. The problem ...