This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially-observed sequential decision processes. The algorithm is tested in the domain of robot navigation and exploration under uncertainty. In such a setting, the expected cost, that must be minimized, is a function of the belief state (filtering distribution). This filtering distribution is in turn nonlinear and subject to discontinuities, which arise because constraints in the robot motion and control models. As a result, the expected cost is non-differentiable and very expensive to simulate. The new algorithm overcomes the first difficulty and reduces the number of required simulations as follows. First, it assumes that we have carried out prev...
This paper presents a new problem solving approach that is able to generate optimal policy solution ...
Learning from demonstration (LfD) is an intuitive framework allowing non-expert users to easily (re-...
As the complexity of robots and other autonomous systems increases, it becomes more important that t...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Abstract—Autonomous learning has been a promising direction in control and robotics for more than a ...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
In the real world, a robotic system must operate in the presence of motion and sensing uncertainty. ...
We consider planning for mobile robots conducting missions in realworld domains where a priori unkno...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
When monitoring spatial phenomena, such as the ecological condition of a river, deciding where to ma...
Model-free reinforcement learning based methods such as Proximal Policy Optimization, or Q-learning ...
This paper presents a new problem solving approach that is able to generate optimal policy solution ...
Learning from demonstration (LfD) is an intuitive framework allowing non-expert users to easily (re-...
As the complexity of robots and other autonomous systems increases, it becomes more important that t...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Abstract—Autonomous learning has been a promising direction in control and robotics for more than a ...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
In the real world, a robotic system must operate in the presence of motion and sensing uncertainty. ...
We consider planning for mobile robots conducting missions in realworld domains where a priori unkno...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
When monitoring spatial phenomena, such as the ecological condition of a river, deciding where to ma...
Model-free reinforcement learning based methods such as Proximal Policy Optimization, or Q-learning ...
This paper presents a new problem solving approach that is able to generate optimal policy solution ...
Learning from demonstration (LfD) is an intuitive framework allowing non-expert users to easily (re-...
As the complexity of robots and other autonomous systems increases, it becomes more important that t...