This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially-observed sequential decision processes. The algorithm is tested in the domain of robot navigation and exploration under uncertainty. In such a setting, the expected cost, that must be minimized, is a function of the belief state (filtering distribution). This filtering distribution is in turn nonlinear and depends on an observation model with discontinuities. These discontinuities arise because the robot has a finite field of view and the environment may contain occluding obstacles. As a result, the expected cost is non-differentiable and very expensive to simulate. The new algorithm overcomes the first difficulty and reduces the number of ...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
Abstract — Policy gradient algorithms are among the few learning methods successfully applied to dem...
This paper presents a new problem solving approach that is able to generate optimal policy solution ...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Abstract—Autonomous learning has been a promising direction in control and robotics for more than a ...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
When monitoring spatial phenomena, such as the ecological condition of a river, deciding where to ma...
We consider planning for mobile robots conducting missions in realworld domains where a priori unkno...
In the real world, a robotic system must operate in the presence of motion and sensing uncertainty. ...
Recent research in robot exploration and mapping has focused on sampling hotspot fields, which often...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
Abstract — Policy gradient algorithms are among the few learning methods successfully applied to dem...
This paper presents a new problem solving approach that is able to generate optimal policy solution ...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Abstract—Autonomous learning has been a promising direction in control and robotics for more than a ...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
Autonomous learning has been a promising direction in control and robotics for more than a decade si...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
When monitoring spatial phenomena, such as the ecological condition of a river, deciding where to ma...
We consider planning for mobile robots conducting missions in realworld domains where a priori unkno...
In the real world, a robotic system must operate in the presence of motion and sensing uncertainty. ...
Recent research in robot exploration and mapping has focused on sampling hotspot fields, which often...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
Abstract — Policy gradient algorithms are among the few learning methods successfully applied to dem...
This paper presents a new problem solving approach that is able to generate optimal policy solution ...