Partially Observable Monte-Carlo Planning (POMCP) is a powerful online algorithm able to generate approximate policies for large Partially Observable Markov Decision Processes. The online nature of this method supports scalability by avoiding complete policy representation. The lack of an explicit representation however hinders policy interpretability and makes policy verification very complex. In this work, we propose two contributions. The first is a method for identifying unexpected actions selected by POMCP with respect to expert prior knowledge of the task. The second is a shielding approach that prevents POMCP from selecting unexpected actions. The first method is based on Satisfiability Modulo Theory (SMT). It inspects traces (i.e., ...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
The framework of partially observable Markov decision processes (POMDPs) offers a standard approach ...
In automated planning, action preconditions are boolean-valued formulas, which check whether a given...
Partially Observable Monte-Carlo Planning (POMCP) is a powerful online algorithm able to generate ap...
Partially Observable Monte Carlo Planning (POMCP) is a powerful online algorithm that can generate a...
Partially Observable Monte-Carlo Planning (POMCP) is a powerful online algorithm able to generate ap...
Partially Observable Monte-Carlo Planning (POMCP) is a powerful online algorithm that can generate o...
Partially Observable Monte Carlo Planning is a recently proposed online planning algorithm which mak...
Autonomous mobile robots employed in industrial applications often operate in complex and uncertain ...
Online planning methods for partially observable Markov decision processes (POMDPs) have re- cently ...
Partially Observable Markov Decision Processes (POMDPs) define a useful formalism to express probabi...
We address the problem of learning relationships on state variables in Partially Observable Markov D...
Summarization: Online planning methods for partially observable Markov decision processes (POMDPs) h...
Motion planning under uncertainty that can efficiently take into account changes in the environment ...
AbstractIn this paper, we bring techniques from operations research to bear on the problem of choosi...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
The framework of partially observable Markov decision processes (POMDPs) offers a standard approach ...
In automated planning, action preconditions are boolean-valued formulas, which check whether a given...
Partially Observable Monte-Carlo Planning (POMCP) is a powerful online algorithm able to generate ap...
Partially Observable Monte Carlo Planning (POMCP) is a powerful online algorithm that can generate a...
Partially Observable Monte-Carlo Planning (POMCP) is a powerful online algorithm able to generate ap...
Partially Observable Monte-Carlo Planning (POMCP) is a powerful online algorithm that can generate o...
Partially Observable Monte Carlo Planning is a recently proposed online planning algorithm which mak...
Autonomous mobile robots employed in industrial applications often operate in complex and uncertain ...
Online planning methods for partially observable Markov decision processes (POMDPs) have re- cently ...
Partially Observable Markov Decision Processes (POMDPs) define a useful formalism to express probabi...
We address the problem of learning relationships on state variables in Partially Observable Markov D...
Summarization: Online planning methods for partially observable Markov decision processes (POMDPs) h...
Motion planning under uncertainty that can efficiently take into account changes in the environment ...
AbstractIn this paper, we bring techniques from operations research to bear on the problem of choosi...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
The framework of partially observable Markov decision processes (POMDPs) offers a standard approach ...
In automated planning, action preconditions are boolean-valued formulas, which check whether a given...