This paper presents IB-POMCP, a novel algorithm for online planning under partial observability. Our approach enhances the decision-making process by using estimations of the world belief's entropy to guide a tree search process and surpass the limitations of planning in scenarios with sparse reward configurations. By performing what we denominate as an information-guided planning process, the algorithm, which incorporates a novel I-UCB function, shows significant improvements in reward and reasoning time compared to state-of-the-art baselines in several benchmark scenarios, along with theoretical convergence guarantees
Partially observable Markov decision processes (POMDPs) offer a principled approach to control under...
Rarely planning domains are fully observable. For this reason, the ability to deal with partial obse...
POMDPs provide a principled framework for planning under uncertainty, but are computationally intrac...
Prior studies have demonstrated that for many real-world problems, POMDPs can be solved through onli...
Planning under uncertainty is a central topic at the intersection of disciplines such as artificial ...
Planning in large partially observable Markov decision processes (POMDPs) is challenging especially ...
This ongoing phD work aims at proposing a unified framework to optimize both perception and task pla...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
International audienceIn this article, we discuss how to solve information-gathering problems expres...
Partially Observable Monte Carlo Planning is a recently proposed online planning algorithm which mak...
International audienceIn this article, we discuss how to solve information-gathering problems expres...
Deciding how to act in partially observable environments remains an active area of research. Identi...
Partially observable Markov decision processes (POMDPs) offer a principled approach to control under...
Rarely planning domains are fully observable. For this reason, the ability to deal with partial obse...
POMDPs provide a principled framework for planning under uncertainty, but are computationally intrac...
Prior studies have demonstrated that for many real-world problems, POMDPs can be solved through onli...
Planning under uncertainty is a central topic at the intersection of disciplines such as artificial ...
Planning in large partially observable Markov decision processes (POMDPs) is challenging especially ...
This ongoing phD work aims at proposing a unified framework to optimize both perception and task pla...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
9 pages, revised version of ECAI 2020 paperIn this article, we discuss how to solve information-gath...
International audienceIn this article, we discuss how to solve information-gathering problems expres...
Partially Observable Monte Carlo Planning is a recently proposed online planning algorithm which mak...
International audienceIn this article, we discuss how to solve information-gathering problems expres...
Deciding how to act in partially observable environments remains an active area of research. Identi...
Partially observable Markov decision processes (POMDPs) offer a principled approach to control under...
Rarely planning domains are fully observable. For this reason, the ability to deal with partial obse...
POMDPs provide a principled framework for planning under uncertainty, but are computationally intrac...