We address two significant drawbacks of state-of-the-art solvers of decentralized POMDPs (DECPOMDPs): the reliance on complete knowledge of the model and limited scalability as the complexity of the domain grows. We extend a recently proposed approach for solving DEC-POMDPs via a reduction to the maximum likelihood problem, which in turn can be solved using EM. We introduce a model-free version of this approach that employs Monte-Carlo EM (MCEM). While a naïve implementation of MCEM is inadequate in multiagent settings, we introduce several improvements in sampling that produce high-quality results on a variety of DEC-POMDP benchmarks, including large problems with thousands of agents
We present decentralized rollout sampling pol-icy iteration (DecRSPI) — a new algorithm for multi-a...
Solving decentralized partially observable Markov decision processes (DEC-POMDPs) is a difficult tas...
Monte-Carlo Tree Search (MCTS) techniques are state-of-the-art for online planning in Partially Obse...
We address two significant drawbacks of state-of-the-art solvers of decentralized POMDPs (DEC-POMDPs...
Decentralized POMDPs provide a rigorous framework for multi-agent decision-theoretic planning. Howev...
Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. Whi...
Abstract. Planning for multiple agents under uncertainty is often based on decentralized partially o...
Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a powerful modeling ...
Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a powerful modeling ...
We advance the state of the art in optimal solving of decentralized partially observable Markov deci...
We advance the state of the art in optimal solving of decentralized partially observable Markov deci...
We present a memory-bounded optimization approach for solving infinite-horizon decen-tralized POMDPs...
Decentralized POMDP is an expressive model for multi-agent planning. Finite-state controllers (FSCs)...
In this paper we focus on distributed multiagent planning under uncertainty. For single-agent planni...
International audienceDecentralized partially observable Markov decision processes (Dec-POMDPs) are ...
We present decentralized rollout sampling pol-icy iteration (DecRSPI) — a new algorithm for multi-a...
Solving decentralized partially observable Markov decision processes (DEC-POMDPs) is a difficult tas...
Monte-Carlo Tree Search (MCTS) techniques are state-of-the-art for online planning in Partially Obse...
We address two significant drawbacks of state-of-the-art solvers of decentralized POMDPs (DEC-POMDPs...
Decentralized POMDPs provide a rigorous framework for multi-agent decision-theoretic planning. Howev...
Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. Whi...
Abstract. Planning for multiple agents under uncertainty is often based on decentralized partially o...
Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a powerful modeling ...
Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a powerful modeling ...
We advance the state of the art in optimal solving of decentralized partially observable Markov deci...
We advance the state of the art in optimal solving of decentralized partially observable Markov deci...
We present a memory-bounded optimization approach for solving infinite-horizon decen-tralized POMDPs...
Decentralized POMDP is an expressive model for multi-agent planning. Finite-state controllers (FSCs)...
In this paper we focus on distributed multiagent planning under uncertainty. For single-agent planni...
International audienceDecentralized partially observable Markov decision processes (Dec-POMDPs) are ...
We present decentralized rollout sampling pol-icy iteration (DecRSPI) — a new algorithm for multi-a...
Solving decentralized partially observable Markov decision processes (DEC-POMDPs) is a difficult tas...
Monte-Carlo Tree Search (MCTS) techniques are state-of-the-art for online planning in Partially Obse...