This paper explains an episodic-memory based approach for computing anticipatory robot behavior in a partially observable environment. Inspired by biological findings on the mammalian hippocampus, here, the episodic memories retain a sequence of experienced observation, behavior, and reward. Incorporating multiple machine learning methods, this approach attempts to help reducing the computational burden of the partially observable Markov decision process (POMDP). In particular, the proposed computational reduction techniques include: 1) abstraction of the state space via temporal difference learning; 2) abstraction of the action space by utilizing motor schemata; 3) narrowing down the state space in terms of the goals by employi...
The ability of an agent to detect changes in an environment is key to successful adaptation. This ab...
A very general framework for modeling uncertainty in learning environments is given by Partially Obs...
In this paper we describe a machine learning approach for acquiring a model of a robot behaviour fro...
Previously, we have introduced an anticipatory robot that could generate a cognitive map while simul...
Previously, we have introduced an anticipatory robot that could generate a cognitive map while simul...
Elements from cognitive psychology have been applied in a variety of ways to artificial intelligence...
Partially Observable Markov Decision Process models (POMDPs) have been applied to low-level robot c...
Representing and reasoning about uncertainty is crucial for autonomous agents acting in partially ob...
The ability to represent temporal information and to learn the timing of recurring, instantaneous ev...
International audienceAugmenting the representation of the current state of the external world with ...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
We address the problem of learning relationships on state variables in Partially Observable Markov D...
Faced with an ever-increasing complexity of their domains of application, artificial learning agents...
Partially observable Markov decision processes (pomdp's) model decision problems in which an a...
Passive sensory processing is often insufficient to guide biological organisms in complex environmen...
The ability of an agent to detect changes in an environment is key to successful adaptation. This ab...
A very general framework for modeling uncertainty in learning environments is given by Partially Obs...
In this paper we describe a machine learning approach for acquiring a model of a robot behaviour fro...
Previously, we have introduced an anticipatory robot that could generate a cognitive map while simul...
Previously, we have introduced an anticipatory robot that could generate a cognitive map while simul...
Elements from cognitive psychology have been applied in a variety of ways to artificial intelligence...
Partially Observable Markov Decision Process models (POMDPs) have been applied to low-level robot c...
Representing and reasoning about uncertainty is crucial for autonomous agents acting in partially ob...
The ability to represent temporal information and to learn the timing of recurring, instantaneous ev...
International audienceAugmenting the representation of the current state of the external world with ...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
We address the problem of learning relationships on state variables in Partially Observable Markov D...
Faced with an ever-increasing complexity of their domains of application, artificial learning agents...
Partially observable Markov decision processes (pomdp's) model decision problems in which an a...
Passive sensory processing is often insufficient to guide biological organisms in complex environmen...
The ability of an agent to detect changes in an environment is key to successful adaptation. This ab...
A very general framework for modeling uncertainty in learning environments is given by Partially Obs...
In this paper we describe a machine learning approach for acquiring a model of a robot behaviour fro...