This article focussei on the automated synthesis of agents In an uncertain environment, working In the setting of Reinforcement Learning and more precisely of Partially Observable Markov Decision Processes. The agents (with no model of their environment and no short-term memory) are facing multiple motivations/goals simultaneously, a problem related to thefield of Action Selection. We propose and evaluate various Action Selection architectures. They all combine already known basic behaviors in an adaptive manner, by learning the tuning of the combination, so as to maximize the agent's payoff. The logical continuation of this work is to automate the selection and design of the basic behaviors themselves
Systems with multiple parallel goals (e.g. autonomous mobile robots) have a problem analogous to tha...
Learning and reasoning in large, structured, probabilistic worlds is at the heart of artificial inte...
This thesis concerns model-based methods to solve reinforcement learning problems: these methods def...
Colloque avec actes et comité de lecture. nationale.National audienceSome agents have to face multip...
National audienceThis article focusses on the automated synthesis of agents in an uncertain environm...
The problem addressed in this article is that of automatically designing autonomous agents having to...
Colloque avec actes et comité de lecture. internationale.International audienceAgents are of interes...
This thesis addresses the dilemma between exploration and exploitation as it is faced by reinforceme...
Colloque avec actes et comité de lecture. internationale.International audienceThe agent approach, a...
Colloque avec actes et comité de lecture. internationale.International audienceReinforcement Learnin...
http://mitpress.mit.edu/We investigate on designing agents facing multiple objectives simultaneously...
Presentation given at the MAA Southeast Section. Abstract In a Markov Decision Process, an agent mus...
This paper presents a novel approach to the problem of action selection for an autonomous agent. A...
My research attempts to address on-line action selection in reinforcement learning from a Bayesian p...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
Systems with multiple parallel goals (e.g. autonomous mobile robots) have a problem analogous to tha...
Learning and reasoning in large, structured, probabilistic worlds is at the heart of artificial inte...
This thesis concerns model-based methods to solve reinforcement learning problems: these methods def...
Colloque avec actes et comité de lecture. nationale.National audienceSome agents have to face multip...
National audienceThis article focusses on the automated synthesis of agents in an uncertain environm...
The problem addressed in this article is that of automatically designing autonomous agents having to...
Colloque avec actes et comité de lecture. internationale.International audienceAgents are of interes...
This thesis addresses the dilemma between exploration and exploitation as it is faced by reinforceme...
Colloque avec actes et comité de lecture. internationale.International audienceThe agent approach, a...
Colloque avec actes et comité de lecture. internationale.International audienceReinforcement Learnin...
http://mitpress.mit.edu/We investigate on designing agents facing multiple objectives simultaneously...
Presentation given at the MAA Southeast Section. Abstract In a Markov Decision Process, an agent mus...
This paper presents a novel approach to the problem of action selection for an autonomous agent. A...
My research attempts to address on-line action selection in reinforcement learning from a Bayesian p...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
Systems with multiple parallel goals (e.g. autonomous mobile robots) have a problem analogous to tha...
Learning and reasoning in large, structured, probabilistic worlds is at the heart of artificial inte...
This thesis concerns model-based methods to solve reinforcement learning problems: these methods def...