This research seeks to quantify the impact of the choice of reward function on behavioral diversity in learning robot teams. The methodology developed for this work has been applied to multirobot foraging, soccer and cooperative movement. This paper focuses specifically on results in multirobot foraging. In these experiments three types of reward are used with Q-learning to train a multirobot team to forage: a local performance-based reward, a global performance-based reward, and a heuristic strategy referred to as shaped reinforcement. Local strategies provide each agent a specific reward according to its own behavior, while global rewards provide all the agents on the team the same reward simultaneously. Shaped reinforcement pro...
Cooperative decentralized multirobot learning refers to the use of multiple learning entities to lea...
We examine a canonical multi-robot foraging task, in which multiple objects must be located, collect...
Decentralized multirobot learning refers to the use of multiple learning entities to achieve the opt...
This paper describes research investigating behavioral specialization in learning robot teams. Each...
In many cases cooperation between robots is implemented using explicit, perhaps complex, coordinatio...
Behavioral diversity is known to benefit problem solving in biological social systems such as insec...
In this paper we present a novel approach to assigning roles to robots in a team of physical heterog...
This paper addresses qualitative and quantitative diversity and specialization issues in the framewo...
Robot Reinforcement Learning (RL) algorithms return a policy that maximizes a global cumulative rew...
We present a means in which individual members of a multi-robot team may allocate themselves into sp...
International audienceWe consider a scenario where an agent has multiple available strategies to exp...
Behavioural diversity has been demonstrated as beneficial in biological social systems, such as ins...
This paper addresses qualitative and quantitative diversity and specialization issues in the framewo...
International audienceIn this paper, we study how a swarm of robots adapts over time to solve a coll...
Decentralized multirobot learning refers to the use of multiple learning entities to achieve the opt...
Cooperative decentralized multirobot learning refers to the use of multiple learning entities to lea...
We examine a canonical multi-robot foraging task, in which multiple objects must be located, collect...
Decentralized multirobot learning refers to the use of multiple learning entities to achieve the opt...
This paper describes research investigating behavioral specialization in learning robot teams. Each...
In many cases cooperation between robots is implemented using explicit, perhaps complex, coordinatio...
Behavioral diversity is known to benefit problem solving in biological social systems such as insec...
In this paper we present a novel approach to assigning roles to robots in a team of physical heterog...
This paper addresses qualitative and quantitative diversity and specialization issues in the framewo...
Robot Reinforcement Learning (RL) algorithms return a policy that maximizes a global cumulative rew...
We present a means in which individual members of a multi-robot team may allocate themselves into sp...
International audienceWe consider a scenario where an agent has multiple available strategies to exp...
Behavioural diversity has been demonstrated as beneficial in biological social systems, such as ins...
This paper addresses qualitative and quantitative diversity and specialization issues in the framewo...
International audienceIn this paper, we study how a swarm of robots adapts over time to solve a coll...
Decentralized multirobot learning refers to the use of multiple learning entities to achieve the opt...
Cooperative decentralized multirobot learning refers to the use of multiple learning entities to lea...
We examine a canonical multi-robot foraging task, in which multiple objects must be located, collect...
Decentralized multirobot learning refers to the use of multiple learning entities to achieve the opt...