A fascinating aspect of nature lies in its ability to produce a large and diverse collection of organisms that are all high-performing in their niche. By contrast, most AI algorithms focus on finding a single efficient solution to a given problem. Aiming for diversity in addition to performance is a convenient way to deal with the exploration-exploitation trade-off that plays a central role in learning. It also allows for increased robustness when the returned collection contains several working solutions to the considered problem, making it well-suited for real applications such as robotics. Quality-Diversity (QD) methods are evolutionary algorithms designed for this purpose. This paper proposes a novel algorithm, QDPG, which combines the ...
While evolutionary computation and evolutionary robotics take inspiration from nature, they have lon...
Quality-Diversity optimisation algorithms enable the evolutionof collections of both high-performing...
Neuroevolution is an alternative to gradient-based optimisation that has the potential to avoid loca...
Exploration is a key challenge in Reinforcement Learning, especially in long-horizon, deceptive and ...
Recently there has been a growing movement of researchers that believes innovation and novelty creat...
While the field of Quality-Diversity (QD) has grown into a distinct branch of stochastic optimizatio...
Quality-Diversity algorithms, such as MAP-Elites, are a branch of Evolutionary Computation generatin...
We present a Quality-Diversity benchmark suite for Deep Neuroevolution in Reinforcement Learning dom...
Recent progress in reinforcement learning (RL) has started producing generally capable agents that c...
Traditional optimization algorithms search for a single global optimum that maximizes (or minimizes)...
The synergies between Quality-Diversity (QD) and Deep Reinforcement Learning (RL) have led to powerf...
Traditional optimization algorithms search for a single global optimum that maximizes (or minimizes)...
The goal of Quality Diversity Optimization is to generate a collection of diverse yet high-performin...
Quality-Diversity (QD) algorithms have recently gained traction as optimisation methods due to their...
We present Reward-Switching Policy Optimization (RSPO), a paradigm to discover diverse strategies in...
While evolutionary computation and evolutionary robotics take inspiration from nature, they have lon...
Quality-Diversity optimisation algorithms enable the evolutionof collections of both high-performing...
Neuroevolution is an alternative to gradient-based optimisation that has the potential to avoid loca...
Exploration is a key challenge in Reinforcement Learning, especially in long-horizon, deceptive and ...
Recently there has been a growing movement of researchers that believes innovation and novelty creat...
While the field of Quality-Diversity (QD) has grown into a distinct branch of stochastic optimizatio...
Quality-Diversity algorithms, such as MAP-Elites, are a branch of Evolutionary Computation generatin...
We present a Quality-Diversity benchmark suite for Deep Neuroevolution in Reinforcement Learning dom...
Recent progress in reinforcement learning (RL) has started producing generally capable agents that c...
Traditional optimization algorithms search for a single global optimum that maximizes (or minimizes)...
The synergies between Quality-Diversity (QD) and Deep Reinforcement Learning (RL) have led to powerf...
Traditional optimization algorithms search for a single global optimum that maximizes (or minimizes)...
The goal of Quality Diversity Optimization is to generate a collection of diverse yet high-performin...
Quality-Diversity (QD) algorithms have recently gained traction as optimisation methods due to their...
We present Reward-Switching Policy Optimization (RSPO), a paradigm to discover diverse strategies in...
While evolutionary computation and evolutionary robotics take inspiration from nature, they have lon...
Quality-Diversity optimisation algorithms enable the evolutionof collections of both high-performing...
Neuroevolution is an alternative to gradient-based optimisation that has the potential to avoid loca...