Artificial neural networks are promising as general function approximators but challenging to train on non-independent and identically distributed data due to catastrophic forgetting. Experience replay, a standard component in deep reinforcement learning, is often used to reduce forgetting and improve sample efficiency by storing experiences in a large buffer and using them for training later. However, a large replay buffer results in a heavy memory burden, especially for onboard and edge devices with limited memory capacities. We propose memory-efficient reinforcement learning algorithms based on the deep Q-network algorithm to alleviate this problem. Our algorithms reduce forgetting and maintain high sample efficiency by consolidating kno...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Neural networks are very powerful computational models, capable of outperforming humans on a variety...
Deep reinforcement learning has emerged as a powerful tool for a variety of learning tasks, however ...
Experience replay memory in reinforcement learning enables agents to remember and reuse past experie...
Recent years have seen a growing interest in the use of deep neural networks as function approximato...
Recent years have seen a growing interest in the use of deep neural networks as function approximato...
Humans teach each other by recollecting one's own experiences and sharing them with others. The inte...
With the capacity of continual learning, humans can continuously acquire knowledge throughout their ...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
In Continual Learning, a Neural Network is trained on a stream of data whose distribution shifts ove...
In Continual Learning, a Neural Network is trained on a stream of data whose distribution shifts ove...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Neural networks are very powerful computational models, capable of outperforming humans on a variety...
Deep reinforcement learning has emerged as a powerful tool for a variety of learning tasks, however ...
Experience replay memory in reinforcement learning enables agents to remember and reuse past experie...
Recent years have seen a growing interest in the use of deep neural networks as function approximato...
Recent years have seen a growing interest in the use of deep neural networks as function approximato...
Humans teach each other by recollecting one's own experiences and sharing them with others. The inte...
With the capacity of continual learning, humans can continuously acquire knowledge throughout their ...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
In Continual Learning, a Neural Network is trained on a stream of data whose distribution shifts ove...
In Continual Learning, a Neural Network is trained on a stream of data whose distribution shifts ove...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...