Deep Reinforcement Learning (RL) has achieved high success in solving routing problems. However, state-of-the-art deep RL approaches require a considerable amount of data before they reach reasonable performance. This may be acceptable for small problems, but as instances grow bigger, this fact severely limits the applicability of these methods to many real-world instances. In this work, we study a setting where the agent can access data from previously handcrafted heuristics for the Traveling Salesman Problem. In our setting, the agent has access to demonstrations from 2-opt improvement policies. Our goal is to learn policies that can surpass the quality of the demonstrations while requiring fewer samples than pure RL. In this study, we pr...
Abstract. Imitation learning is an effective strategy to reinforcement learning, which avoids the de...
Different from classic Supervised Learning, Reinforcement Learning (RL), is fundamentally interactiv...
Model-free deep-reinforcement-based learning algorithms have been applied to a range of COPs~\cite{b...
Deep Reinforcement Learning (RL) has achieved high success in solving routing problems. However, sta...
Recent works using deep learning to solve the Traveling Salesman Problem (TSP) have focused on learn...
Recent works using deep learning to solve the Traveling Salesman Problem (TSP) have focused on learn...
Recent works using deep learning to solve routing problems such as the traveling salesman problem (T...
A comprehensive literature on the Traveling Salesman Problem (TSP) is available, and this problem ha...
Deep reinforcement learning (DRL) has shown promise in solving challenging combinatorial optimizatio...
A comprehensive literature on the Traveling Salesman Problem (TSP) is available, and this problem ha...
In this paper, we study Reinforcement Learning from Demonstrations (RLfD) that improves the explorat...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
The ML-Constructive heuristic is a recently presented method and the first hybrid method capable of ...
Abstract. Imitation learning is an effective strategy to reinforcement learning, which avoids the de...
Different from classic Supervised Learning, Reinforcement Learning (RL), is fundamentally interactiv...
Model-free deep-reinforcement-based learning algorithms have been applied to a range of COPs~\cite{b...
Deep Reinforcement Learning (RL) has achieved high success in solving routing problems. However, sta...
Recent works using deep learning to solve the Traveling Salesman Problem (TSP) have focused on learn...
Recent works using deep learning to solve the Traveling Salesman Problem (TSP) have focused on learn...
Recent works using deep learning to solve routing problems such as the traveling salesman problem (T...
A comprehensive literature on the Traveling Salesman Problem (TSP) is available, and this problem ha...
Deep reinforcement learning (DRL) has shown promise in solving challenging combinatorial optimizatio...
A comprehensive literature on the Traveling Salesman Problem (TSP) is available, and this problem ha...
In this paper, we study Reinforcement Learning from Demonstrations (RLfD) that improves the explorat...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
The ML-Constructive heuristic is a recently presented method and the first hybrid method capable of ...
Abstract. Imitation learning is an effective strategy to reinforcement learning, which avoids the de...
Different from classic Supervised Learning, Reinforcement Learning (RL), is fundamentally interactiv...
Model-free deep-reinforcement-based learning algorithms have been applied to a range of COPs~\cite{b...