Branch-and-bound is a systematic enumerative method for combinatorial optimization, where the performance highly relies on the variable selection strategy. State-of-the-art handcrafted heuristic strategies suffer from relatively slow inference time for each selection, while the current machine learning methods require a significant amount of labeled data. We propose a new approach for solving the data labeling and inference latency issues in combinatorial optimization based on the use of the reinforcement learning (RL) paradigm. We use imitation learning to bootstrap an RL agent and then use Proximal Policy Optimization (PPO) to further explore global optimal actions. Then, a value network is used to run Monte-Carlo tree search (MCTS) to en...
Applying reinforcement learning (RL) to combinatorial optimization problems is attractive as it remo...
Combinatorial Optimization Problems (COPs) are a family of problems that search over a finite set of...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
Finding tight bounds on the optimal solution is a critical element of practical solution methods for...
Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset o...
Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset o...
Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset o...
This paper gives a detailed review of reinforcement learning (RL) in combinatorial optimization, int...
Combinatorial optimisation problems framed as mixed integer linear programmes (MILPs) are ubiquitous...
Recently, deep reinforcement learning (RL) has proven its feasibility in solving combinatorial optim...
Neural Combinatorial Optimization approaches have recently leveraged the expressiveness and flexibil...
Over the recent years, reinforcement learning (RL) starts to show promising results in tackling comb...
This paper presents an approach that uses reinforcement learning (RL) algorithms to solve combinator...
Combinatorial optimization problem (COP) over graphs is a fundamental challenge in optimization. Rei...
We study combinatorial problems with real world applications such as machine scheduling, routing, an...
Applying reinforcement learning (RL) to combinatorial optimization problems is attractive as it remo...
Combinatorial Optimization Problems (COPs) are a family of problems that search over a finite set of...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
Finding tight bounds on the optimal solution is a critical element of practical solution methods for...
Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset o...
Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset o...
Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset o...
This paper gives a detailed review of reinforcement learning (RL) in combinatorial optimization, int...
Combinatorial optimisation problems framed as mixed integer linear programmes (MILPs) are ubiquitous...
Recently, deep reinforcement learning (RL) has proven its feasibility in solving combinatorial optim...
Neural Combinatorial Optimization approaches have recently leveraged the expressiveness and flexibil...
Over the recent years, reinforcement learning (RL) starts to show promising results in tackling comb...
This paper presents an approach that uses reinforcement learning (RL) algorithms to solve combinator...
Combinatorial optimization problem (COP) over graphs is a fundamental challenge in optimization. Rei...
We study combinatorial problems with real world applications such as machine scheduling, routing, an...
Applying reinforcement learning (RL) to combinatorial optimization problems is attractive as it remo...
Combinatorial Optimization Problems (COPs) are a family of problems that search over a finite set of...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...