Deep Reinforcement Learning for Combinatorial Optimization: Covering Salesman Problems

被引:38
|
作者
Li, Kaiwen [1 ,2 ]
Zhang, Tao [1 ,2 ]
Wang, Rui [1 ,2 ]
Wang, Yuheng [3 ]
Han, Yi [4 ]
Wang, Ling [5 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[2] Hunan Key Lab Multienergy Syst Intelligent Interc, HKL MSI2T, Changsha 410073, Peoples R China
[3] Natl Univ Def Technol, Grad Coll, Changsha 410073, Peoples R China
[4] Natl Univ Def Technol, Coll Comp, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Peoples R China
[5] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Urban areas; Deep learning; Optimization; Task analysis; Approximation algorithms; Reinforcement learning; Search problems; Attention; covering salesman problem (CSP); deep learning; deep reinforcement learning (DRL); LOCAL SEARCH; COMPUTATION; ALGORITHM;
D O I
10.1109/TCYB.2021.3103811
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article introduces a new deep learning approach to approximately solve the covering salesman problem (CSP). In this approach, given the city locations of a CSP as input, a deep neural network model is designed to directly output the solution. It is trained using the deep reinforcement learning without supervision. Specifically, in the model, we apply the multihead attention (MHA) to capture the structural patterns, and design a dynamic embedding to handle the dynamic patterns of the problem. Once the model is trained, it can generalize to various types of CSP tasks (different sizes and topologies) without the need of retraining. Through controlled experiments, the proposed approach shows desirable time complexity: it runs more than 20 times faster than the traditional heuristic solvers with a tiny gap of optimality. Moreover, it significantly outperforms the current state-of-the-art deep learning approaches for combinatorial optimization in the aspect of both training and inference. In comparison with traditional solvers, this approach is highly desirable for most of the challenging tasks in practice that are usually large scale and require quick decisions.
引用
收藏
页码:13142 / 13155
页数:14
相关论文
共 50 条
  • [1] Transfer Reinforcement Learning for Combinatorial Optimization Problems
    Souza, Gleice Kelly Barbosa
    Santos, Samara Oliveira Silva
    Ottoni, Andre Luiz Carvalho
    Oliveira, Marcos Santos
    Oliveira, Daniela Carine Ramires
    Nepomuceno, Erivelton Geraldo
    [J]. ALGORITHMS, 2024, 17 (02)
  • [2] A general deep reinforcement learning hyperheuristic framework for solving combinatorial optimization problems
    Kallestad, Jakob
    Hasibi, Ramin
    Hemmati, Ahmad
    Soerensen, Kenneth
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 309 (01) : 446 - 468
  • [3] Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning
    Zhang, Zizhen
    Liu, Hong
    Zhou, MengChu
    Wang, Jiahai
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 2119 - 2132
  • [4] Deep Reinforcement Learning for Exact Combinatorial Optimization: Learning to Branch
    Zhang, Tianyu
    Banitalebi-Dehkordi, Amin
    Zhang, Yong
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3105 - 3111
  • [5] Deep reinforcement learning with credit assignment for combinatorial optimization
    Yan, Dong
    Weng, Jiayi
    Huang, Shiyu
    Li, Chongxuan
    Zhou, Yichi
    Su, Hang
    Zhu, Jun
    [J]. PATTERN RECOGNITION, 2022, 124
  • [6] Solving combinatorial optimization problems over graphs with BERT-Based Deep Reinforcement Learning
    Wang, Qi
    Lai, Kenneth H.
    Tang, Chunlei
    [J]. INFORMATION SCIENCES, 2023, 619 : 930 - 946
  • [7] Deep reinforcement learning for transportation network combinatorial optimization: A survey
    Wang, Qi
    Tang, Chunlei
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 233
  • [8] Deep reinforcement learning for multi-objective combinatorial optimization: A case study on multi-objective traveling salesman problem
    Li, Shicheng
    Wang, Feng
    He, Qi
    Wang, Xujie
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
  • [9] Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning
    Yu, James J. Q.
    Yu, Wen
    Gu, Jiatao
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) : 3806 - 3817
  • [10] Research Reviews of Combinatorial Optimization Methods Based on Deep Reinforcement Learning
    Li, Kai-Wen
    Zhang, Tao
    Wang, Rui
    Qin, Wei-Jian
    He, Hui-Hui
    Huang, Hong
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (11): : 2521 - 2537