Deep Reinforcement Learning for Combinatorial Optimization: Covering Salesman Problems

被引：38

作者：

Li, Kaiwen ^{[1
,2
]}

Zhang, Tao ^{[1
,2
]}

Wang, Rui ^{[1
,2
]}

Wang, Yuheng ^{[3
]}

Han, Yi ^{[4
]}

Wang, Ling ^{[5
]}

机构：

[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China

[2] Hunan Key Lab Multienergy Syst Intelligent Interc, HKL MSI2T, Changsha 410073, Peoples R China

[3] Natl Univ Def Technol, Grad Coll, Changsha 410073, Peoples R China

[4] Natl Univ Def Technol, Coll Comp, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Peoples R China

[5] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2022年 / 52卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Urban areas; Deep learning; Optimization; Task analysis; Approximation algorithms; Reinforcement learning; Search problems; Attention; covering salesman problem (CSP); deep learning; deep reinforcement learning (DRL); LOCAL SEARCH; COMPUTATION; ALGORITHM;

D O I：

10.1109/TCYB.2021.3103811

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article introduces a new deep learning approach to approximately solve the covering salesman problem (CSP). In this approach, given the city locations of a CSP as input, a deep neural network model is designed to directly output the solution. It is trained using the deep reinforcement learning without supervision. Specifically, in the model, we apply the multihead attention (MHA) to capture the structural patterns, and design a dynamic embedding to handle the dynamic patterns of the problem. Once the model is trained, it can generalize to various types of CSP tasks (different sizes and topologies) without the need of retraining. Through controlled experiments, the proposed approach shows desirable time complexity: it runs more than 20 times faster than the traditional heuristic solvers with a tiny gap of optimality. Moreover, it significantly outperforms the current state-of-the-art deep learning approaches for combinatorial optimization in the aspect of both training and inference. In comparison with traditional solvers, this approach is highly desirable for most of the challenging tasks in practice that are usually large scale and require quick decisions.

引用

页码：13142 / 13155

页数：14

共 50 条

[1] Transfer Reinforcement Learning for Combinatorial Optimization Problems
Souza, Gleice Kelly Barbosa
Santos, Samara Oliveira Silva
Ottoni, Andre Luiz Carvalho
Oliveira, Marcos Santos
Oliveira, Daniela Carine Ramires
Nepomuceno, Erivelton Geraldo
[J]. ALGORITHMS, 2024, 17 (02)
[2] A general deep reinforcement learning hyperheuristic framework for solving combinatorial optimization problems
Kallestad, Jakob
Hasibi, Ramin
Hemmati, Ahmad
Soerensen, Kenneth
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 309 (01) : 446 - 468
[3] Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning
Zhang, Zizhen
Liu, Hong
Zhou, MengChu
Wang, Jiahai
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 2119 - 2132
[4] Deep Reinforcement Learning for Exact Combinatorial Optimization: Learning to Branch
Zhang, Tianyu
Banitalebi-Dehkordi, Amin
Zhang, Yong
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3105 - 3111
[5] Deep reinforcement learning with credit assignment for combinatorial optimization
Yan, Dong
Weng, Jiayi
Huang, Shiyu
Li, Chongxuan
Zhou, Yichi
Su, Hang
Zhu, Jun
[J]. PATTERN RECOGNITION, 2022, 124
[6] Solving combinatorial optimization problems over graphs with BERT-Based Deep Reinforcement Learning
Wang, Qi
Lai, Kenneth H.
Tang, Chunlei
[J]. INFORMATION SCIENCES, 2023, 619 : 930 - 946
[7] Deep reinforcement learning for transportation network combinatorial optimization: A survey
Wang, Qi
Tang, Chunlei
[J]. KNOWLEDGE-BASED SYSTEMS, 2021, 233
[8] Deep reinforcement learning for multi-objective combinatorial optimization: A case study on multi-objective traveling salesman problem
Li, Shicheng
Wang, Feng
He, Qi
Wang, Xujie
[J]. SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
[9] Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning
Yu, James J. Q.
Yu, Wen
Gu, Jiatao
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) : 3806 - 3817
[10] Research Reviews of Combinatorial Optimization Methods Based on Deep Reinforcement Learning
Li, Kai-Wen
Zhang, Tao
Wang, Rui
Qin, Wei-Jian
He, Hui-Hui
Huang, Hong
[J]. Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (11): : 2521 - 2537

← 1 2 3 4 5 →