Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning

被引:76
|
作者
Zhang, Zizhen [1 ,2 ]
Liu, Hong [3 ]
Zhou, MengChu [4 ,5 ]
Wang, Jiahai [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510275, Peoples R China
[2] Sun Yat Sen Univ, Guangdong Key Lab Big Data Anal & Proc, Guangzhou 510275, Peoples R China
[3] City Univ Hong Kong, Sch Data Sci, Hong Kong, Peoples R China
[4] New Jersey Inst Technol, Helen & John C Hartmann Dept Elect & Comp Engn, Newark, NJ 07102 USA
[5] St Petersburg State Marine Tech Univ, Dept Cyber Phys Syst, St Petersburg 198262, Russia
基金
美国国家科学基金会;
关键词
Routing; Traveling salesman problems; Planning; Real-time systems; Decision making; Computational modeling; Reinforcement learning; Attention model; deep reinforcement learning (DRL); dynamic traveling salesman problem (DTSP); machine learning; policy gradient; ANT COLONY OPTIMIZATION; ROUTING PROBLEM; TIME WINDOWS; ALGORITHM; SEARCH;
D O I
10.1109/TNNLS.2021.3105905
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A traveling salesman problem (TSP) is a well-known NP-complete problem. Traditional TSP presumes that the locations of customers and the traveling time among customers are fixed and constant. In real-life cases, however, the traffic conditions and customer requests may change over time. To find the most economic route, the decisions can be made constantly upon the time-point when the salesman completes his service of each customer. This brings in a dynamic version of the traveling salesman problem (DTSP), which takes into account the information of real-time traffic and customer requests. DTSP can be extended to a dynamic pickup and delivery problem (DPDP). In this article, we ameliorate the attention model to make it possible to perceive environmental changes. A deep reinforcement learning algorithm is proposed to solve DTSP and DPDP instances with a size of up to 40 customers in 100 locations. Experiments show that our method can capture the dynamic changes and produce a highly satisfactory solution within a very short time. Compared with other baseline approaches, more than 5% improvements can be observed in many cases.
引用
收藏
页码:2119 / 2132
页数:14
相关论文
共 50 条
  • [21] Solving biobjective traveling thief problems with multiobjective reinforcement learning
    Santiyuda, Gemilang
    Wardoyo, Retantyo
    Pulungan, Reza
    [J]. APPLIED SOFT COMPUTING, 2024, 161
  • [22] Learning to cooperate in solving the traveling salesman problem
    Qi, DH
    Sun, R
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (1-2) : 151 - 162
  • [23] Solving constrained traveling salesman problems by genetic algorithms
    Wu, CG
    Liang, YC
    Lee, HP
    Lu, C
    Lin, WZ
    [J]. PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2004, 14 (07) : 631 - 637
  • [24] A Graph Pointer Network-Based Multi-Objective Deep Reinforcement Learning Algorithm for Solving the Traveling Salesman Problem
    Perera, Jeewaka
    Liu, Shih-Hsi
    Mernik, Marjan
    Crepinsek, Matej
    Ravber, Miha
    [J]. MATHEMATICS, 2023, 11 (02)
  • [25] LIMITATIONS OF NEURAL NETWORKS FOR SOLVING TRAVELING SALESMAN PROBLEMS
    GEE, AH
    PRAGER, RW
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01): : 280 - 282
  • [26] Solving constrained traveling salesman problems by genetic algorithms
    WU Chunguo 1
    Key Laboratory for Symbol Computation and Knowledge Engineering
    2. Institute of High Performance Computing
    [J]. Progress in Natural Science:Materials International, 2004, (07) : 79 - 85
  • [27] Solving traveling salesman problems by artificial immune response
    Gong, Maoguo
    Jiao, Licheng
    Zhang, Lining
    [J]. SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2006, 4247 : 64 - 71
  • [28] EXPERIENCES WITH A PARALLELIZED METHOD FOR SOLVING THE TRAVELING SALESMAN PROBLEMS
    FERSTL, F
    THURNER, G
    [J]. ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1990, 70 (06): : T598 - T600
  • [29] Solving Physical Traveling Salesman Problems with Policy Adaptation
    Edelkamp, Stefan
    Greulich, Christoph
    [J]. 2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2014,
  • [30] Solving asymmetric traveling salesman problems using dynamic scheduling on a heterogeneous computing system
    Brest, J
    Zumer, V
    Zerovnik, J
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, PROCEEDINGS, 1999, : 1350 - 1356