A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone

被引:21
|
作者
Bogyrbayeva, Aigerim [1 ]
Yoon, Taehyun [2 ]
Ko, Hanbum [2 ]
Lim, Sungbin [3 ]
Yun, Hyokun [4 ]
Kwon, Changhyun [5 ]
机构
[1] Suleyman Demirel Univ, Kaskelen, Kazakhstan
[2] UNIST, Ulsan, South Korea
[3] Korea Univ, Seoul, South Korea
[4] Amazon, Seattle, WA USA
[5] Univ S Florida, Tampa, FL 33620 USA
基金
新加坡国家研究基金会; 美国国家科学基金会;
关键词
Vehicle routing; Traveling salesman problem; Drones; Reinforcement learning; Neural networks; NEIGHBORHOOD SEARCH; OPTIMIZATION; LOGISTICS; TRUCK;
D O I
10.1016/j.trc.2022.103981
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Reinforcement learning has recently shown promise in learning quality solutions in many combinatorial optimization problems. In particular, the attention-based encoder-decoder models show high effectiveness on various routing problems, including the Traveling Salesman Problem (TSP). Unfortunately, they perform poorly for the TSP with Drone (TSP-D), requiring routing a heterogeneous fleet of vehicles in coordination-a truck and a drone. In TSP-D, the two vehicles are moving in tandem and may need to wait at a node for the other vehicle to join. State-less attention-based decoder fails to make such coordination between vehicles. We propose a hybrid model that uses an attention encoder and a Long Short-Term Memory (LSTM) network decoder, in which the decoder's hidden state can represent the sequence of actions made. We empirically demonstrate that such a hybrid model improves upon a purely attention-based model for both solution quality and computational efficiency. Our experiments on the min-max Capacitated Vehicle Routing Problem (mmCVRP) also confirm that the hybrid model is more suitable for the coordinated routing of multiple vehicles than the attention-based model. The proposed model demonstrates comparable results as the operations research baseline methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Applying Deep Learning and Reinforcement Learning to Traveling Salesman Problem
    Miki, Shoma
    Yamamoto, Daisuke
    Ebara, Hiroyuki
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRONICS & COMMUNICATIONS ENGINEERING (ICCECE), 2018, : 65 - 70
  • [2] Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning
    Zhang, Zizhen
    Liu, Hong
    Zhou, MengChu
    Wang, Jiahai
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 2119 - 2132
  • [3] Solving Time-Dependent Traveling Salesman Problem with Time Windows with Deep Reinforcement Learning
    Wu, Guojin
    Zhang, Zizhen
    Liu, Hong
    Wang, Jiahai
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 558 - 563
  • [4] Deep Reinforcement Learning for Traveling Salesman Problem with Time Windows and Rejections
    Zhang, Rongkai
    Prokhorchuk, Anatolii
    Dauwels, Justin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] Reinforcement learning for the traveling salesman problem with refueling
    André L. C. Ottoni
    Erivelton G. Nepomuceno
    Marcos S. de Oliveira
    Daniela C. R. de Oliveira
    [J]. Complex & Intelligent Systems, 2022, 8 : 2001 - 2015
  • [6] Reinforcement learning for the traveling salesman problem with refueling
    Ottoni, Andre L. C.
    Nepomuceno, Erivelton G.
    Oliveira, Marcos S. de
    Oliveira, Daniela C. R. de
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (03) : 2001 - 2015
  • [7] Prize-Collecting Traveling Salesman Problem: a Reinforcement Learning Approach
    Ruiz, Justin
    Gonzalez, Christopher
    Chen, Yutian
    Tang, Bin
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4416 - 4421
  • [8] Learning to cooperate in solving the traveling salesman problem
    Qi, DH
    Sun, R
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (1-2) : 151 - 162
  • [9] G-DGANet: Gated deep graph attention network with reinforcement learning for solving traveling salesman problem
    Fellek, Getu
    Farid, Ahmed
    Fujimura, Shigeru
    Yoshie, Osamu
    Gebreyesus, Goytom
    [J]. NEUROCOMPUTING, 2024, 579
  • [10] G-DGANet: Gated deep graph attention network with reinforcement learning for solving traveling salesman problem
    Fellek, Getu
    Farid, Ahmed
    Fujimura, Shigeru
    Yoshie, Osamu
    Gebreyesus, Goytom
    [J]. Neurocomputing, 2024, 579