A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone

被引：21

作者：

Bogyrbayeva, Aigerim ^{[1
]}

Yoon, Taehyun ^{[2
]}

Ko, Hanbum ^{[2
]}

Lim, Sungbin ^{[3
]}

Yun, Hyokun ^{[4
]}

Kwon, Changhyun ^{[5
]}

机构：

[1] Suleyman Demirel Univ, Kaskelen, Kazakhstan

[2] UNIST, Ulsan, South Korea

[3] Korea Univ, Seoul, South Korea

[4] Amazon, Seattle, WA USA

[5] Univ S Florida, Tampa, FL 33620 USA

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2023年 / 148卷

基金：

新加坡国家研究基金会; 美国国家科学基金会;

关键词：

Vehicle routing; Traveling salesman problem; Drones; Reinforcement learning; Neural networks; NEIGHBORHOOD SEARCH; OPTIMIZATION; LOGISTICS; TRUCK;

D O I：

10.1016/j.trc.2022.103981

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Reinforcement learning has recently shown promise in learning quality solutions in many combinatorial optimization problems. In particular, the attention-based encoder-decoder models show high effectiveness on various routing problems, including the Traveling Salesman Problem (TSP). Unfortunately, they perform poorly for the TSP with Drone (TSP-D), requiring routing a heterogeneous fleet of vehicles in coordination-a truck and a drone. In TSP-D, the two vehicles are moving in tandem and may need to wait at a node for the other vehicle to join. State-less attention-based decoder fails to make such coordination between vehicles. We propose a hybrid model that uses an attention encoder and a Long Short-Term Memory (LSTM) network decoder, in which the decoder's hidden state can represent the sequence of actions made. We empirically demonstrate that such a hybrid model improves upon a purely attention-based model for both solution quality and computational efficiency. Our experiments on the min-max Capacitated Vehicle Routing Problem (mmCVRP) also confirm that the hybrid model is more suitable for the coordinated routing of multiple vehicles than the attention-based model. The proposed model demonstrates comparable results as the operations research baseline methods.

引用

页数：19

共 50 条

[1] Applying Deep Learning and Reinforcement Learning to Traveling Salesman Problem
Miki, Shoma
Yamamoto, Daisuke
Ebara, Hiroyuki
[J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRONICS & COMMUNICATIONS ENGINEERING (ICCECE), 2018, : 65 - 70
[2] Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning
Zhang, Zizhen
Liu, Hong
Zhou, MengChu
Wang, Jiahai
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 2119 - 2132
[3] Solving Time-Dependent Traveling Salesman Problem with Time Windows with Deep Reinforcement Learning
Wu, Guojin
Zhang, Zizhen
Liu, Hong
Wang, Jiahai
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 558 - 563
[4] Deep Reinforcement Learning for Traveling Salesman Problem with Time Windows and Rejections
Zhang, Rongkai
Prokhorchuk, Anatolii
Dauwels, Justin
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[5] Reinforcement learning for the traveling salesman problem with refueling
André L. C. Ottoni
Erivelton G. Nepomuceno
Marcos S. de Oliveira
Daniela C. R. de Oliveira
[J]. Complex & Intelligent Systems, 2022, 8 : 2001 - 2015
[6] Reinforcement learning for the traveling salesman problem with refueling
Ottoni, Andre L. C.
Nepomuceno, Erivelton G.
Oliveira, Marcos S. de
Oliveira, Daniela C. R. de
[J]. COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (03) : 2001 - 2015
[7] Prize-Collecting Traveling Salesman Problem: a Reinforcement Learning Approach
Ruiz, Justin
Gonzalez, Christopher
Chen, Yutian
Tang, Bin
[J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4416 - 4421
[8] Learning to cooperate in solving the traveling salesman problem
Qi, DH
Sun, R
[J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (1-2) : 151 - 162
[9] G-DGANet: Gated deep graph attention network with reinforcement learning for solving traveling salesman problem
Fellek, Getu
Farid, Ahmed
Fujimura, Shigeru
Yoshie, Osamu
Gebreyesus, Goytom
[J]. NEUROCOMPUTING, 2024, 579
[10] G-DGANet: Gated deep graph attention network with reinforcement learning for solving traveling salesman problem
Fellek, Getu
Farid, Ahmed
Fujimura, Shigeru
Yoshie, Osamu
Gebreyesus, Goytom
[J]. Neurocomputing, 2024, 579

← 1 2 3 4 5 →