Deep Reinforcement Learning-Based Multi-Agent Algorithm for Vehicle Routing Problem in Complex Logistics Scenarios

被引：0

作者：

Zhang, Xinzhi ^{[1
]}

Yang, Yeming ^{[1
]}

Cai, Junchuang ^{[1
]}

Zhu, Qingling ^{[1
]}

Chen, Weineng ^{[1
]}

Lin, Qiuzhen ^{[1
]}

机构：

[1] Shenzhen Univ, Shenzhen, Peoples R China

来源：

2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

vehicle routing problem; multi-agent; deep reinforcement learning; metaheuristics; SIMULTANEOUS DELIVERY; NEIGHBORHOOD SEARCH; REVERSE LOGISTICS; PICKUP;

D O I：

10.1109/IJCNN60899.2024.10650335

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The Vehicle Routing Problem with Simultaneous Pickup-Delivery and Time Windows (VRPSPDTW) is a highly challenging issue in complex logistics distribution scenarios, requiring an optimal balance between cost and efficiency. Traditional methods often rely on single heuristic or metaheuristic algorithms, which perform not so well when dealing with VRPSPDTW. To overcome this challenge, we propose a deep reinforcement learning-based multi-agent algorithm (DRL-MA) to tackle the VRPSPDTW. Our algorithm includes explorative, exploitative, and perturbative agents, which are responsible for balancing exploration and exploitation. The action space of each agent comprises a combination of neighborhood operators, and then the Deep Q-network (DQN) is used to learn effective neighborhood transition sequences from a long-term perspective, which can effectively explore large and complex solution spaces. The cooperation and competition among agents during the search process offer a more flexible and effective strategy. Experimental studies conducted on a real test suite of large-scale VRPSPDTW instances validate the superiority of our proposed DRL-MA over some state-of-the-art algorithms.

引用

页数：8

共 50 条

[1] Multi-Agent Deep Reinforcement Learning-Based Algorithm For Fast Generalization On Routing Problems
Barbahan, Ibraheem
Baikalov, Vladimir
Vyatkin, Valeriy
Filchenkov, Andrey
10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 228 - 238
[2] A multi-agent deep reinforcement learning approach for solving the multi-depot vehicle routing problem
Arishi, Ali
Krishnan, Krishna
JOURNAL OF MANAGEMENT ANALYTICS, 2023, 10 (03) : 493 - 515
[3] Energy-Saving Multi-Agent Deep Reinforcement Learning Algorithm for Drone Routing Problem
Shu, Xiulan
Lin, Anping
Wen, Xupeng
SENSORS, 2024, 24 (20)
[4] Energy-Saving Multi-Agent Deep Reinforcement Learning Algorithm for Drone Routing Problem
Shu, Xiulan
Lin, Anping
Wen, Xupeng
Sensors, 24 (20):
[5] A Hybrid Reinforcement Learning-Based Model for the Vehicle Routing Problem in Transportation Logistics
Phiboonbanakit, Thananut
Horanont, Teerayut
Huynh, Van-Nam
Supnithi, Thepchai
IEEE ACCESS, 2021, 9 : 163325 - 163347
[6] Multi-agent Reinforcement Learning-Based UAS Control for Logistics Environments
Jo, Hyungeun
Lee, Hoeun
Jeon, Sangwoo
Kaliappan, Vishnu Kumar
Nguyen, Tuan Anh
Min, Dugki
Lee, Jae-Woo
PROCEEDINGS OF THE 2021 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY (APISAT 2021), VOL 2, 2023, 913 : 963 - 972
[7] Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
Mak, Stephen
Xu, Liming
Pearce, Tim
Ostroumov, Michael
Brintrup, Alexandra
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 157
[8] A Multi-Agent Reinforcement Learning-Based Optimized Routing for QoS in IoT
Jeaunita, T. C. Jermin
Sarasvathi, V
CYBERNETICS AND INFORMATION TECHNOLOGIES, 2021, 21 (04) : 45 - 61
[9] Research on intelligent algorithm for alerting vehicle impact based on multi-agent deep reinforcement learning
Zexue Wang
Qidong Wan
Yangmei Qin
Senqing Fan
Zeyi Xiao
Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 1337 - 1347
[10] Research on intelligent algorithm for alerting vehicle impact based on multi-agent deep reinforcement learning
Wang, Zexue
Wan, Qidong
Qin, Yangmei
Fan, Senqing
Xiao, Zeyi
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (01) : 1337 - 1347

← 1 2 3 4 5 →