Deep Reinforcement Learning-Based Multi-Agent Algorithm for Vehicle Routing Problem in Complex Logistics Scenarios

Authors
Zhang, Xinzhi [1]
Yang, Yeming [1]
Cai, Junchuang [1]
Zhu, Qingling [1]
Chen, Weineng [1]
Lin, Qiuzhen [1]
Affiliation
[1] Shenzhen University, Shenzhen, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
vehicle routing problem; multi-agent; deep reinforcement learning; metaheuristics; simultaneous delivery; neighborhood search; reverse logistics; pickup
DOI
10.1109/IJCNN60899.2024.10650335
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Vehicle Routing Problem with Simultaneous Pickup-Delivery and Time Windows (VRPSPDTW) is a highly challenging problem in complex logistics distribution scenarios, requiring a careful balance between cost and efficiency. Traditional methods often rely on a single heuristic or metaheuristic algorithm, which performs poorly on the VRPSPDTW. To overcome this limitation, we propose a deep reinforcement learning-based multi-agent algorithm (DRL-MA) for the VRPSPDTW. The algorithm includes explorative, exploitative, and perturbative agents, which together balance exploration and exploitation. The action space of each agent consists of a combination of neighborhood operators, and a Deep Q-Network (DQN) learns effective neighborhood transition sequences from a long-term perspective, which helps the search explore large and complex solution spaces. Cooperation and competition among the agents during the search yield a more flexible and effective strategy. Experiments on a real-world test suite of large-scale VRPSPDTW instances show that the proposed DRL-MA outperforms several state-of-the-art algorithms.
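The idea of learning which neighborhood operator to apply next can be illustrated with a small sketch. This is not the authors' DRL-MA: it assumes a single agent, swaps the DQN for a tabular Q-learning update, and uses a plain closed tour without pickups, deliveries, or time windows. The state definition (the last operator applied), the reward (cost reduction), and every function name below are hypothetical choices made only for illustration.

```python
import math
import random

def tour_length(tour, pts):
    """Length of the closed tour that visits pts in the given order."""
    return sum(math.dist(pts[tour[i]], pts[tour[(i + 1) % len(tour)]])
               for i in range(len(tour)))

# Three simple neighborhood operators; each returns a new tour.
def swap(tour, rng):
    i, j = rng.sample(range(len(tour)), 2)
    new = tour[:]
    new[i], new[j] = new[j], new[i]
    return new

def two_opt(tour, rng):
    i, j = sorted(rng.sample(range(len(tour)), 2))
    return tour[:i] + tour[i:j + 1][::-1] + tour[j + 1:]

def relocate(tour, rng):
    new = tour[:]
    node = new.pop(rng.randrange(len(new)))
    new.insert(rng.randrange(len(new) + 1), node)
    return new

OPERATORS = [swap, two_opt, relocate]

def q_guided_search(pts, steps=2000, alpha=0.1, gamma=0.9, eps=0.2, seed=0):
    """Local search in which a Q-table picks the next operator.

    State  = index of the operator applied last (assumed, for illustration).
    Action = index of the operator to apply next.
    Reward = reduction in tour length (0 if the move is rejected).
    """
    rng = random.Random(seed)
    n_ops = len(OPERATORS)
    q = [[0.0] * n_ops for _ in range(n_ops)]
    tour = list(range(len(pts)))
    cost = tour_length(tour, pts)
    initial_cost = cost
    state = 0
    for _ in range(steps):
        # Epsilon-greedy choice of the next operator.
        if rng.random() < eps:
            action = rng.randrange(n_ops)
        else:
            action = max(range(n_ops), key=lambda a: q[state][a])
        candidate = OPERATORS[action](tour, rng)
        cand_cost = tour_length(candidate, pts)
        reward = max(0.0, cost - cand_cost)
        if cand_cost < cost:  # accept improving moves only
            tour, cost = candidate, cand_cost
        # One-step Q-learning update; the discounted term is what gives
        # the "long-term" view over operator sequences.
        target = reward + gamma * max(q[action])
        q[state][action] += alpha * (target - q[state][action])
        state = action
    return tour, cost, initial_cost, q
```

Calling `q_guided_search(pts)` on a list of 2-D points returns the final tour, its length, the starting length, and the learned Q-table. Because only improving moves are accepted, the final length never exceeds the starting one. A fuller treatment would replace the table with a neural network and add the separate explorative, exploitative, and perturbative agents described in the abstract.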
Pages: 8