C-SPPO: A deep reinforcement learning framework for large-scale dynamic logistics UAV routing problem

被引:0
|
作者
Wang, Fei [1 ]
Zhang, Honghai [1 ,2 ]
Du, Sen [1 ]
Hua, Mingzhuang [2 ]
Zhong, Gang [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Civil Aviat, Nanjing 211106, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Gen Aviat & Flight, Nanjing 211106, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Unmanned aerial vehicle; Vehicle routing problem; Order delivery; Reinforcement learning; Multi-agent; Proximal policy optimization;
D O I
10.1016/j.cja.2024.09.005
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Unmanned Aerial Vehicle (UAV) stands as a burgeoning electric transportation carrier, holding substantial promise for the logistics sector. A reinforcement learning framework Centralized - S Proximal Policy Optimization (C-SPPO) based on centralized decision process and considering policy entropy (S) is proposed. The proposed framework aims to plan the best scheduling scheme with the objective of minimizing both the timeout of order requests and the flight impact of UAVs that may lead to conflicts. In this framework, the intents of matching act are generated through the observations of UAV agents, and the ultimate conflict-free matching results are output under the guidance of a centralized decision maker. Concurrently, a pre-activation operation is introduced to further enhance the cooperation among UAV agents. Simulation experiments based on real-world data from New York City are conducted. The results indicate that the proposed CSPPO outperforms the baseline algorithms in the Average Delay Time (ADT), the Maximum Delay Time (MDT), the Order Delay Rate (ODR), the Average Flight Distance (AFD), and the Flight Impact Ratio (FIR). Furthermore, the framework demonstrates scalability to scenarios of different sizes without requiring additional training. (c) 2024 Production and hosting by Elsevier Ltd. on behalf of Chinese Society of Aeronautics and Astronautics. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
引用
收藏
页数:21
相关论文
共 50 条
  • [41] DL-DRL: A Double-Level Deep Reinforcement Learning Approach for Large-Scale Task Scheduling of Multi-UAV
    Mao, Xiao
    Wu, Guohua
    Fan, Mingfeng
    Cao, Zhiguang
    Pedrycz, Witold
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1028 - 1044
  • [42] A dynamic spatial-temporal deep learning framework for traffic speed prediction on large-scale road networks
    Zheng, Ge
    Chai, Wei Koong
    Katos, Vasilis
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
  • [43] Capacity planning in logistics corridors: Deep reinforcement learning for the dynamic stochastic temporal bin packing problem
    Farahani, Amirreza
    Genga, Laura
    Schrotenboer, Albert H.
    Dijkman, Remco
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 191
  • [44] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [45] A LARGE-SCALE PATH PLANNING ALGORITHM FOR UNDERWATER ROBOTS BASED ON DEEP REINFORCEMENT LEARNING
    Wang, Wenhui
    Li, Leqing
    Ye, Fumeng
    Peng, Yumin
    Ma, Yiming
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2024, 39 (03): : 204 - 210
  • [46] Hierarchical Mean-Field Deep Reinforcement Learning for Large-Scale Multiagent Systems
    Yu, Chao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11744 - 11752
  • [47] A DEEP REINFORCEMENT LEARNING APPROACH TO FLOCKING AND NAVIGATION OF UAVS IN LARGE-SCALE COMPLEX ENVIRONMENTS
    Wang, Chao
    Wang, Jian
    Zhang, Xudong
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 1228 - 1232
  • [48] Autonomous Navigation of UAVs in Large-Scale Complex Environments: A Deep Reinforcement Learning Approach
    Wang, Chao
    Wang, Jian
    Shen, Yuan
    Zhang, Xudong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (03) : 2124 - 2136
  • [49] Large-scale traffic control using autonomous vehicles and decentralized deep reinforcement learning
    Maske, Harshal
    Chu, Tianshu
    Kalabic, Uros
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3816 - 3821
  • [50] Implementation of a Large-Scale Image Curation Workflow Using Deep Learning Framework
    Domalpally, Amitha
    Slater, Robert
    Barrett, Nancy
    Voland, Rick
    Balaji, Rohit
    Heathcote, Jennifer
    Channa, Roomasa
    Blodi, Barbara
    OPHTHALMOLOGY SCIENCE, 2022, 2 (04):