A path planning method based on deep reinforcement learning for crowd evacuation

被引:0
|
作者
Meng X. [1 ,2 ]
Liu H. [1 ,2 ]
Li W. [1 ,2 ]
机构
[1] School of Information Science and Engineering, Shandong Normal University, Jinan
[2] Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology, Jinan
基金
中国国家自然科学基金;
关键词
Crowd evacuation; Deep reinforcement learning; Optimized multi-agent deep deterministic policy gradient; Path planning;
D O I
10.1007/s12652-024-04787-x
中图分类号
学科分类号
摘要
Deep reinforcement learning (DRL) is suitable for solving complex path-planning problems due to its excellent ability to make continuous decisions in a complex environment. However, the increase in the population size in the crowd evacuation path-planning problem causes a substantial computational burden for the algorithm, which leads to an unsatisfactory efficiency of the current DRL algorithm. This paper presents a path planning method based on DRL for crowd evacuation to solve the problem. First, we divide crowds into groups based on their relationship and distance from each other and select leaders from them. Next, we expand the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to propose an Optimized Multi-Agent Deep Deterministic Policy Gradient (OMADDPG) algorithm to obtain the global evacuation path. The OMADDPG algorithm uses the Cross-Entropy Method (CEM) to optimize policy and improve the neural network’s training efficiency by applying the Data Pruning (DP) algorithm. In addition, the social force model is improved, incorporating the relationship between individuals and psychological factors into the model. Finally, this paper combines the improved social force model and the OMADDPG algorithm. The OMADDPG algorithm transmits the path information to the leaders. Pedestrians in the environment are driven by the improved social force model to follow the leaders to complete the evacuation simulation. The method can use a leader to guide pedestrians safely arrive the exit and reduce evacuation time in different environments. The simulation results prove the efficiency of the path planning method. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:2925 / 2939
页数:14
相关论文
共 50 条
  • [1] Crowd evacuation path planning and simulation method based on deep reinforcement learning and repulsive force fieldCrowd evacuation path planning and simulation method based on deep...H. Wang and H. Liu
    Hongyue Wang
    Hong Liu
    Wenhao Li
    Applied Intelligence, 2025, 55 (4)
  • [2] AFSA based path planning method for crowd evacuation
    Lu, Dianjie
    Zhang, Guijuan
    Liu, Yiliang
    Wang, Dequan
    Liu, Hong
    Journal of Information and Computational Science, 2014, 11 (11): : 3815 - 3823
  • [3] A double-layer crowd evacuation simulation method based on deep reinforcement learning
    Zhang, Yong
    Yang, Bo
    Zhu, Jianlin
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (03)
  • [4] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [5] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
    Han, Huiyan
    Wang, Jiaqi
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    SENSORS, 2023, 23 (12)
  • [6] Path planning of manipulator based on deep reinforcement learning and screw method
    Wang Y.
    Wang Y.-H.
    Yin Z.-Z.
    Wan P.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (03): : 516 - 524
  • [7] Robot path planning based on deep reinforcement learning
    Long, Yinxin
    He, Huajin
    2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
  • [8] Crowd Evacuation Simulation Using Hierarchical Deep Reinforcement Learning
    Zhang, Zheng
    Lu, Dianjie
    Li, Jialiuyuan
    Liu, Pingshan
    Zhang, Guijuan
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 563 - 568
  • [9] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
    Meng, Haitao
    Zhang, Hengrui
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [10] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Yanglong Liu
    Zuguo Chen
    Yonggang Li
    Ming Lu
    Chaoyang Chen
    Xuzhuo Zhang
    International Journal of Control, Automation and Systems, 2022, 20 : 2669 - 2680