A path planning method based on deep reinforcement learning for crowd evacuation

Cited by: 0
Authors
Xiangdong Meng
Hong Liu
Wenhao Li
Affiliations
[1] Shandong Normal University, School of Information Science and Engineering
[2] Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology
Keywords
Path planning; Deep reinforcement learning; Crowd evacuation; Optimized multi-agent deep deterministic policy gradient
DOI
10.1007/s12652-024-04787-x
Abstract
Deep reinforcement learning (DRL) is well suited to complex path-planning problems because of its ability to make continuous decisions in a complex environment. However, in the crowd evacuation path-planning problem, a growing population size places a substantial computational burden on the algorithm, which makes current DRL algorithms inefficient. This paper presents a DRL-based path planning method for crowd evacuation to address this problem. First, we divide the crowd into groups based on the relationships and distances between individuals, and select a leader from each group. Next, we extend the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm into an Optimized Multi-Agent Deep Deterministic Policy Gradient (OMADDPG) algorithm to obtain the global evacuation path. OMADDPG uses the Cross-Entropy Method (CEM) to optimize the policy and applies a Data Pruning (DP) algorithm to improve the training efficiency of the neural network. In addition, the social force model is improved by incorporating the relationships between individuals and psychological factors. Finally, the improved social force model is combined with OMADDPG: the OMADDPG algorithm transmits the path information to the leaders, and the improved social force model drives the pedestrians in the environment to follow the leaders and complete the evacuation simulation. The method uses leaders to guide pedestrians safely to the exit and reduces evacuation time in different environments. The simulation results demonstrate the efficiency of the path planning method.
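The abstract names the Cross-Entropy Method (CEM) as the policy optimizer inside OMADDPG but gives no detail on how it is applied. As a rough illustration of the general technique only, the sketch below runs a textbook CEM loop on a one-dimensional toy objective. The function names, the Gaussian sampler, the sample and elite sizes, and `toy_score` are all assumptions made for this example and are not taken from the paper.

```python
import random
import statistics


def cross_entropy_method(objective, mean, std, n_samples=50, n_elite=10,
                         n_iters=30, seed=0):
    """Maximise `objective` over one real parameter with a Gaussian sampler.

    Generic CEM sketch; not the paper's OMADDPG implementation.
    """
    rng = random.Random(seed)
    for _ in range(n_iters):
        # Sample candidate parameters from the current Gaussian.
        samples = [rng.gauss(mean, std) for _ in range(n_samples)]
        # Keep the highest-scoring candidates (the "elite" set).
        elite = sorted(samples, key=objective, reverse=True)[:n_elite]
        # Refit the Gaussian to the elite set; floor std so it stays positive.
        mean = statistics.fmean(elite)
        std = max(statistics.pstdev(elite), 1e-6)
    return mean


def toy_score(x):
    # Hypothetical stand-in for a policy's return; its maximum is at x = 3.
    return -(x - 3.0) ** 2


best = cross_entropy_method(toy_score, mean=0.0, std=5.0)
print(f"estimated optimum: {best:.3f}")
```

In a policy-search setting the scalar `x` would presumably be replaced by a vector of policy parameters and `toy_score` by an episode return, but how the paper combines CEM with the MADDPG actor-critic updates is not stated in the abstract.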
Pages: 2925 - 2939
Page count: 14
Related papers
50 in total
  • [1] A double-layer crowd evacuation simulation method based on deep reinforcement learning
    Zhang, Yong
    Yang, Bo
    Zhu, Jianlin
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (03)
  • [2] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    [J]. 2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [3] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
    Han, Huiyan
    Wang, Jiaqi
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    [J]. SENSORS, 2023, 23 (12)
  • [4] Path planning of manipulator based on deep reinforcement learning and screw method
    Wang, Yin
    Wang, Yong-Hua
    Yin, Ze-Zhong
    Wan, Pin
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (03): : 516 - 524
  • [5] Robot path planning based on deep reinforcement learning
    Long, Yinxin
    He, Huajin
    [J]. 2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
  • [6] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Yanglong Liu
    Zuguo Chen
    Yonggang Li
    Ming Lu
    Chaoyang Chen
    Xuzhuo Zhang
    [J]. International Journal of Control, Automation and Systems, 2022, 20 : 2669 - 2680
  • [7] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
    Meng, Haitao
    Zhang, Hengrui
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [8] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Liu, Yanglong
    Chen, Zuguo
    Li, Yonggang
    Lu, Ming
    Chen, Chaoyang
    Zhang, Xuzhuo
    [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (08) : 2669 - 2680
  • [9] Crowd Evacuation Simulation Using Hierarchical Deep Reinforcement Learning
    Zhang, Zheng
    Lu, Dianjie
    Li, Jialiuyuan
    Liu, Pingshan
    Zhang, Guijuan
    [J]. PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 563 - 568
  • [10] Crowd evacuation guidance based on combined action-space deep reinforcement learning
    Xue, Yiran
    Wu, Rui
    Liu, Jiafeng
    [J]. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2021, 53 (08): : 29 - 38