Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning

被引:97
|
作者
Everett, Michael [1 ]
Chen, Yu Fan [2 ]
How, Jonathan P. [3 ]
机构
[1] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
[2] Facebook Real Labs, Redmond, WA 98052 USA
[3] MIT, Aeronaut & Astronaut, Cambridge, MA 02139 USA
关键词
Collision avoidance; Robots; Reinforcement learning; Vehicle dynamics; Robot sensing systems; Heuristic algorithms; Dynamics; deep reinforcement learning; motion planning; multiagent systems; decentralized execution;
D O I
10.1109/ACCESS.2021.3050338
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement (RL) learning as a framework to model the complex interactions and cooperation with nearby, decision-making agents, such as pedestrians and other robots. Existing RL-based works assume homogeneity of agent properties, use specific motion models over short timescales, or lack a principled method to handle a large, possibly varying number of agents. Therefore, this work develops an algorithm that learns collision avoidance among a variety of heterogeneous, non-communicating, dynamic agents without assuming they follow any particular behavior rules. It extends our previous work by introducing a strategy using Long Short-Term Memory (LSTM) that enables the algorithm to use observations of an arbitrary number of other agents, instead of a small, fixed number of neighbors. The proposed algorithm is shown to outperform a classical collision avoidance algorithm, another deep RL-based algorithm, and scales with the number of agents better (fewer collisions, shorter time to goal) than our previously published learning-based approach. Analysis of the LSTM provides insights into how observations of nearby agents affect the hidden state and quantifies the performance impact of various agent ordering heuristics. The learned policy generalizes to several applications beyond the training scenarios: formation control (arrangement into letters), demonstrations on a fleet of four multirotors and on a fully autonomous robotic vehicle capable of traveling at human walking speed among pedestrians.
引用
收藏
页码:10357 / 10377
页数:21
相关论文
共 50 条
  • [31] Research on MASS Collision Avoidance in Complex Waters Based on Deep Reinforcement Learning
    Liu, Jiao
    Shi, Guoyou
    Zhu, Kaige
    Shi, Jiahui
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
  • [32] Collision Avoidance for Indoor Service Robots Through Multimodal Deep Reinforcement Learning
    Leiva, Francisco
    Lobos-Tsunekawa, Kenzo
    Ruiz-del-Solar, Javier
    ROBOT WORLD CUP XXIII, ROBOCUP 2019, 2019, 11531 : 140 - 153
  • [33] Collision Avoidance Among Dense Heterogeneous Agents Using Deep Reinforcement Learning
    Zhu, Kai
    Li, Bin
    Zhe, Wenming
    Zhang, Tao
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (01) : 57 - 64
  • [34] Deep reinforcement learning with predictive auxiliary task for autonomous train collision avoidance
    Plissonneau, Antoine
    Jourdan, Luca
    Trentesaux, Damien
    Abdi, Lotfi
    Sallak, Mohamed
    Bekrar, Abdelghani
    Quost, Benjamin
    Schoen, Walter
    JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2024, 31
  • [35] Research on Method of Collision Avoidance Planning for UUV Based on Deep Reinforcement Learning
    Gao, Wei
    Han, Mengxue
    Wang, Zhao
    Deng, Lihui
    Wang, Hongjian
    Ren, Jingfei
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (12)
  • [36] Deep Reinforcement Learning Based Collision Avoidance Algorithm for Differential Drive Robot
    Lu, Xinglong
    Cao, Yiwen
    Zhao, Zhonghua
    Yan, Yilin
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2018), PT I, 2018, 10984 : 186 - 198
  • [37] Collision Detection and Avoidance for Multi-UAV based on Deep Reinforcement Learning
    Wang, Guanzheng
    Liu, Zhihong
    Xiao, Kun
    Xu, Yinbo
    Yang, Lingjie
    Wang, Xiangke
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7783 - 7789
  • [38] Collision avoidance for AGV based on deep reinforcement learning in complex dynamic environment
    Cai Z.
    Hu Y.
    Wen J.
    Zhang L.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (01): : 236 - 245
  • [39] Online Trajectory Planning with Reinforcement Learning for Pedestrian Avoidance
    Feher, Arpad
    Aradi, Szilard
    Becsi, Tamas
    ELECTRONICS, 2022, 11 (15)
  • [40] A COLREGs-Compliant Collision Avoidance Decision Approach Based on Deep Reinforcement Learning
    Wang, Weiqiang
    Huang, Liwen
    Liu, Kezhong
    Wu, Xiaolie
    Wang, Jingyao
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)