Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning

被引：97

作者：

Everett, Michael ^{[1
]}

Chen, Yu Fan ^{[2
]}

How, Jonathan P. ^{[3
]}

机构：

[1] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA

[2] Facebook Real Labs, Redmond, WA 98052 USA

[3] MIT, Aeronaut & Astronaut, Cambridge, MA 02139 USA

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Collision avoidance; Robots; Reinforcement learning; Vehicle dynamics; Robot sensing systems; Heuristic algorithms; Dynamics; deep reinforcement learning; motion planning; multiagent systems; decentralized execution;

D O I：

10.1109/ACCESS.2021.3050338

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement (RL) learning as a framework to model the complex interactions and cooperation with nearby, decision-making agents, such as pedestrians and other robots. Existing RL-based works assume homogeneity of agent properties, use specific motion models over short timescales, or lack a principled method to handle a large, possibly varying number of agents. Therefore, this work develops an algorithm that learns collision avoidance among a variety of heterogeneous, non-communicating, dynamic agents without assuming they follow any particular behavior rules. It extends our previous work by introducing a strategy using Long Short-Term Memory (LSTM) that enables the algorithm to use observations of an arbitrary number of other agents, instead of a small, fixed number of neighbors. The proposed algorithm is shown to outperform a classical collision avoidance algorithm, another deep RL-based algorithm, and scales with the number of agents better (fewer collisions, shorter time to goal) than our previously published learning-based approach. Analysis of the LSTM provides insights into how observations of nearby agents affect the hidden state and quantifies the performance impact of various agent ordering heuristics. The learned policy generalizes to several applications beyond the training scenarios: formation control (arrangement into letters), demonstrations on a fleet of four multirotors and on a fully autonomous robotic vehicle capable of traveling at human walking speed among pedestrians.

引用

页码：10357 / 10377

页数：21

共 50 条

[1] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
Rafiei, Alireza
Fasakhodi, Amirhossein Oliaei
Hajati, Farshid
INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2022, 23 (03) : 613 - 622
[2] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
Alireza Rafiei
Amirhossein Oliaei Fasakhodi
Farshid Hajati
International Journal of Automotive Technology, 2022, 23 : 613 - 622
[3] Learning to Socially Navigate in Pedestrian-rich Environments with Interaction Capacity
Qiu, Quecheng
Yao, Shunyi
Wang, Jing
Ma, Jun
Chen, Guangda
Ji, Jianmin
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
[4] Deep reinforcement learning for pedestrian collision avoidance and human-machine cooperative driving
Li, Junxiang
Yao, Liang
Xu, Xin
Cheng, Bang
Ren, Junkai
INFORMATION SCIENCES, 2020, 532 : 110 - 124
[5] Deep Reinforcement Learning for Collision Avoidance of Robotic Manipulators
Sangiovanni, Bianca
Rendiniello, Angelo
Incremona, Gian Paolo
Ferrara, Antonella
Piastra, Marco
2018 EUROPEAN CONTROL CONFERENCE (ECC), 2018, : 2063 - 2068
[6] A Collision Avoidance Method Based on Deep Reinforcement Learning
Feng, Shumin
Sebastian, Bijo
Ben-Tzvi, Pinhas
ROBOTICS, 2021, 10 (02)
[7] Deep Reinforcement Learning for Collision Avoidance of Autonomous Vehicle
Tseng, Hsiao-Ting
Hsieh, Chen-Chiung
Lin, Wei-Ting
Lin, Jyun-Ting
2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
[8] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
Fei WANG
Xiaoping ZHU
Zhou ZHOU
Yang TANG
Chinese Journal of Aeronautics, 2024, 37 (03) : 237 - 257
[9] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
Wang, Fei
Zhu, Xiaoping
Zhou, Zhou
Tang, Yang
CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (03) : 237 - 257
[10] A learning method for AUV collision avoidance through deep reinforcement learning
Xu, Jian
Huang, Fei
Wu, Di
Cui, Yunfei
Yan, Zheping
Du, Xue
OCEAN ENGINEERING, 2022, 260

← 1 2 3 4 5 →