Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

被引：64

作者：

Wang, Yuanda ^{[1
,2
]}

He, Haibo ^{[3
]}

Sun, Changyin ^{[1
,2
]}

机构：

[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China

[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China

[3] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA

来源：

IEEE TRANSACTIONS ON GAMES | 2018年 / 10卷 / 04期

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

Deep learning; navigation; reinforcement learning; two-stream Q-network; OBSTACLE AVOIDANCE; SIMULTANEOUS LOCALIZATION;

D O I：

10.1109/TG.2018.2849942

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose an end-to-end modular reinforcement learning architecture for a navigation task in complex dynamic environments with rapidly moving obstacles. In this architecture, the main task is divided into two subtasks: local obstacle avoidance and global navigation. For obstacle avoidance, we develop a two-stream Q-network, which processes spatial and temporal information separately and generates action values. The global navigation subtask is resolved by a conventional Q-network framework. An online learning network and an action scheduler are introduced to first combine two pretrained policies, and then continue exploring and optimizing until a stable policy is obtained. The two-stream Q-network obtains better performance than the conventional deep Q-learning approach in the obstacle avoidance subtask. Experiments on the main task demonstrate that the proposed architecture can efficiently avoid moving obstacles and complete the navigation task at a high success rate. The modular architecture enables parallel training and also demonstrates good generalization capability in different environments.

引用

页码：400 / 412

页数：13

共 50 条

[1] Learn to Navigate Autonomously Through Deep Reinforcement Learning
Wu, Keyu
Wang, Han
Esfahani, Mahdi Abolfazli
Yuan, Shenghai
[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (05) : 5342 - 5352
[2] Learning to navigate a crystallization model with Deep Reinforcement Learning
Manee, Vidhyadhar
Baratti, Roberto
Romagnoli, Jose A.
[J]. CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 178 : 111 - 123
[3] iTD3-CLN: Learn to navigate in dynamic scene through Deep Reinforcement Learning
Jiang, Haoge
Esfahani, Mahdi Abolfazli
Wu, Keyu
Wan, Kong-wah
Heng, Kuan-kian
Wang, Han
Jiang, Xudong
[J]. NEUROCOMPUTING, 2022, 503 : 118 - 128
[4] Learning to Navigate in Human Environments via Deep Reinforcement Learning
Gao, Xingyuan
Sun, Shiying
Zhao, Xiaoguang
Tan, Min
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 418 - 429
[5] Collision avoidance for AGV based on deep reinforcement learning in complex dynamic environment
Cai, Ze
Hu, Yaoguang
Wen, Jingqian
Zhang, Lixiang
[J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (01): : 236 - 245
[6] Navigating Robots in Dynamic Environment With Deep Reinforcement Learning
Zhou, Zhiqian
Zeng, Zhiwen
Lang, Lin
Yao, Weijia
Lu, Huimin
Zheng, Zhiqiang
Zhou, Zongtan
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25201 - 25211
[7] Learning How Pedestrians Navigate: A Deep Inverse Reinforcement Learning Approach
Fahad, Muhammad
Chen, Zhuo
Guo, Yi
[J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 819 - 826
[8] Automated Deep Reinforcement Learning Environment for Hardware of a Modular Legged Robot
Ha, Sehoon
Kim, Joohyung
Yamane, Katsu
[J]. 2018 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2018, : 348 - 354
[9] Deep Reinforcement Learning for Dynamic Workflow Scheduling in Cloud Environment
Dong, Tingting
Xue, Fei
Xiao, Changbai
Zhang, Jiangjiang
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2021), 2021, : 107 - 115
[10] Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning
Shan, Qinfeng
Wang, Weijie
Guo, Dingfei
Sun, Xiangrong
Jia, Lihao
[J]. 2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 575 - 580

← 1 2 3 4 5 →