Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

Cited by: 64

Authors:

Wang, Yuanda [1,2]

He, Haibo [3]

Sun, Changyin [1,2]

Affiliations:

[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China

[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China

[3] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA

Source:

IEEE TRANSACTIONS ON GAMES | 2018 / Vol. 10 / No. 4

Funding:

National Natural Science Foundation of China; U.S. National Science Foundation;

Keywords:

Deep learning; navigation; reinforcement learning; two-stream Q-network; OBSTACLE AVOIDANCE; SIMULTANEOUS LOCALIZATION;

DOI:

10.1109/TG.2018.2849942

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we propose an end-to-end modular reinforcement learning architecture for a navigation task in complex dynamic environments with rapidly moving obstacles. In this architecture, the main task is divided into two subtasks: local obstacle avoidance and global navigation. For obstacle avoidance, we develop a two-stream Q-network, which processes spatial and temporal information separately and generates action values. The global navigation subtask is resolved by a conventional Q-network framework. An online learning network and an action scheduler are introduced to first combine two pretrained policies, and then continue exploring and optimizing until a stable policy is obtained. The two-stream Q-network obtains better performance than the conventional deep Q-learning approach in the obstacle avoidance subtask. Experiments on the main task demonstrate that the proposed architecture can efficiently avoid moving obstacles and complete the navigation task at a high success rate. The modular architecture enables parallel training and also demonstrates good generalization capability in different environments.
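The modular idea in the abstract, two subtask policies whose action values are combined by a scheduler, can be illustrated with a minimal sketch. This is not the authors' method: the abstract describes a learned online network and action scheduler, whereas the sketch below swaps in a hand-written distance rule purely for illustration. The names `ACTIONS`, `greedy`, `schedule_action`, `obstacle_distance`, and `danger_radius` are all hypothetical and do not come from the paper.

```python
from typing import Sequence

# Hypothetical discrete action set shared by both modules (an assumption).
ACTIONS = ("forward", "left", "right", "stay")


def greedy(q_values: Sequence[float]) -> int:
    """Return the index of the largest action value (first index wins ties)."""
    return max(range(len(q_values)), key=lambda i: q_values[i])


def schedule_action(
    q_avoid: Sequence[float],
    q_nav: Sequence[float],
    obstacle_distance: float,
    danger_radius: float = 1.0,
) -> int:
    """Pick an action index from one of two modules' action values.

    Illustrative rule only (an assumption, not the learned scheduler from
    the paper): follow the obstacle-avoidance module when an obstacle is
    closer than danger_radius, otherwise follow the navigation module.
    """
    if len(q_avoid) != len(q_nav):
        raise ValueError("both modules must score the same action set")
    source = q_avoid if obstacle_distance < danger_radius else q_nav
    return greedy(source)


if __name__ == "__main__":
    q_avoid = [0.1, 0.9, 0.2, 0.0]  # made-up values favouring "left"
    q_nav = [0.8, 0.1, 0.1, 0.0]    # made-up values favouring "forward"
    print(ACTIONS[schedule_action(q_avoid, q_nav, obstacle_distance=0.5)])
    print(ACTIONS[schedule_action(q_avoid, q_nav, obstacle_distance=3.0)])
```

Because each module only has to expose a vector of action values, the two policies can be trained separately and combined afterwards, which is the property the abstract attributes to the modular architecture.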

Pages: 400-412

Number of pages: 13
