Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

被引:64
|
作者
Wang, Yuanda [1 ,2 ]
He, Haibo [3 ]
Sun, Changyin [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China
[3] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Deep learning; navigation; reinforcement learning; two-stream Q-network; OBSTACLE AVOIDANCE; SIMULTANEOUS LOCALIZATION;
D O I
10.1109/TG.2018.2849942
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an end-to-end modular reinforcement learning architecture for a navigation task in complex dynamic environments with rapidly moving obstacles. In this architecture, the main task is divided into two subtasks: local obstacle avoidance and global navigation. For obstacle avoidance, we develop a two-stream Q-network, which processes spatial and temporal information separately and generates action values. The global navigation subtask is resolved by a conventional Q-network framework. An online learning network and an action scheduler are introduced to first combine two pretrained policies, and then continue exploring and optimizing until a stable policy is obtained. The two-stream Q-network obtains better performance than the conventional deep Q-learning approach in the obstacle avoidance subtask. Experiments on the main task demonstrate that the proposed architecture can efficiently avoid moving obstacles and complete the navigation task at a high success rate. The modular architecture enables parallel training and also demonstrates good generalization capability in different environments.
引用
收藏
页码:400 / 412
页数:13
相关论文
共 50 条
  • [1] Learn to Navigate Autonomously Through Deep Reinforcement Learning
    Wu, Keyu
    Wang, Han
    Esfahani, Mahdi Abolfazli
    Yuan, Shenghai
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (05) : 5342 - 5352
  • [2] Learning to navigate a crystallization model with Deep Reinforcement Learning
    Manee, Vidhyadhar
    Baratti, Roberto
    Romagnoli, Jose A.
    [J]. CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 178 : 111 - 123
  • [3] iTD3-CLN: Learn to navigate in dynamic scene through Deep Reinforcement Learning
    Jiang, Haoge
    Esfahani, Mahdi Abolfazli
    Wu, Keyu
    Wan, Kong-wah
    Heng, Kuan-kian
    Wang, Han
    Jiang, Xudong
    [J]. NEUROCOMPUTING, 2022, 503 : 118 - 128
  • [4] Learning to Navigate in Human Environments via Deep Reinforcement Learning
    Gao, Xingyuan
    Sun, Shiying
    Zhao, Xiaoguang
    Tan, Min
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 418 - 429
  • [5] Collision avoidance for AGV based on deep reinforcement learning in complex dynamic environment
    Cai, Ze
    Hu, Yaoguang
    Wen, Jingqian
    Zhang, Lixiang
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (01): : 236 - 245
  • [6] Navigating Robots in Dynamic Environment With Deep Reinforcement Learning
    Zhou, Zhiqian
    Zeng, Zhiwen
    Lang, Lin
    Yao, Weijia
    Lu, Huimin
    Zheng, Zhiqiang
    Zhou, Zongtan
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25201 - 25211
  • [7] Learning How Pedestrians Navigate: A Deep Inverse Reinforcement Learning Approach
    Fahad, Muhammad
    Chen, Zhuo
    Guo, Yi
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 819 - 826
  • [8] Automated Deep Reinforcement Learning Environment for Hardware of a Modular Legged Robot
    Ha, Sehoon
    Kim, Joohyung
    Yamane, Katsu
    [J]. 2018 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2018, : 348 - 354
  • [9] Deep Reinforcement Learning for Dynamic Workflow Scheduling in Cloud Environment
    Dong, Tingting
    Xue, Fei
    Xiao, Changbai
    Zhang, Jiangjiang
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2021), 2021, : 107 - 115
  • [10] Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning
    Shan, Qinfeng
    Wang, Weijie
    Guo, Dingfei
    Sun, Xiangrong
    Jia, Lihao
    [J]. 2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 575 - 580