Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

被引:64
|
作者
Wang, Yuanda [1 ,2 ]
He, Haibo [3 ]
Sun, Changyin [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China
[3] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Deep learning; navigation; reinforcement learning; two-stream Q-network; OBSTACLE AVOIDANCE; SIMULTANEOUS LOCALIZATION;
D O I
10.1109/TG.2018.2849942
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an end-to-end modular reinforcement learning architecture for a navigation task in complex dynamic environments with rapidly moving obstacles. In this architecture, the main task is divided into two subtasks: local obstacle avoidance and global navigation. For obstacle avoidance, we develop a two-stream Q-network, which processes spatial and temporal information separately and generates action values. The global navigation subtask is resolved by a conventional Q-network framework. An online learning network and an action scheduler are introduced to first combine two pretrained policies, and then continue exploring and optimizing until a stable policy is obtained. The two-stream Q-network obtains better performance than the conventional deep Q-learning approach in the obstacle avoidance subtask. Experiments on the main task demonstrate that the proposed architecture can efficiently avoid moving obstacles and complete the navigation task at a high success rate. The modular architecture enables parallel training and also demonstrates good generalization capability in different environments.
引用
收藏
页码:400 / 412
页数:13
相关论文
共 50 条
  • [21] Active particles using reinforcement learning to navigate in complex motility landscapes
    Monderkamp, Paul A.
    Schwarzendahl, Fabian Jan
    Klatt, Michael A.
    Loewen, Hartmut
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04):
  • [22] A modular framework for stabilizing deep reinforcement learning control
    Lawrence, Nathan P.
    Loewen, Philip D.
    Wang, Shuyuan
    Forbes, Michael G.
    Gopaluni, R. Bhushan
    [J]. IFAC PAPERSONLINE, 2023, 56 (02): : 8006 - 8011
  • [23] Finding key players in complex networks through deep reinforcement learning
    Fan, Changjun
    Zeng, Li
    Sun, Yizhou
    Liu, Yang-Yu
    [J]. NATURE MACHINE INTELLIGENCE, 2020, 2 (06) : 317 - 324
  • [24] Finding key players in complex networks through deep reinforcement learning
    Changjun Fan
    Li Zeng
    Yizhou Sun
    Yang-Yu Liu
    [J]. Nature Machine Intelligence, 2020, 2 : 317 - 324
  • [25] A Framework for Learning Dynamic Movement Primitives with Deep Reinforcement Learning
    Noohian, Amirhossein
    Raisi, Mehran
    Khodaygan, Saeed
    [J]. 2022 10TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2022, : 329 - 334
  • [26] A deep reinforcement learning based distributed multi-UAV dynamic area coverage algorithm for complex environment
    Xiao, Jian
    Yuan, Guohui
    Xue, Yuxi
    He, Jinhui
    Wang, Yaoting
    Zou, Yuanjiang
    Wang, Zhuoran
    [J]. NEUROCOMPUTING, 2024, 595
  • [27] Deep Reinforcement Learning for Dynamic Things of Interest Recommendation in Intelligent Ambient Environment
    Altulyan, May S.
    Huang, Chaoran
    Yao, Lina
    Wang, Xianzhi
    Kanhere, Salil
    [J]. AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 393 - 404
  • [28] Multi-agent Deep Reinforcement Learning for Task Allocation in Dynamic Environment
    Ben Noureddine, Dhouha
    Gharbi, Atef
    Ben Ahmed, Samir
    [J]. ICSOFT: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2017, : 17 - 26
  • [29] Cooperative Multi-Robot Navigation in Dynamic Environment with Deep Reinforcement Learning
    Han, Ruihua
    Chen, Shengduo
    Hao, Qi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 448 - 454
  • [30] Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
    Rajeswaran, Aravind
    Kumar, Vikash
    Gupta, Abhishek
    Vezzani, Giulia
    Schulman, John
    Todorov, Emanuel
    Levine, Sergey
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,