Autonomous navigation of mobile robots in unknown environments using off-policy reinforcement learning with curriculum learning

被引:3
|
作者
Yin, Yan [1 ]
Chen, Zhiyu [1 ,2 ]
Liu, Gang [1 ,2 ]
Yin, Jiasong [1 ]
Guo, Jianwei [1 ,2 ]
机构
[1] Changchun Univ Technol, Sch Comp Sci & Engn, Changchun 130012, Peoples R China
[2] Jilin Prov Data Serv Ind Publ Technol Res Ctr, Changchun, Peoples R China
关键词
Soft actor critic (SAC); CEP; Trajectory energy; Curriculum learning; Fuzzy logic control; Sampling efficiency; VISUAL NAVIGATION;
D O I
10.1016/j.eswa.2024.123202
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) is effective for autonomous navigation tasks without prior knowledge of the environment. However, traditional mobile robot navigation algorithms, based on off -policy RL, often face challenges such as low sample efficiency during training and lack of adequate safety mechanisms. In this paper, we present an off -policy RL navigation model named Soft Actor -Critic with Curriculum Prioritization and Fuzzy Logic (SCF). The model uses energy as a prioritized evaluation metric for experience replay. And through task -level curriculum, the agent's learning sequence is formulated, thereby enhancing sampling efficiency and safety. We propose a Curriculum -based Energy Prioritization (CEP) approach. It selects a replay trajectory that matches the current agent's capability based on trajectory energy. Our results show that robots using off -policy RL often have limitations in dynamic obstacle avoidance. To rectify this, our model uses a fuzzy logic controller to enhance real-time obstacle avoidance. The SCF approach enables mobile robots to navigate adeptly in unpredictable and dynamic environments, ensuring optimal planning control while being safe and robust. Experiments in Gazebo simulation environment and real world confirm the effectiveness of our proposed method. The comparison results show the superior performance of this method, especially in unknown and dynamic environments.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Off-Policy Deep Reinforcement Learning without Exploration
    Fujimoto, Scott
    Meger, David
    Precup, Doina
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [42] Flexible Data Augmentation in Off-Policy Reinforcement Learning
    Rak, Alexandra
    Skrynnik, Alexey
    Panov, Aleksandr I.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2021), PT I, 2021, 12854 : 224 - 235
  • [43] Mixed experience sampling for off-policy reinforcement learning
    Yu, Jiayu
    Li, Jingyao
    Lu, Shuai
    Han, Shuai
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [44] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey
    Wang S.-R.
    Niu W.-J.
    Tong E.-D.
    Chen T.
    Li H.
    Tian Y.-Z.
    Liu J.-Q.
    Han Z.
    Li Y.-D.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1926 - 1945
  • [45] Off-Policy Reinforcement Learning for H∞ Control Design
    Luo, Biao
    Wu, Huai-Ning
    Huang, Tingwen
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (01) : 65 - 76
  • [46] An Extended Navigation Framework for Autonomous Mobile Robot in Dynamic Environments using Reinforcement Learning Algorithm
    Nguyen Van Dinh
    Nguyen Hong Viet
    Lan Anh Nguyen
    Hong Toan Dinh
    Nguyen Tran Hiep
    Pham Trung Dung
    Trung-Dung Ngo
    Xuan-Tung Truong
    2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 336 - 339
  • [47] Three-Dimensional Waypoint Navigation of Multicopters by Attitude and Throttle Commands using Off-Policy Reinforcement Learning
    d'Apolito, Francesco
    Sulzbachner, Christoph
    2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, : 1359 - 1366
  • [48] Bayesian reinforcement learning for navigation planning in unknown environments
    Alali, Mohammad
    Imani, Mahdi
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [49] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
    Thomas, Philip S.
    Brunskill, Emma
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [50] Route Planning for Autonomous Mobile Robots Using a Reinforcement Learning Algorithm
    Talaat, Fatma M. M.
    Ibrahim, Abdelhameed
    El-Kenawy, El-Sayed M.
    Abdelhamid, Abdelaziz M. A.
    Alhussan, Amel Ali
    Khafaga, Doaa Sami
    Salem, Dina Ahmed
    ACTUATORS, 2023, 12 (01)