Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot-RoboDact

被引:1
|
作者
Zhang, Tiandong [1 ,2 ]
Wang, Rui [1 ]
Wang, Shuo [1 ,2 ,3 ,4 ]
Wang, Yu [1 ]
Zheng, Gang [5 ]
Tan, Min
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Shanghai 200031, Peoples R China
[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, F-59000 Lille, Peoples R China
[5] Univ Lille, CRIStAL Ctr Rech Informat Signal & Automat Lille, Cent Lille, Lille 100190, France
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Active disturbance rejection control (ADRC); bionic exploration robot; motion control; residual reinforcement learning (RRL); soft actor-critic (SAC); FISH; IMPLEMENTATION; MANEUVERS; SYSTEM;
D O I
10.1109/TIM.2023.3282297
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article aims to investigate the motion control method of a bionic underwater exploration robot (RoboDact). The robot is equipped with a double-joint tail fin and two undulating pectoral fins to obtain good mobility and stability. The hybrid propulsion mode helps perform stable and effective underwater exploration and measurement. To coordinate these two kinds of bionic propulsion fins and address the challenge of measurement noises and external disturbances during underwater exploration, a novel residual reinforcement learning method with parameter randomization (PR-RRL) is proposed. The control strategy is a weighted superposition of a feedback controller and a residual controller. The observation feedback controller based on active disturbance rejection control (ADRC) is adapted to improve stability and convergence. And the residual controller based on the soft actor-critic (SAC) algorithm is adapted to improve adaptability to uncertainties and disturbances. Moreover, the parameter randomization training strategy is proposed for adapting natural complicated scenarios by randomizing the partial dynamics of the underwater exploration robot during the training phase. Finally, the feasibility and efficacy of the presented motion control method are validated by comprehensive simulation tests and RoboDact prototype physical experiments.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Residual Reinforcement Learning for Robot Control
    Johannink, Tobias
    Bahl, Shikhar
    Nair, Ashvin
    Luo, Jianlan
    Kumar, Avinash
    Loskyll, Matthias
    Ojea, Juan Aparicio
    Solowjow, Eugen
    Levine, Sergey
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6023 - 6029
  • [2] Autonomous Motion Control Using Deep Reinforcement Learning for Exploration Robot on Rough Terrain
    Wang, Zijie
    Ji, Yonghoon
    Fujii, Hiromitsu
    Kono, Hitoshi
    [J]. 2022 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII 2022), 2022, : 1021 - 1025
  • [3] Bionic learning strategy applied on snake robot motion control
    Cai, Jianxian
    Ao, Dun
    Yu, Ruihong
    [J]. INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2013, 19 (03) : 265 - 274
  • [4] Natural Residual Reinforcement Learning for Bicycle Robot Control
    Zhu, Xianjin
    Zheng, Xudong
    Zhang, Qiyuan
    Chen, Zhang
    Liu, Yu
    Liang, Bin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2021), 2021, : 1201 - 1206
  • [5] Investigating exploration for deep reinforcement learning of concentric tube robot control
    Keshav Iyengar
    George Dwyer
    Danail Stoyanov
    [J]. International Journal of Computer Assisted Radiology and Surgery, 2020, 15 : 1157 - 1165
  • [6] Investigating exploration for deep reinforcement learning of concentric tube robot control
    Iyengar, Keshav
    Dwyer, George
    Stoyanov, Danail
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2020, 15 (07) : 1157 - 1165
  • [7] Research on Robot Massage Force Control Based on Residual Reinforcement Learning
    Xiao, Meng
    Zhang, Tie
    Zou, Yanbiao
    Chen, Shouyan
    Wu, Wen
    [J]. IEEE ACCESS, 2024, 12 : 18270 - 18279
  • [8] Research on Robot Massage Force Control Based on Residual Reinforcement Learning
    Xiao, Meng
    Zhang, Tie
    Zou, Yanbiao
    Chen, Shouyan
    Wu, Wen
    [J]. IEEE Access, 2024, 12 : 18270 - 18279
  • [9] Research progress of robot motion control based on deep reinforcement learning
    Dong, Hao
    Yang, Jing
    Li, Shao-Bo
    Wang, Jun
    Duan, Zhong-Jing
    [J]. Kongzhi yu Juece/Control and Decision, 2022, 37 (02): : 278 - 292
  • [10] Robot Bionic Eye Motion Posture Control System
    Zhang, Hongxin
    Lee, Suan
    [J]. ELECTRONICS, 2023, 12 (03)