Improving Model-Based Deep Reinforcement Learning with Learning Degree Networks and Its Application in Robot Control

被引:1
|
作者
Ma, Guoqing [1 ]
Wang, Zhifu [2 ]
Yuan, Xianfeng [1 ]
Zhou, Fengyu [2 ]
机构
[1] Shandong Univ, Sch Mech, Elect & Informat Engn, Weihai 264209, Peoples R China
[2] Shandong Univ, Control Sci & Engn, Jinan 250061, Peoples R China
基金
中国国家自然科学基金;
关键词
Compendex;
D O I
10.1155/2022/7169594
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep reinforcement learning is the technology of artificial neural networks in the field of decision-making and control. The traditional model-free reinforcement learning algorithm requires a large amount of environment interactive data to iterate the algorithm. This model's performance also suffers due to low utilization of training data, while the model-based reinforcement learning (MBRL) algorithm improves the efficiency of the data, MBRL locks into low prediction accuracy. Although MBRL can utilize the additional data generated by the dynamic model, a system dynamics model with low prediction accuracy will provide low-quality data and affect the algorithm's final result. In this paper, based on the A3C (Asynchronous Advantage Actor-Critic) algorithm, an improved model-based deep reinforcement learning algorithm using a learning degree network (MBRL-LDN) is presented. By comparing the differences between the predicted states outputted by the proposed multidynamic model and the original predicted states, the learning degree of the system dynamics model is calculated. The learning degree represents the quality of the data generated by the dynamic model and is used to decide whether to continue to interact with the dynamic model during a particular episode. Thus, low-quality data will be discarded. The superiority of the proposed method is verified by conducting extensive contrast experiments.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Model-Based Reinforcement Learning For Robot Control
    Li, Xiang
    Shang, Weiwei
    Cong, Shuang
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 300 - 305
  • [2] Model-based reinforcement learning with model error and its application
    Tajima, Yoshiyuki
    Onisawa, Takehisa
    [J]. PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 1333 - 1336
  • [3] Learning to Paint With Model-based Deep Reinforcement Learning
    Huang, Zhewei
    Heng, Wen
    Zhou, Shuchang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8708 - 8717
  • [4] DATA-EFFICIENT MODEL-BASED REINFORCEMENT LEARNING FOR ROBOT CONTROL
    Sun, Ming
    Gao, Yue
    Liu, Wei
    Li, Shaoyuan
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2021, 36 (04): : 211 - 218
  • [5] Hybrid LMC: Hybrid Learning and Model-based Control for Wheeled Humanoid Robot via Ensemble Deep Reinforcement Learning
    Baek, Donghoon
    Purushottam, Amartya
    Ramos, Joao
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9347 - 9354
  • [6] Calibrated Model-Based Deep Reinforcement Learning
    Malik, Ali
    Kuleshov, Volodymyr
    Song, Jiaming
    Nemer, Danny
    Seymour, Harlan
    Ermon, Stefano
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] Safe Robot Execution in Model-Based Reinforcement Learning
    Martinez, David
    Alenya, Guillem
    Torras, Carme
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 6422 - 6427
  • [8] A Composite Control Strategy for Quadruped Robot by Integrating Reinforcement Learning and Model-Based Control
    Lyu, Shangke
    Zhao, Han
    Wang, Donglin
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 751 - 758
  • [9] Study on force control for robot massage with a model-based reinforcement learning algorithm
    Meng Xiao
    Tie Zhang
    Yanbiao Zou
    Xiaohu Yan
    Wen Wu
    [J]. Intelligent Service Robotics, 2023, 16 : 509 - 519
  • [10] Nonholonomic Yaw Control of an Underactuated Flying Robot With Model-Based Reinforcement Learning
    Lambert, Nathan O.
    Schindler, Craig B.
    Drew, Daniel S.
    Pister, Kristofer S. J.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 455 - 461