Improving Model-Based Deep Reinforcement Learning with Learning Degree Networks and Its Application in Robot Control

被引:1
|
作者
Ma, Guoqing [1 ]
Wang, Zhifu [2 ]
Yuan, Xianfeng [1 ]
Zhou, Fengyu [2 ]
机构
[1] Shandong Univ, Sch Mech, Elect & Informat Engn, Weihai 264209, Peoples R China
[2] Shandong Univ, Control Sci & Engn, Jinan 250061, Peoples R China
基金
中国国家自然科学基金;
关键词
Compendex;
D O I
10.1155/2022/7169594
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep reinforcement learning is the technology of artificial neural networks in the field of decision-making and control. The traditional model-free reinforcement learning algorithm requires a large amount of environment interactive data to iterate the algorithm. This model's performance also suffers due to low utilization of training data, while the model-based reinforcement learning (MBRL) algorithm improves the efficiency of the data, MBRL locks into low prediction accuracy. Although MBRL can utilize the additional data generated by the dynamic model, a system dynamics model with low prediction accuracy will provide low-quality data and affect the algorithm's final result. In this paper, based on the A3C (Asynchronous Advantage Actor-Critic) algorithm, an improved model-based deep reinforcement learning algorithm using a learning degree network (MBRL-LDN) is presented. By comparing the differences between the predicted states outputted by the proposed multidynamic model and the original predicted states, the learning degree of the system dynamics model is calculated. The learning degree represents the quality of the data generated by the dynamic model and is used to decide whether to continue to interact with the dynamic model during a particular episode. Thus, low-quality data will be discarded. The superiority of the proposed method is verified by conducting extensive contrast experiments.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Multiple model-based reinforcement learning for nonlinear control
    Samejima, K
    Katagiri, K
    Doya, K
    Kawato, M
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (09): : 54 - 69
  • [32] Offline Model-Based Reinforcement Learning for Tokamak Control
    Char, Ian
    Abbate, Joseph
    Bardoczi, Laszlo
    Boyer, Mark D.
    Chung, Youngseog
    Conlin, Rory
    Erickson, Keith
    Mehta, Viraj
    Richner, Nathan
    Kolemen, Egemen
    Schneider, Jeff
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [33] Control Approach Combining Reinforcement Learning and Model-Based Control
    Okawa, Yoshihiro
    Sasaki, Tomotake
    Iwane, Hidenao
    [J]. 2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 1419 - 1424
  • [34] Comparison of Deep Reinforcement Learning Algorithms in a Robot Manipulator Control Application
    Chu, Chang
    Takahashi, Kazuhiko
    Hashimoto, Masafumi
    [J]. 2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 284 - 287
  • [35] Model-based deep reinforcement learning for wind energy bidding
    Sanayha, Manassakan
    Vateekul, Peerapon
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 136
  • [36] Knowledge Transfer using Model-Based Deep Reinforcement Learning
    Boloka, Tlou
    Makondo, Ndivhuwo
    Rosman, Benjamin
    [J]. 2021 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2021,
  • [37] The bottleneck simulator: A model-based deep reinforcement learning approach
    Serban, Iulian Vlad
    Sankar, Chinnadhurai
    Pieper, Michael
    Pineau, Joelle
    Bengio, Yoshua
    [J]. Journal of Artificial Intelligence Research, 2020, 69 : 571 - 612
  • [38] Deep Reinforcement Learning with Model-based Acceleration for Hyperparameter Optimization
    Chen, SenPeng
    Wu, Jia
    Chen, XiuYun
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 170 - 177
  • [39] SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
    Zhang, Marvin
    Vikram, Sharad
    Smith, Laura
    Abbeel, Pieter
    Johnson, Matthew J.
    Levine, Sergey
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [40] The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach
    Serban, Iulian Vlad
    Sankar, Chinnadhurai
    Pieper, Michael
    Pineau, Joelle
    Bengio, Yoshua
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2020, 69 : 571 - 612