Multiple model-based reinforcement learning for nonlinear control

Cited by: 2
Authors
Samejima, K [1]
Katagiri, K
Doya, K
Kawato, M
Affiliations
[1] Japan Sci & Technol Corp, Kyoto 6190288, Japan
[2] Nara Inst Sci & Technol, Ikoma, Japan
[3] ATR Int, Kyoto 6190288, Japan
[4] ATR Human Informat Proc Res Labs, Kyoto 6190288, Japan
Keywords
module partition; model-based reinforcement learning; nonlinear control; linear quadratic controller
DOI
10.1002/ecjc.20266
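Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract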
This paper proposes a reinforcement learning scheme using multiple prediction models (multiple model-based reinforcement learning, MMRL). MMRL prepares multiple pairs, consisting of the prediction model used to predict the future state of the control object and the reinforcement learning controller used to learn the control output. Using a soft-max function of the prediction error of each prediction model, the "responsibility signal" is calculated, which takes a larger value for the module with a more accurate prediction. By weighting the learning and the control output of each module by means of the responsibility signal, modules to deal with various situations are formed. In order to achieve a robust modular structure of MMRL without a priori knowledge, such as the number of modules and the region to be covered, a prior responsibility signal is formulated, assuming spatial and temporal continuity. As a method for efficient implementation of MMRL, an optimal controller (MLQC) based on multiple linear prediction and quadratic reward models is formulated. In order to verify the performance of MLQC, a simulation was performed on the swing-up of a single pendulum. It was shown that the linear prediction model and the corresponding controller were acquired by learning for the range near the suspended point and upright point of the single pendulum. The task can be learned in a shorter time than in the conventional method, and it is possible to handle redundancy of modules. (c) 2006 Wiley Periodicals, Inc.
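The responsibility signal described in the abstract can be illustrated with a minimal sketch. The abstract states only that it is a soft-max function of each module's prediction error; the Gaussian form (squared error scaled by a width `sigma`), the optional `priors` argument, and the function names `responsibility_signals` and `blended_control` are assumptions made here for illustration, not details taken from the paper.

```python
import math


def responsibility_signals(errors, sigma=1.0, priors=None):
    """Soft-max over negative scaled squared prediction errors.

    errors : one prediction error per module (hypothetical scalar errors)
    sigma  : assumed width parameter controlling how sharply modules compete
    priors : optional strictly positive prior responsibilities (assumed form)
    """
    n = len(errors)
    if priors is None:
        priors = [1.0 / n] * n  # uniform prior when none is supplied
    # Log-weight of each module: log prior minus scaled squared error.
    logits = [math.log(p) - (e * e) / (2.0 * sigma ** 2)
              for p, e in zip(priors, errors)]
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(logits)
    weights = [math.exp(v - top) for v in logits]
    total = sum(weights)
    return [w / total for w in weights]


def blended_control(responsibilities, module_outputs):
    """Weight each module's control output by its responsibility signal."""
    return sum(r * u for r, u in zip(responsibilities, module_outputs))


# Three hypothetical modules; the first predicts the next state best.
errors = [0.1, 0.5, 2.0]
lam = responsibility_signals(errors, sigma=0.5)
u = blended_control(lam, [1.0, -1.0, 0.0])
```

In this sketch the module with the smallest prediction error receives the largest responsibility, so its controller dominates the blended output; a smaller `sigma` makes the selection sharper.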
Pages: 54-69
Page count: 16