Multiple model-based reinforcement learning for nonlinear control

被引:2
|
作者
Samejima, K [1 ]
Katagiri, K
Doya, K
Kawato, M
机构
[1] Japan Sci & Technol Corp, Kyoto 6190288, Japan
[2] Nara Inst Sci & Technol, Ikoma, Japan
[3] ATR Int, Kyoto 6190288, Japan
[4] ATR Human Informat Proc Res Labs, Kyoto 6190288, Japan
关键词
module partition; model-based reinforcement learning; nonlinear control; linear quadratic controller;
D O I
10.1002/ecjc.20266
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a reinforcement learning scheme using multiple prediction models (multiple model-based reinforcement learning, MMRL). MMRL prepares multiple pairs, consisting of the prediction model used to predict the future state of the control object and the reinforcement learning controller used to learn the control output. Using a soft-max function of the prediction error of each prediction model, the "responsibility signal" is calculated, which takes a larger value for the module with a more accurate prediction. By weighting the learning and the control output of each module by means of the responsibility signal, modules to deal with various situations are formed. In order to achieve a robust modular structure of MMRL without a priori knowledge, such as (lie number of modules and the region to be covered, a prior responsibility signal is formulated, assuming spatial and temporal continuity. As a method for efficient implementation of MMRL, an optimal controller (MLQC) based on multiple linear prediction and quadratic reward models is formulated. In order to verify the performance of MLQC, a simulation was performed on the swing-up of a single pendulum. It was shown that the linear prediction model and the corresponding controller were acquired by learning for the range near the suspended point and upright point of the single pendulum. The task can be learned in a shorter time than in the conventional method, and it is possible to handle redundancy of modules. (c) 2006 Wiley Periodicals, Inc.
引用
收藏
页码:54 / 69
页数:16
相关论文
共 50 条
  • [1] Multiple model-based reinforcement learning
    Doya, K
    Samejima, K
    Katagiri, K
    Kawato, M
    [J]. NEURAL COMPUTATION, 2002, 14 (06) : 1347 - 1369
  • [2] Model-Based Reinforcement Learning For Robot Control
    Li, Xiang
    Shang, Weiwei
    Cong, Shuang
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 300 - 305
  • [3] Model-based reinforcement learning for nonlinear optimal control with practical asymptotic stability guarantees
    Kim, Yeonsoo
    Lee, Jong Min
    [J]. AICHE JOURNAL, 2020, 66 (10)
  • [4] Safe control of nonlinear systems in LPV framework using model-based reinforcement learning
    Bao, Yajie
    Velni, Javad Mohammadpour
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (04) : 1078 - 1089
  • [5] Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints
    Kim, Yeonsoo
    Kim, Jong Woo
    [J]. AICHE JOURNAL, 2022, 68 (05)
  • [6] Control Approach Combining Reinforcement Learning and Model-Based Control
    Okawa, Yoshihiro
    Sasaki, Tomotake
    Iwane, Hidenao
    [J]. 2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 1419 - 1424
  • [7] Efficient reinforcement learning: Model-based acrobot control
    Boone, G
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION - PROCEEDINGS, VOLS 1-4, 1997, : 229 - 234
  • [8] Offline Model-Based Reinforcement Learning for Tokamak Control
    Char, Ian
    Abbate, Joseph
    Bardoczi, Laszlo
    Boyer, Mark D.
    Chung, Youngseog
    Conlin, Rory
    Erickson, Keith
    Mehta, Viraj
    Richner, Nathan
    Kolemen, Egemen
    Schneider, Jeff
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [9] Multiple-Timescale PIA for Model-Based Reinforcement Learning
    Yamaguchi, Tomohiro
    Imatani, Eri
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2009, 13 (06) : 658 - 666
  • [10] Model-based reinforcement learning for output-feedback optimal control of a class of nonlinear systems
    Self, Ryan
    Harlan, Michael
    Kamalapurkar, Rushikesh
    [J]. 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 2378 - 2383