Adaptive Risk Sensitive Path Integral for Model Predictive Control via Reinforcement Learning

被引:0
|
作者
Yoon, Hyung-Jin [2 ]
Tao, Chuyuan [1 ]
Kim, Hunmin [3 ]
Hovakimyan, Naira [1 ]
Voulgaris, Petros [2 ]
机构
[1] Univ Illinois, Dept Mech Sci & Engn, Champaign, IL USA
[2] Univ Nevada, Dept Mech Engn, Reno, NV 89557 USA
[3] Mercer Univ, Dept Elect & Comp Engn, Macon, GA USA
关键词
D O I
10.1109/MED59994.2023.10185876
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a reinforcement learning framework where an agent uses an internal nominal model for stochastic model predictive control (MPC) while compensating for a disturbance. Our work builds on the existing risk-aware optimal control with stochastic differential equations (SDEs) that aims to deal with such disturbance. However, the risk sensitivity and the noise strength of the nominal SDE in the risk-aware optimal control are often heuristically chosen. In the proposed framework, the risk-taking policy determines the behavior of the MPC to be risk-seeking (exploration) or risk-averse (exploitation). Specifically, we employ the risk-aware path integral control that can be implemented as a Monte-Carlo (MC) sampling with fast parallel simulations using a GPU. The MC sampling implementations of the MPC have been successful in robotic applications due to their real-time computation capability. The proposed framework that adapts the noise model and the risk sensitivity outperforms the standard model predictive path integral in simulation environments that have disturbances.
引用
收藏
页码:926 / 931
页数:6
相关论文
共 50 条
  • [31] Safe Reinforcement Learning via Statistical Model Predictive Shielding
    Bastani, Osbert
    Li, Shuo
    Xu, Anton
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [32] Adaptive predictive control of a differential drive robot tuned with reinforcement learning
    Jardine, P. Travis
    Kogan, Michael
    Givigi, Sidney N.
    Yousefi, Shahram
    [J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2019, 33 (02) : 410 - 423
  • [33] Adaptive Nonlinear Model Predictive Control of an Engine Air Path
    Meier, Max Haugeneder Florian
    Adelberger, Daniel
    del Re, Luigi
    [J]. IFAC PAPERSONLINE, 2021, 54 (10): : 114 - 119
  • [34] Model Predictive Path Integral Control: From Theory to Parallel Computation
    Williams, Grady
    Aldrich, Andrew
    Theodorou, Evangelos A.
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2017, 40 (02) : 344 - 357
  • [35] Risk sensitive reinforcement learning
    Neuneier, R
    Mihatsch, O
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 1031 - 1037
  • [36] Robust Model Predictive Path Integral Control: Analysis and Performance Guarantees
    Gandhi, Manan S.
    Vlahov, Bogdan
    Gibson, Jason
    Williams, Grady
    Theodorou, Evangelos A.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 1423 - 1430
  • [37] Model predictive path integral method for nonlinear random vibration control
    Guo K.
    Jiang J.
    Xu Y.
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2020, 47 (04): : 48 - 54
  • [38] Distributed adaptive cooperative optimal output regulation via integral reinforcement learning
    Lin, Liquan
    Huang, Jie
    [J]. AUTOMATICA, 2024, 170
  • [39] Reinforcement Learning versus Model Predictive Control on greenhouse climate control
    Morcego, Bernardo
    Yin, Wenjie
    Boersma, Sjoerd
    van Henten, Eldert
    Puig, Vicenc
    Sun, Congcong
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 215
  • [40] Adaptive Smoothing for Path Integral Control
    Thalmeier, Dominik
    Kappen, Hilbert J.
    Totaro, Simone
    Gomez, Vicenc
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21