Variational Optimization Based Reinforcement Learning for Infinite Dimensional Stochastic Systems

Citations: 0
Authors
Evans, Ethan N. [1 ]
Pereira, Marcus A. [2 ]
Boutselis, George I. [1 ]
Theodorou, Evangelos A. [1 ]
Affiliations
[1] Georgia Inst Technol, Dept Aerosp Engn, Atlanta, GA 30332 USA
[2] Georgia Inst Technol, Inst Robot & Intelligent Machines, Atlanta, GA 30332 USA
Source
Keywords
Reinforcement Learning; Planning and Control
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Systems involving Partial Differential Equations (PDEs) have recently become more popular among the machine learning community. However, prior methods usually treat infinite dimensional problems in finite dimensions with Reduced Order Models. This requires committing to a specific approximation scheme before deriving control laws. Additionally, prior work does not consider spatio-temporal descriptions of noise that realistically represent the stochastic nature of physical systems. In this paper, we propose a new reinforcement learning framework that is mostly model-free for Stochastic PDEs (SPDEs) with additive spacetime noise, based on variational optimization in infinite dimensions. In addition, our algorithm incorporates sparse representations that allow for efficient learning of feedback policies in high dimensions. We demonstrate the efficacy of the proposed approach with several simulated experiments on a variety of SPDEs.
Pages: 16
Related papers
50 in total
  • [1] Stochastic optimization of multireservoir systems via reinforcement learning
    Lee, Jin-Hee
    Labadie, John W.
    [J]. WATER RESOURCES RESEARCH, 2007, 43 (11)
  • [2] Stochastic variational inequalities in infinite dimensional spaces
    Bensoussan, A
    Rascanu, A
    [J]. NUMERICAL FUNCTIONAL ANALYSIS AND OPTIMIZATION, 1997, 18 (1-2) : 19 - 54
  • [3] Stability radii of infinite dimensional systems with stochastic uncertainty and their optimization
    Kada, M.
    Rebiai, S. E.
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2006, 16 (17) : 819 - 841
  • [4] Infinite-dimensional optimization and Bayesian nonparametric learning of stochastic differential equations
    Ganguly, Arnab
    Mitra, Riten
    Zhou, Jinpu
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [5] Reinforcement learning for POMDPs based on action values and stochastic optimization
    Perkins, TJ
    [J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002 : 199 - 204
  • [6] Controller Optimization for Multirate Systems Based on Reinforcement Learning
    Li, Zhan
    Xue, Sheng-Ri
    Yu, Xing-Hu
    Gao, Hui-Jun
    [J]. INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2020, 17 (03) : 417 - 427
  • [7] Selective maintenance optimization with stochastic break duration based on reinforcement learning
    Liu, Yilai
    Qian, Xinbo
    [J]. EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2022, 24 (04) : 771 - 784
  • [8] Variational quantum reinforcement learning via evolutionary optimization
    Chen, Samuel Yen-Chi
    Huang, Chih-Min
    Hsing, Chia-Wei
    Goan, Hsi-Sheng
    Kao, Ying-Jer
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01)