Variational Optimization Based Reinforcement Learning for Infinite Dimensional Stochastic Systems

被引：0

作者：

Evans, Ethan N. ^{[1
]}

Periera, Marcus A. ^{[2
]}

Boutselis, George I. ^{[1
]}

Theodorou, Evangelos A. ^{[1
]}

机构：

[1] Georgia Inst Technol, Dept Aerosp Engn, Atlanta, GA 30332 USA

[2] Georgia Inst Technol, Inst Robot & Intelligent Machines, Atlanta, GA 30332 USA

来源：

CONFERENCE ON ROBOT LEARNING, VOL 100 | 2019年 / 100卷

关键词：

Reinforcement Learning; Planning and Control;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Systems involving Partial Differential Equations (PDEs) have recently become more popular among the machine learning community. However prior methods usually treat infinite dimensional problems in finite dimensions with Reduced Order Models. This leads to committing to specific approximation schemes and subsequent derivation of control laws. Additionally, prior work does not consider spatio-temporal descriptions of noise that realistically represent the stochastic nature of physical systems. In this paper we suggest a new reinforcement learning framework that is mostly model-free for Stochastic PDEs with additive spacetime noise, based on variational optimization in infinite dimensions. In addition, our algorithm incorporates sparse representations that allow for efficient learning of feedback policies in high dimensions. We demonstrate the efficacy of the proposed approach with several simulated experiments on a variety of SPDEs.

引用

页数：16

共 50 条

[1] Stochastic optimization of multireservoir systems via reinforcement learning
Lee, Jin-Hee
Labadie, John W.
[J]. WATER RESOURCES RESEARCH, 2007, 43 (11)
[2] Stochastic variational inequalities in infinite dimensional spaces
Bensoussan, A
Rascanu, A
[J]. NUMERICAL FUNCTIONAL ANALYSIS AND OPTIMIZATION, 1997, 18 (1-2) : 19 - 54
[3] Stability radii of infinite dimensional systems with stochastic uncertainty and their optimization
Kada, M.
Rebiai, S. E.
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2006, 16 (17) : 819 - 841
[4] Infinite-dimensional optimization and Bayesian nonparametric learning of stochastic differential equations
Ganguly, Arnab
Mitra, Riten
Zhou, Jinpu
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[5] Reinforcement learning for POMDPs based on action values and stochastic optimization
Perkins, TJ
[J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 199 - 204
[6] Controller Optimization for Multirate Systems Based on Reinforcement Learning
Zhan Li
Sheng-Ri Xue
Xing-Hu Yu
Hui-Jun Gao
[J]. International Journal of Automation and Computing, 2020, 17 : 417 - 427
[7] Controller Optimization for Multirate Systems Based on Reinforcement Learning
Zhan Li
Sheng-Ri Xue
Xing-Hu Yu
Hui-Jun Gao
[J]. International Journal of Automation and Computing, 2020, 17 (03) : 417 - 427
[8] Controller Optimization for Multirate Systems Based on Reinforcement Learning
Li, Zhan
Xue, Sheng-Ri
Yu, Xing-Hu
Gao, Hui-Jun
[J]. INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2020, 17 (03) : 417 - 427
[9] Selective maintenance optimization with stochastic break duration based on reinforcement learning
Liu, Yilai
Qian, Xinbo
[J]. EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2022, 24 (04): : 771 - 784
[10] Variational quantum reinforcement learning via evolutionary optimization
Chen, Samuel Yen-Chi
Huang, Chih-Min
Hsing, Chia-Wei
Goan, Hsi-Sheng
Kao, Ying-Jer
[J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):

← 1 2 3 4 5 →