Connecting stochastic optimal control and reinforcement learning

被引:1
|
作者
Quer, J. [1 ]
Borrell, Enric Ribera [1 ,2 ]
机构
[1] Free Univ Berlin, Inst Math, D-14195 Berlin, Germany
[2] Zuse Inst Berlin, D-14195 Berlin, Germany
关键词
PARTIAL-DIFFERENTIAL-EQUATIONS; ALGORITHMS; SIMULATION;
D O I
10.1063/5.0140665
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Reinforcement Learning for Model Problems of Optimal Control
    S. S. Semenov
    V. I. Tsurkov
    Journal of Computer and Systems Sciences International, 2023, 62 : 508 - 521
  • [22] Reinforcement Learning of Impedance Control In Stochastic Force Fields
    Stulp, Freek
    Buchli, Jonas
    Ellmer, Alice
    Mistry, Michael
    Theodorou, Evangelos
    Schaal, Stefan
    2011 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING (ICDL), 2011,
  • [23] Hierarchical control for stochastic network traffic with reinforcement learning
    Su, Z. C.
    Chow, Andy H. F.
    Fang, C. L.
    Liang, E. M.
    Zhong, R. X.
    TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2023, 167 : 196 - 216
  • [24] Adaptive Optimal Control of Networked Nonlinear Systems With Stochastic Sensor and Actuator Dropouts Based on Reinforcement Learning
    Jiang, Yi
    Liu, Lu
    Feng, Gang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3107 - 3120
  • [25] Constrained Reinforcement Learning for Predictive Control in Real-Time Stochastic Dynamic Optimal Power Flow
    Wu, Tong
    Scaglione, Anna
    Arnold, Daniel
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (03) : 5077 - 5090
  • [26] Optimal Control of Stochastic Markovian Jump Systems With Wiener and Poisson Noises: Two Reinforcement Learning Approaches
    Yan, Zhiguo
    Sun, Tingkun
    Hu, Guolin
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6591 - 6600
  • [27] Stochastic Reinforcement Learning
    Kuang, Nikki Lijing
    Leung, Clement H. C.
    Sung, Vienne W. K.
    2018 IEEE FIRST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2018, : 244 - 248
  • [28] Nonlinear Optimal Control Using Deep Reinforcement Learning
    Bucci, Michele Alessandro
    Semeraro, Onofrio
    Allauzen, Alexandre
    Cordier, Laurent
    Mathelin, Lionel
    IUTAM LAMINAR-TURBULENT TRANSITION, 2022, 38 : 279 - 290
  • [29] Optimal and Autonomous Control Using Reinforcement Learning: A Survey
    Kiumarsi, Bahare
    Vamvoudakis, Kyriakos G.
    Modares, Hamidreza
    Lewis, Frank L.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2042 - 2062
  • [30] Reinforcement learning for optimal control of low exergy buildings
    Yang, Lei
    Nagy, Zoltan
    Goffin, Philippe
    Schlueter, Arno
    APPLIED ENERGY, 2015, 156 : 577 - 586