Connecting stochastic optimal control and reinforcement learning

被引:1
|
作者
Quer, J. [1 ]
Borrell, Enric Ribera [1 ,2 ]
机构
[1] Free Univ Berlin, Inst Math, D-14195 Berlin, Germany
[2] Zuse Inst Berlin, D-14195 Berlin, Germany
关键词
PARTIAL-DIFFERENTIAL-EQUATIONS; ALGORITHMS; SIMULATION;
D O I
10.1063/5.0140665
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Reinforcement learning for optimal control of stochastic nonlinear systems
    Zhu, Xinji
    Wang, Yujia
    Wu, Zhe
    AICHE JOURNAL, 2025,
  • [2] Stochastic optimal well control in subsurface reservoirs using reinforcement learning
    Dixit, Atish
    ElSheikh, Ahmed H.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [3] Constrained Reinforcement Learning for Stochastic Dynamic Optimal Power Flow Control
    Wu, Tong
    Scaglione, Anna
    2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
  • [4] Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems
    Pang, Bo
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (04) : 2383 - 2390
  • [5] Stochastic Linear Quadratic Optimal Control Problem: A Reinforcement Learning Method
    Li, Na
    Li, Xun
    Peng, Jing
    Xu, Zuo Quan
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (09) : 5009 - 5016
  • [6] Reinforcement Learning for Decentralized Stochastic Control
    Yongacoglu, Bora
    Arslan, Gurdal
    Yuksel, Serdar
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5556 - 5561
  • [7] Connecting planning horizons in mining complexes with reinforcement learning and stochastic programming
    Levinson, Zachary
    Dimitrakopoulos, Roussos
    RESOURCES POLICY, 2023, 86
  • [8] A reinforcement learning-based scheme for adaptive optimal control of linear stochastic systems
    Wong, Wee Chin
    Lee, Jay H.
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 57 - 62
  • [9] Optimal Greedy Control in Reinforcement Learning
    Gorobtsov, Alexander
    Sychev, Oleg
    Orlova, Yulia
    Smirnov, Evgeniy
    Grigoreva, Olga
    Bochkin, Alexander
    Andreeva, Marina
    SENSORS, 2022, 22 (22)
  • [10] Reinforcement Learning Informed by Optimal Control
    Onnheim, Magnus
    Andersson, Pontus
    Gustavsson, Emil
    Jirstrand, Mats
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 403 - 407