Connecting stochastic optimal control and reinforcement learning

被引:1
|
作者
Quer, J. [1 ]
Borrell, Enric Ribera [1 ,2 ]
机构
[1] Free Univ Berlin, Inst Math, D-14195 Berlin, Germany
[2] Zuse Inst Berlin, D-14195 Berlin, Germany
关键词
PARTIAL-DIFFERENTIAL-EQUATIONS; ALGORITHMS; SIMULATION;
D O I
10.1063/5.0140665
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Intelligent Farm Based on Deep Reinforcement Learning for optimal control
    Yassine, Hanafi M.
    Roufaida, Khebbache
    Shkodyrev, Viacheslav P.
    Abdelhak, Merizig
    Zarour, Lotfi
    Khaled, Rezeg
    2022 INTERNATIONAL SYMPOSIUM ON INNOVATIVE INFORMATICS OF BISKRA, ISNIB, 2022, : 106 - 111
  • [42] Implementing Traffic Signal Optimal Control by Multiagent Reinforcement Learning
    Song, Jiong
    Jin, Zhao
    Zhu, WenJun
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2578 - 2582
  • [43] Optimal Containment Control of Multiple Quadrotors via Reinforcement Learning
    Cheng, Ming
    Liu, Hao
    Liu, Deyuan
    Gu, Haibo
    Wang, Xiangke
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2632 - 2637
  • [44] Reinforcement learning robust optimal control for spacecraft attitude stabilization
    Xiao B.
    Zhang H.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (01):
  • [45] A tutorial on optimal control and reinforcement learning methods for quantum technologies
    Giannelli, Luigi
    Sgroi, Sofia
    Brown, Jonathon
    Paraoanu, Gheorghe Sorin
    Paternostro, Mauro
    Paladino, Elisabetta
    Falci, Giuseppe
    PHYSICS LETTERS A, 2022, 434
  • [46] Reinforcement Learning and Optimal Control of PMSM Speed Servo System
    Zhao, Jianguo
    Yang, Chunyu
    Gao, Weinan
    Zhou, Linna
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (08) : 8305 - 8313
  • [47] Fixed-time reinforcement learning and optimal control design
    Niroomand, Mahdi
    Moghaddam, Reihaneh Kardehi
    Modares, Hamidreza
    Sistani, Mohammad-Bagher
    JOURNAL OF VIBRATION AND CONTROL, 2024,
  • [48] Optimal Routing Control of a Construction Machine by Deep Reinforcement Learning
    Sun, Zeyuan
    Nakatani, Masayuki
    Uchimura, Yutaka
    2018 IEEE 15TH INTERNATIONAL WORKSHOP ON ADVANCED MOTION CONTROL (AMC), 2018, : 187 - 192
  • [49] Reinforcement Learning for Optimal Primary Frequency Control: A Lyapunov Approach
    Cui, Wenqi
    Jiang, Yan
    Zhang, Baosen
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (02) : 1676 - 1688
  • [50] Constrained adaptive optimal control using a reinforcement learning agent
    Lin, Wei-Song
    Zheng, Chen-Hong
    AUTOMATICA, 2012, 48 (10) : 2614 - 2619