Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation

被引:0
|
作者
Nikishin, Evgenii [1 ]
Abachi, Romina [2 ]
Agarwal, Rishabh [1 ,3 ]
Bacon, Pierre-Luc [1 ,4 ]
机构
[1] Univ Montreal, Mila, Montreal, PQ, Canada
[2] Univ Toronto, Vector Inst, Toronto, ON, Canada
[3] Google Res, Mountain View, CA USA
[4] Facebook CIFAR AI Chair, Montreal, PQ, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The shortcomings of maximum likelihood estimation in the context of model-based reinforcement learning have been highlighted by an increasing number of papers. When the model class is misspecified or has a limited representational capacity, model parameters with high likelihood might not necessarily result in high performance of the agent on a downstream control task. To alleviate this problem, we propose an end-to-end approach for model learning which directly optimizes the expected returns using implicit differentiation. We treat a value function that satisfies the Bellman optimality operator induced by the model as an implicit function of model parameters and show how to differentiate the function. We provide theoretical and empirical evidence highlighting the benefits of our approach in the model misspecification regime compared to likelihood-based methods.
引用
收藏
页码:7886 / 7894
页数:9
相关论文
共 50 条
  • [31] Sample-efficient model-based reinforcement learning for quantum control
    Khalid, Irtaza
    Weidner, Carrie A.
    Jonckheere, Edmond A.
    Schirmer, Sophie G.
    Langbein, Frank C.
    [J]. PHYSICAL REVIEW RESEARCH, 2023, 5 (04):
  • [32] Model-Based Reinforcement Learning for Optimal Feedback Control of Switched Systems
    Greene, Max L.
    Abudia, Moad
    Kamalapurkar, Rushikesh
    Dixon, Warren E.
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 162 - 167
  • [33] Model-Based Reinforcement Learning Control of Electrohydraulic Position Servo Systems
    Yao, Zhikai
    Liang, Xianglong
    Jiang, Guo-Ping
    Yao, Jianyong
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, 28 (03) : 1446 - 1455
  • [34] Control of Magnetic Surgical Robots With Model-Based Simulators and Reinforcement Learning
    Barnoy, Yotam
    Erin, Onder
    Raval, Suraj
    Pryor, Will
    Mair, Lamar O.
    Weinberg, Irving N.
    Diaz-Mercado, Yancy
    Krieger, Axel
    Hager, Gregory D.
    [J]. IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2022, 4 (04): : 945 - 956
  • [35] Model-Based Cross-Scale Reinforcement Learning Optimal Control
    School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
    不详
    [J]. Int. Conf. Electron. Eng. Informatics, EEI, (906-910):
  • [36] Model-Based Graph Reinforcement Learning for Inductive Traffic Signal Control
    Devailly, Francois-Xavier
    Larocque, Denis
    Charlin, Laurent
    [J]. IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 5 : 238 - 250
  • [37] Robust Model-Based Reinforcement Learning Control of a Batch Crystallization Process
    Benyahia, B.
    Anandan, P. D.
    Rielly, C.
    [J]. 2021 9TH INTERNATIONAL CONFERENCE ON SYSTEMS AND CONTROL (ICSC'21), 2021, : 89 - 94
  • [38] Control-Oriented Engine Thermal Model
    Corti, Enrico
    Abbondanza, Marco
    Ravaglioli, Vittorio
    Taccioli, Michele
    [J]. ATI 2018 - 73RD CONFERENCE OF THE ITALIAN THERMAL MACHINES ENGINEERING ASSOCIATION, 2018, 148 : 766 - 773
  • [39] Delay-aware model-based reinforcement learning for continuous control
    Chen, Baiming
    Xu, Mengdi
    Li, Liang
    Zhao, Ding
    [J]. NEUROCOMPUTING, 2021, 450 : 119 - 128
  • [40] Model-based graph reinforcement learning for inductive traffic signal control
    Devailly, François-Xavier
    Larocque, Denis
    Charlin, Laurent
    [J]. arXiv, 2022,