Calibrated Model-Based Deep Reinforcement Learning

被引:0
|
作者
Malik, Ali [1 ]
Kuleshov, Volodymyr [1 ,2 ]
Song, Jiaming [1 ]
Nemer, Danny [2 ]
Seymour, Harlan [2 ]
Ermon, Stefano [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Afresh Technol, San Francisco, CA 94107 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimates of predictive uncertainty are important for accurate model-based planning and reinforcement learning. However, predictive uncertainties - especially ones derived from modern deep learning systems - can be inaccurate and impose a bottleneck on performance. This paper explores which uncertainties are needed for model-based reinforcement learning and argues that good uncertainties must be calibrated, i.e. their probabilities should match empirical frequencies of predicted events. We describe a simple way to augment any model-based reinforcement learning agent with a calibrated model and show that doing so consistently improves planning, sample complexity, and exploration. On the HALFCHEETAH MuJoCo task, our system achieves state-of-the-art performance using 50% fewer samples than the current leading approach. Our findings suggest that calibration can improve the performance of model-based reinforcement learning with minimal computational and implementation overhead.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Learning to Paint With Model-based Deep Reinforcement Learning
    Huang, Zhewei
    Heng, Wen
    Zhou, Shuchang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8708 - 8717
  • [2] Knowledge Transfer using Model-Based Deep Reinforcement Learning
    Boloka, Tlou
    Makondo, Ndivhuwo
    Rosman, Benjamin
    [J]. 2021 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2021,
  • [3] Model-based deep reinforcement learning for wind energy bidding
    Sanayha, Manassakan
    Vateekul, Peerapon
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 136
  • [4] Deep Reinforcement Learning with Model-based Acceleration for Hyperparameter Optimization
    Chen, SenPeng
    Wu, Jia
    Chen, XiuYun
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 170 - 177
  • [5] The bottleneck simulator: A model-based deep reinforcement learning approach
    Serban, Iulian Vlad
    Sankar, Chinnadhurai
    Pieper, Michael
    Pineau, Joelle
    Bengio, Yoshua
    [J]. Journal of Artificial Intelligence Research, 2020, 69 : 571 - 612
  • [6] SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
    Zhang, Marvin
    Vikram, Sharad
    Smith, Laura
    Abbeel, Pieter
    Johnson, Matthew J.
    Levine, Sergey
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach
    Serban, Iulian Vlad
    Sankar, Chinnadhurai
    Pieper, Michael
    Pineau, Joelle
    Bengio, Yoshua
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2020, 69 : 571 - 612
  • [8] Cloud Reasoning Model-based Exploration for Deep Reinforcement Learning
    Li Chenxi
    Cao Lei
    Chen Xiliang
    Zhang Yongliang
    Xu Zhixiong
    Peng Hui
    Duan Liwen
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (01) : 244 - 248
  • [9] Model-based deep reinforcement learning for accelerated learning from flow simulations
    Weiner, Andre
    Geise, Janis
    [J]. MECCANICA, 2024,
  • [10] Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid
    Veith, Eric Msp
    Logemann, Torben
    Berezin, Aleksandr
    Wellssow, Arlena
    Balduin, Stephan
    [J]. 2024 12TH WORKSHOP ON MODELING AND SIMULATION OF CYBER-PHYSICAL ENERGY SYSTEMS, MSCPES, 2024,