Model-Based Planning with Energy-Based Models

被引:0
|
作者
Du, Yilun [1 ]
Lin, Toru [1 ]
Mordatch, Igor [2 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
[2] Google Brain, London, England
来源
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Model-based planning holds great promise for improving both sample efficiency and generalization in reinforcement learning (RL). We show that energy-based models (EBMs) are a promising class of models to use for model-based planning. EBMs naturally support inference of intermediate states given start and goal state distributions. We provide an online algorithm to train EBMs while interacting with the environment, and show that EBMs allow for significantly better online learning than corresponding feed-forward networks. We further show that EBMs support maximum entropy state inference and are able to generate diverse state space plans. We show that inference purely in state space - without planning actions - allows for better generalization to previously unseen obstacles in the environment and prevents the planner from exploiting the dynamics model by applying uncharacteristic action sequences. Finally, we show that online EBM training naturally leads to intentionally planned state exploration which performs significantly better than random exploration.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Regularizing Model-Based Planning with Energy-Based Models
    Boney, Rinu
    Kannala, Juho
    Ilin, Alexander
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [2] The physics of energy-based models
    Huembeli, Patrick
    Arrazola, Juan Miguel
    Killoran, Nathan
    Mohseni, Masoud
    Wittek, Peter
    [J]. QUANTUM MACHINE INTELLIGENCE, 2022, 4 (01)
  • [3] The physics of energy-based models
    Patrick Huembeli
    Juan Miguel Arrazola
    Nathan Killoran
    Masoud Mohseni
    Peter Wittek
    [J]. Quantum Machine Intelligence, 2022, 4
  • [4] Conjugate Energy-Based Models
    Wu, Hao
    Esmaeili, Babak
    Wick, Michael
    Tristan, Jean-Baptiste
    van de Meent, Jan-Willem
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] Model-based energy planning: A methodology to choose and combine models to support policy decisions
    Oliveira, Dilayne Santos
    Lumbreras, Sara
    Alvarez, Erik F.
    Ramos, Andres
    Olmos, Luis
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 159
  • [6] Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models
    Bhattacharyya, Sumanta
    Rooshenas, Amirmohammad
    Naskar, Subhajit
    Sun, Simeng
    Iyyer, Mohit
    McCallum, Andrew
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4528 - 4537
  • [7] Deep Energy-Based NARX Models
    Hendriks, Johannes N.
    Gustafsson, Fredrik K.
    Ribeiro, Antonio H.
    Wills, Adrian G.
    Schon, Thomas B.
    [J]. IFAC PAPERSONLINE, 2021, 54 (07): : 505 - 510
  • [8] Energy-based models for environmental biotechnology
    Rodriguez, Jorge
    Lema, Juan M.
    Kleerebezem, Robbert
    [J]. TRENDS IN BIOTECHNOLOGY, 2008, 26 (07) : 366 - 374
  • [9] Residual energy-based models for text
    Bakhtin, Anton
    Deng, Yuntian
    Gross, Sam
    Ott, Myle
    Ranzato, Marc'Aurelio
    Szlam, Arthur
    [J]. Journal of Machine Learning Research, 2021, 22
  • [10] Residual Energy-Based Models for Text
    Bakhtin, Anton
    Deng, Yuntian
    Gross, Sam
    Ott, Myle
    Ranzato, Marc'Aurelio
    Szlam, Arthur
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22