Model-Based Planning with Energy-Based Models

被引:0
|
作者
Du, Yilun [1 ]
Lin, Toru [1 ]
Mordatch, Igor [2 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
[2] Google Brain, London, England
来源
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Model-based planning holds great promise for improving both sample efficiency and generalization in reinforcement learning (RL). We show that energy-based models (EBMs) are a promising class of models to use for model-based planning. EBMs naturally support inference of intermediate states given start and goal state distributions. We provide an online algorithm to train EBMs while interacting with the environment, and show that EBMs allow for significantly better online learning than corresponding feed-forward networks. We further show that EBMs support maximum entropy state inference and are able to generate diverse state space plans. We show that inference purely in state space - without planning actions - allows for better generalization to previously unseen obstacles in the environment and prevents the planner from exploiting the dynamics model by applying uncharacteristic action sequences. Finally, we show that online EBM training naturally leads to intentionally planned state exploration which performs significantly better than random exploration.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] An energy-based model for the wear of UHMWPE
    R. Colaço
    M.P. Gispert
    A.P. Serro
    B. Saramago
    [J]. Tribology Letters, 2007, 26 : 119 - 124
  • [22] Energy-Based Liquefaction Triggering Model
    Ulmer, K. J.
    Green, R. A.
    Rodriguez-Marek, A.
    Mitchell, J. K.
    [J]. JOURNAL OF GEOTECHNICAL AND GEOENVIRONMENTAL ENGINEERING, 2023, 149 (11)
  • [23] An energy-based model of the Mullins effect
    Ogden, RW
    Roxburgh, DG
    [J]. CONSTITUTIVE MODELS FOR RUBBER, 1999, : 23 - 28
  • [24] Finite mixture models and model-based clusteringFinite mixture models and model-based clustering
    Melnykov, Volodymyr
    Maitra, Ranjan
    [J]. STATISTICS SURVEYS, 2010, 4 : 80 - 116
  • [25] Energy-based models for sparse overcomplete representations
    Teh, YW
    Welling, M
    Osindero, S
    Hinton, GE
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (7-8) : 1235 - 1260
  • [26] An energy-based model for dynamic hysteresis
    Mazauric, V
    Malobertil, O
    Meunier, G
    Kedous-Lebouc, A
    Geoffroy, O
    Rebière, Y
    [J]. IEEE TRANSACTIONS ON MAGNETICS, 2005, 41 (10) : 3766 - 3768
  • [27] Rate -Distortion via Energy-Based Models
    Li, Qing
    Kim, Yongjune
    Guyot, Cyril
    [J]. 2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 351 - 351
  • [28] An energy-based model for the wear of UHMWPE
    Colaco, R.
    Gispert, M. P.
    Serro, A. P.
    Saramago, B.
    [J]. TRIBOLOGY LETTERS, 2007, 26 (02) : 119 - 124
  • [29] Towards understanding retrosynthesis by energy-based models
    Sun, Ruoxi
    Dai, Hanjun
    Li, Li
    Kearnes, Steven
    Dai, Bo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [30] LOCAL AND NONLOCAL ENERGY-BASED COUPLING MODELS
    Acosta, Gabriel
    Bersetche, Francisco
    Rossi, Julio D.
    [J]. SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 2022, 54 (06) : 6288 - 6322