Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid

被引:0
|
作者
Veith, Eric Msp [1 ,2 ]
Logemann, Torben [1 ]
Berezin, Aleksandr [2 ]
Wellssow, Arlena [1 ,2 ]
Balduin, Stephan [2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Res Grp Adversarial Resilience Learning, Oldenburg, Germany
[2] OFFIS Inst Informat Technol, R&D Div Energy, Oldenburg, Germany
关键词
D O I
10.1109/MSCPES62135.2024.10542713
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Autonomous and learning systems based on Deep Reinforcement Learning have firmly established themselves as a foundation for approaches to creating resilient and efficient Cyber-Physical Energy Systems. However, most current approaches suffer from two distinct problems: Modern model-free algorithms such as Soft Actor Critic need a high number of samples to learn a meaningful policy, as well as a fallback to ward against concept drifts (e. g., catastrophic forgetting). In this paper, we present the work in progress towards a hybrid agent architecture that combines model-based Deep Reinforcement Learning with imitation learning to overcome both problems.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Probabilistic model-based imitation learning
    Englert, Peter
    Paraschos, Alexandros
    Deisenroth, Marc Peter
    Peters, Jan
    [J]. ADAPTIVE BEHAVIOR, 2013, 21 (05) : 388 - 403
  • [2] Playing the Imitation Game With Deep Learning
    Lipton, Zachary C.
    Elkan, Charles
    [J]. IEEE SPECTRUM, 2016, 53 (02) : 40 - 45
  • [3] Model-Based Imitation Learning for Urban Driving
    Hu, Anthony
    Corrado, Gianluca
    Griffiths, Nicolas
    Murez, Zak
    Gurau, Corina
    Yeo, Hudson
    Kendall, Alex
    Cipolla, Roberto
    Shotton, Jamie
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] A Probabilistic Framework for Model-Based Imitation Learning
    Shon, Aaron P.
    Grimes, David B.
    Baker, Chris L.
    Rao, Rajesh P. N.
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 2004, : 1237 - 1242
  • [5] Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning
    Hua, Jiang
    Zeng, Liangcai
    Li, Gongfa
    Ju, Zhaojie
    [J]. Sensors (Switzerland), 2021, 21 (04): : 1 - 21
  • [6] Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning
    Hua, Jiang
    Zeng, Liangcai
    Li, Gongfa
    Ju, Zhaojie
    [J]. SENSORS, 2021, 21 (04) : 1 - 21
  • [7] Tracking the Race Between Deep Reinforcement Learning and Imitation Learning
    Gros, Timo P.
    Hoeller, Daniel
    Hoffmann, Joerg
    Wolf, Verena
    [J]. QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2020), 2020, 12289 : 11 - 17
  • [8] A Penetration Strategy Combining Deep Reinforcement Learning and Imitation Learning
    Wang, Xiaofang
    Gu, Kunren
    [J]. Yuhang Xuebao/Journal of Astronautics, 2023, 44 (06): : 914 - 925
  • [9] Learning How to Play Bomberman with Deep Reinforcement and Imitation Learning
    Goulart, Icaro
    Paes, Aline
    Clua, Esteban
    [J]. ENTERTAINMENT COMPUTING AND SERIOUS GAMES, ICEC-JCSG 2019, 2019, 11863 : 121 - 133
  • [10] Cloud Resource Scheduling With Deep Reinforcement Learning and Imitation Learning
    Guo, Wenxia
    Tian, Wenhong
    Ye, Yufei
    Xu, Lingxiao
    Wu, Kui
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05): : 3576 - 3586