Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

Cited by: 0
Authors
Scheller, Christian [1]
Schraner, Yanick [1]
Vogel, Manfred [1]
Affiliations
[1] Univ Appl Sci Northwestern Switzerland, Inst Data Sci, Basel, Switzerland
Keywords
Imitation learning; deep reinforcement learning; MineRL competition
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sample inefficiency of deep reinforcement learning methods is a major obstacle to their use in real-world applications. In this work, we show how human demonstrations can improve the final performance of agents on the Minecraft minigame ObtainDiamond with only 8M frames of environment interaction. We propose a training procedure in which policy networks are first trained on human data and later fine-tuned by reinforcement learning. Using a policy exploitation mechanism, experience replay, and an additional loss against catastrophic forgetting, our best agent achieved a mean score of 48. Our proposed solution placed 3rd in the NeurIPS MineRL Competition for Sample-Efficient Reinforcement Learning.
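The abstract mentions fine-tuning a demonstration-trained policy with reinforcement learning while adding a loss against catastrophic forgetting, but does not specify the form of that loss. The following is a minimal sketch of one common way such an objective could be combined, assuming a discrete action space, a REINFORCE-style policy-gradient term, and a behavioral-cloning cross-entropy term on human demonstrations as the auxiliary loss. All function names and the `bc_weight` parameter are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of action logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_gradient_loss(logits, action, advantage):
    """REINFORCE-style surrogate: -advantage * log pi(action | state)."""
    return -advantage * math.log(softmax(logits)[action])


def imitation_loss(logits, demo_action):
    """Cross-entropy between the policy and the action a human took."""
    return -math.log(softmax(logits)[demo_action])


def combined_loss(rl_batch, demo_batch, bc_weight=0.5):
    """Average RL loss plus a weighted imitation term.

    Assumption: the imitation term on demonstration data plays the role of
    the auxiliary loss against forgetting; the paper may use another form.
    rl_batch:   list of (logits, action, advantage) from environment rollouts
    demo_batch: list of (logits, demo_action) from human demonstrations
    """
    rl = sum(policy_gradient_loss(l, a, adv) for l, a, adv in rl_batch) / len(rl_batch)
    bc = sum(imitation_loss(l, a) for l, a in demo_batch) / len(demo_batch)
    return rl + bc_weight * bc
```

Setting `bc_weight` to zero recovers plain policy-gradient fine-tuning, while larger values keep the policy closer to the behavior seen in the human data.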
Pages: 67-76
Page count: 10