Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

被引:0
|
作者
Scheller, Christian [1 ]
Schraner, Yanick [1 ]
Vogel, Manfred [1 ]
机构
[1] Univ Appl Sci Northwestern Switzerland, Inst Data Sci, Basel, Switzerland
关键词
Imitation learning; deep reinforcement learning; MineRL competition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sample inefficiency of deep reinforcement learning methods is a major obstacle for their use in real-world applications. In this work, we show how human demonstrations can improve final performance of agents on the Minecraft minigame ObtainDiamond with only 8M frames of environment interaction. We propose a training procedure where policy networks are first trained on human data and later fine-tuned by reinforcement learning. Using a policy exploitation mechanism, experience replay and an additional loss against catastrophic forgetting, our best agent was able to achieve a mean score of 48. Our proposed solution placed 3rd in the NeurIPSMineRL Competition for Sample-Efficient Reinforcement Learning.
引用
收藏
页码:67 / 76
页数:10
相关论文
共 50 条
  • [1] Sample-Efficient Multi-Agent Reinforcement Learning with Demonstrations for Flocking Control
    Qiu, Yunbo
    Zhan, Yuzhu
    Jin, Yue
    Wang, Jian
    Zhang, Xudong
    [J]. 2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [2] Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Mapless Navigation by Leveraging Prior Demonstrations
    Pfeiffer, Mark
    Shukla, Samarth
    Turchetta, Matteo
    Cadena, Cesar
    Krause, Andreas
    Siegwart, Roland
    Nieto, Juan
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 4423 - 4430
  • [3] Learning Fairness from Demonstrations via Inverse Reinforcement Learning
    Blandin, Jack
    Kash, Ian
    [J]. PROCEEDINGS OF THE 2024 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2024, 2024, : 51 - 61
  • [4] Towards Sample Efficient Reinforcement Learning
    Yu, Yang
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5739 - 5743
  • [5] Sample Efficient Reinforcement Learning with REINFORCE
    Zhang, Junzi
    Kim, Jongho
    O'Donoghue, Brendan
    Boyd, Stephen
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10887 - 10895
  • [6] V-MIN: Efficient Reinforcement Learning through Demonstrations and Relaxed Reward Demands
    Martinez, David
    Alenya, Guillem
    Torras, Carme
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2857 - 2863
  • [7] Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations
    Kamoutsi, Angeliki
    Banjac, Goran
    Lygeros, John
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [8] Real-time power scheduling through reinforcement learning from demonstrations
    Liu, Shaohuai
    Liu, Jinbo
    Yang, Nan
    Huang, Yupeng
    Jiang, Qirong
    Gao, Yang
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2024, 235
  • [9] An Efficient Unified Approach Using Demonstrations for Inverse Reinforcement Learning
    Hwang, Maxwell
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    Hwang, Kao-Shing
    Tseng, Yi-Chia
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (03) : 444 - 452
  • [10] Integrating Reinforcement Learning and Learning From Demonstrations to Learn Nonprehensile Manipulation
    Sun, Xilong
    Li, Jiqing
    Kovalenko, Anna Vladimirovna
    Feng, Wei
    Ou, Yongsheng
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 1735 - 1744