A Data-Efficient Training Method for Deep Reinforcement Learning

被引:0
|
作者
Feng, Wenhui [1 ]
Han, Chongzhao [1 ]
Lian, Feng [1 ]
Liu, Xia [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, Key Lab Intelligent Networks, Minist Educ, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
deep reinforcement learning; data efficiency; curriculum learning; transfer learning; LEVEL; ENVIRONMENT; GO;
D O I
10.3390/electronics11244205
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data inefficiency is one of the major challenges for deploying deep reinforcement learning algorithms widely in industry control fields, especially in regard to long-horizon sparse reward tasks. Even in a simulation-based environment, it is often prohibitive to take weeks to train an algorithm. In this study, a data-efficient training method is proposed in which a DQN is used as a base algorithm, and an elaborate curriculum is designed for the agent in the simulation scenario to accelerate the training process. In the early stage of the training process, the distribution of the initial state is set close to the goal so the agent can obtain an informative reward easily. As the training continues, the initial state distribution is set farther from the goal for the agent to explore more state space. Thus, the agent can obtain a reasonable policy through fewer interactions with the environment. To bridge the sim-to-real gap, the parameters for the output layer of the neural network for the value function are fine-tuned. An experiment on UAV maneuver control is conducted in the proposed training framework to verify the method. We demonstrate that data efficiency is different for the same data in different training stages.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A Data-Efficient Method of Deep Reinforcement Learning for Chinese Chess
    Xu, Changming
    Ding, Hengfeng
    Zhang, Xuejian
    Wang, Cong
    Yang, Hongji
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 687 - 693
  • [2] Data-Efficient Deep Reinforcement Learning with Symmetric Consistency
    Zhang, Xianchao
    Yang, Wentao
    Zhang, Xiaotong
    Liu, Han
    Wang, Guanglu
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2430 - 2436
  • [3] Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
    Frauenknecht, Bernd
    Ehlgen, Tobias
    Trimpe, Sebastian
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 894 - 901
  • [4] Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
    Maulana, Muhammad Rizki
    Lee, Wee Sun
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 122 - 138
  • [5] A self-supervised deep learning method for data-efficient training in genomics
    Hüseyin Anil Gündüz
    Martin Binder
    Xiao-Yin To
    René Mreches
    Bernd Bischl
    Alice C. McHardy
    Philipp C. Münch
    Mina Rezaei
    [J]. Communications Biology, 6
  • [6] A self-supervised deep learning method for data-efficient training in genomics
    Guenduez, Hueseyin Anil
    Binder, Martin
    To, Xiao-Yin
    Mreches, Rene
    Bischl, Bernd
    McHardy, Alice C.
    Muench, Philipp C.
    Rezaei, Mina
    [J]. COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [7] Data-Efficient Hierarchical Reinforcement Learning
    Nachum, Ofir
    Gu, Shixiang
    Lee, Honglak
    Levine, Sergey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [8] A data-efficient goal-directed deep reinforcement learning method for robot visuomotor skill
    Jiang, Rong
    Wang, Zhipeng
    He, Bin
    Zhou, Yanmin
    Li, Gang
    Zhu, Zhongpan
    [J]. NEUROCOMPUTING, 2021, 462 : 389 - 401
  • [9] Data-Efficient Reinforcement Learning for Malaria Control
    Zou, Lixin
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 507 - 513
  • [10] A Data-Efficient Reinforcement Learning Method Based on Local Koopman Operators
    Song, Lixing
    Wang, Junheng
    Xu, Junhong
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 515 - 520