Self-Augmenting Strategy for Reinforcement Learning

被引:91
|
作者
Huang, Xin [1 ]
Xiao, Shuangjiu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
Reinforcement Learning; Deep Q Learning; Self-Augmenting;
D O I
10.1145/3168390.3168392
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training the agent to interact with environment intelligently is one of the core problems in reinforcement learning. In this paper, we propose a Self-Augmenting strategy for reinforcement learning to accelerate the learning process and agent performance by imitating and augmenting practical human experience. Instead of exploring randomly at the beginning as Deep Q Learning algorithms do, our strategy uses a short series of expert experience of human as augmenting guide in the training for the agent. Because of the imitation from the similar states of the augmented human experience, the agent trained with our strategy scores higher and converges faster than the original Deep Q Learning method does.
引用
收藏
页码:1 / 4
页数:4
相关论文
共 50 条
  • [1] A DATA STRUCTURE AND ALGORITHM FOR A SELF-AUGMENTING HEURISTIC PROGRAM
    HUTCHINSON, A
    [J]. COMPUTER JOURNAL, 1986, 29 (02): : 135 - 150
  • [2] Effective Sentiment Stream Analysis with Self-Augmenting Training and Demand-Driven Projection
    Silva, Ismael S.
    Gomide, Janaina
    Veloso, Adriano
    Meira Jr, Wagner
    Ferreira, Renato
    [J]. PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 475 - 484
  • [3] A self-augmenting gene expression cassette for enhanced and sustained transgene expression in the presence of proinflammatory cytokines
    Luo, P.
    Reed, B. D.
    Tsang, T. C.
    Harris, D. T.
    Flavell, R. A.
    [J]. DNA AND CELL BIOLOGY, 2006, 25 (12) : 659 - 667
  • [4] Augmenting Automated Game Testing with Deep Reinforcement Learning
    Bergdahl, Joakim
    Gordillo, Camilo
    Tollmar, Konrad
    Gisslen, Linus
    [J]. 2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 600 - 603
  • [5] Nucleus-selective self-augmenting cascade nanoassemblies for targeted synergistic photo-chemo therapy of tumors
    Yang, Lan
    Ma, Huijie
    Liu, Ye
    Cao, Rumeng
    Chen, Shaofeng
    Wang, Jiajia
    Xiang, Ling
    Zhang, Jiumeng
    Feng, Xuli
    Wang, Chenhui
    [J]. CHEMICAL COMMUNICATIONS, 2023, 59 (73) : 10940 - 10943
  • [6] An Online Training Method for Augmenting MPC with Deep Reinforcement Learning
    Bellegarda, Guillaume
    Byl, Katie
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5453 - 5459
  • [7] Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks
    Nasiriany, Soroush
    Liu, Huihan
    Zhu, Yuke
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7477 - 7484
  • [8] The reinforcement learning Kelly strategy
    Jiang, R.
    Saunders, D.
    Weng, C.
    [J]. QUANTITATIVE FINANCE, 2022, 22 (08) : 1445 - 1464
  • [9] Interactive Reinforcement Learning Strategy
    Shi, Zhenjie
    Ma, Wenming
    Yin, Shuai
    Zhang, Hailiang
    Zhao, Xiaofan
    [J]. 2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 507 - 512
  • [10] Augmenting Reinforcement Learning to Enhance Cooperation in the Iterated Prisoner's Dilemma
    Feehan, Grace
    Fatima, Shaheen
    [J]. ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 146 - 157