Self-Augmenting Strategy for Reinforcement Learning

被引:91
|
作者
Huang, Xin [1 ]
Xiao, Shuangjiu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
Reinforcement Learning; Deep Q Learning; Self-Augmenting;
D O I
10.1145/3168390.3168392
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training the agent to interact with environment intelligently is one of the core problems in reinforcement learning. In this paper, we propose a Self-Augmenting strategy for reinforcement learning to accelerate the learning process and agent performance by imitating and augmenting practical human experience. Instead of exploring randomly at the beginning as Deep Q Learning algorithms do, our strategy uses a short series of expert experience of human as augmenting guide in the training for the agent. Because of the imitation from the similar states of the augmented human experience, the agent trained with our strategy scores higher and converges faster than the original Deep Q Learning method does.
引用
收藏
页码:1 / 4
页数:4
相关论文
共 50 条
  • [31] Jamming strategy learning based on positive reinforcement learning and orthogonal decomposition
    Zhuansun, Shaoshuai
    Yang, Junan
    Liu, Hui
    Huang, Keju
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2018, 40 (03): : 518 - 525
  • [32] SELF-REINFORCEMENT AND EXTERNAL REINFORCEMENT IN VISUAL-MOTOR LEARNING
    MARSTON, A
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1967, 74 (01): : 93 - &
  • [33] Strategy acquisition for the game "Othello" based on reinforcement learning
    Yoshioka, T
    Ishii, S
    Ito, M
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1999, E82D (12): : 1618 - 1626
  • [34] Reinforcement learning for pricing strategy optimization in the insurance industry
    Krasheninnikova, Elena
    Garcia, Javier
    Maestre, Roberto
    Fernandez, Fernando
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 80 : 8 - 19
  • [35] Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
    Li, Xiaodong
    Wu, Pangjing
    Zou, Chenxin
    Li, Qing
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (03) : 288 - 300
  • [36] An improved reinforcement learning control strategy for batch processes
    Zhang, Peng
    Zhang, Jie
    Long, Yang
    Hu, Bingzhang
    [J]. 2019 24TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2019, : 360 - 365
  • [37] A Precision Advertising Strategy Based on Deep Reinforcement Learning
    Liang, Haiqing
    [J]. Ingenierie des Systemes d'Information, 2020, 25 (03): : 397 - 403
  • [38] Reinforcement Learning Strategy for Solving the MRCPSP by a Team of Agents
    Jedrzejowicz, Piotr
    Ratajczak-Ropel, Ewa
    [J]. INTELLIGENT DECISION TECHNOLOGIES, 2015, 39 : 537 - 548
  • [39] Strategy acquisition for the game "Othello" based on reinforcement learning
    Yoshioka, T
    Ishii, S
    Ito, M
    [J]. ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 841 - 844
  • [40] A Provably Efficient Sample Collection Strategy for Reinforcement Learning
    Tarbouriech, Jean
    Pirotta, Matteo
    Valko, Michal
    Lazaric, Alessandro
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34