A Novel Adaptive Sampling Strategy for Deep Reinforcement Learning

Cited by: 1
Authors
Liang, Xingxing [1]
Chen, Li [1]
Feng, Yanghe [1]
Liu, Zhong [1]
Ma, Yang [1]
Huang, Kuihua [1]
Affiliations
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha, Peoples R China
Keywords
Deep reinforcement learning; an adaptive factor; DQN; Actor-Critic (AC) algorithm; GAME; GO;
DOI
10.1142/S1469026821500115
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning is an effective method for solving complex sequential decision-making problems and plays an important role in areas such as intelligent decision-making and behavioral cognition. Experience replay has contributed to the development of deep reinforcement learning by reusing past samples to improve sample efficiency. However, the existing prioritized experience replay mechanism assigns a higher sampling frequency to specific transitions, which changes the sample distribution in the sample set, and it cannot be applied to actor-critic and other on-policy reinforcement learning algorithms. To address this, we propose an adaptive factor based on the TD-error that further increases sample utilization by giving more attention weight to samples with larger TD-error, and we embed it flexibly into the original Deep Q Network (DQN) and Advantage Actor-Critic (A2C) algorithms to improve their performance. We evaluate the proposed architecture on CartPole-v1 and six Atari game environments. Under both fixed-temperature and annealing-temperature conditions, the improved algorithms show advantages over vanilla DQN and the original A2C in cumulative reward and in how quickly rewards climb.
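The abstract does not state the formula for the adaptive factor, so the following is only a rough sketch of the general idea of weighting samples by TD-error under a temperature. It assumes a softmax over absolute TD-errors within a batch, rescaled so the weights average to 1; the function names `adaptive_weights` and `weighted_td_loss` and the softmax form are hypothetical illustrations, not the authors' method.

```python
import numpy as np

def adaptive_weights(td_errors, temperature=1.0):
    """Hypothetical adaptive factor: softmax of |TD-error| / temperature,
    rescaled so that the weights in a batch average to 1."""
    scaled = np.abs(np.asarray(td_errors, dtype=float)) / temperature
    scaled = scaled - scaled.max()      # subtract the max for numerical stability
    weights = np.exp(scaled)
    weights = weights / weights.sum()   # softmax over the batch
    return weights * len(weights)       # a uniform batch gets weight 1 per sample

def weighted_td_loss(q_pred, q_target, temperature=1.0):
    """Mean squared TD-error, with each sample scaled by its assumed weight."""
    td = np.asarray(q_target, dtype=float) - np.asarray(q_pred, dtype=float)
    weights = adaptive_weights(td, temperature)
    return float(np.mean(weights * td ** 2))

# Samples with larger |TD-error| receive larger weights.
print(adaptive_weights([0.1, -2.0, 0.5], temperature=1.0))
```

In this sketch a high temperature pushes the weights toward uniform, which recovers the ordinary mean squared TD loss, while a low temperature concentrates weight on the largest errors. Because every sample in the batch is still used exactly once, the weighting does not change which transitions are sampled.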
Pages: 20