A Novel Adaptive Sampling Strategy for Deep Reinforcement Learning

被引:1
|
作者
Liang, Xingxing [1 ]
Chen, Li [1 ]
Feng, Yanghe [1 ]
Liu, Zhong [1 ]
Ma, Yang [1 ]
Huang, Kuihua [1 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha, Peoples R China
关键词
Deep reinforcement learning; an adaptive factor; DQN; Actor-Critic (AC) algorithm; GAME; GO;
D O I
10.1142/S1469026821500115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning, as an effective method to solve complex sequential decision-making problems, plays an important role in areas such as intelligent decision-making and behavioral cognition. It is well known that the sample experience replay mechanism contributes to the development of current deep reinforcement learning by reusing past samples to improve the efficiency of samples. However, the existing priority experience replay mechanism changes the sample distribution in the sample set due to the higher sampling frequency assigned to a specific transition, and it cannot be applied to actor-critic and other on-policy reinforcement learning algorithm. To address this, we propose an adaptive factor based on TD-error, which further increases sample utilization by giving more attention weight to samples of larger TD-error, and embeds it flexibly into the original Deep Q Network and Advantage Actor-Critic algorithm to improve their performance. Then we carried out the performance evaluation for the proposed architecture in the context of CartPole-V1 and 6 environments of Atari game experiments, respectively, and the obtained results either on the conditions of fixed temperature or annealing temperature, when compared to those produced by the vanilla DQN and original A2C, highlight the advantages in cumulative rewards and climb speed of the improved algorithms.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Novel Deep Reinforcement Algorithm With Adaptive Sampling Strategy for Continuous Portfolio Optimization
    Huang, Szu-Hao
    Miao, Yu-Hsiang
    Hsiao, Yi-Ting
    [J]. IEEE ACCESS, 2021, 9 : 77371 - 77385
  • [2] Novel adaptive stability enhancement strategy for power systems based on deep reinforcement learning
    Zhao, Yincheng
    Hu, Weihao
    Zhang, Guozhou
    Huang, Qi
    Chen, Zhe
    Blaabjerg, Frede
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 152
  • [3] Predictive Energy-Aware Adaptive Sampling with Deep Reinforcement Learning
    Heo, Seonyeong
    Mayer, Philipp
    Magno, Michele
    [J]. 2022 29TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (IEEE ICECS 2022), 2022,
  • [4] ASRL: An Adaptive GPS Sampling Method Using Deep Reinforcement Learning
    Qu, Boting
    Zhao, Mengjiao
    Feng, Jun
    Wang, Xin
    [J]. 2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 153 - 158
  • [5] Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
    Wang, Guojian
    Wu, Faguo
    Zhang, Xiao
    Guo, Ning
    Zheng, Zhiming
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 285
  • [6] Adaptive Incident Radiance Field Sampling and Reconstruction Using Deep Reinforcement Learning
    Huo, Yuchi
    Wang, Rui
    Zheng, Ruzahng
    Xu, Hualin
    Bao, Hujun
    Yoon, Sung-Eui
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (01):
  • [7] Curiosity-driven recommendation strategy for adaptive learning via deep reinforcement learning
    Han, Ruijian
    Chen, Kani
    Tan, Chunxi
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2020, 73 (03): : 522 - 540
  • [8] Deep Reinforcement Learning for Adaptive Learning Systems
    Li, Xiao
    Xu, Hanchen
    Zhang, Jinming
    Chang, Hua-hua
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2023, 48 (02) : 220 - 243
  • [9] An adaptive testing item selection strategy via a deep reinforcement learning approach
    Wang, Pujue
    Liu, Hongyun
    Xu, Mingqi
    [J]. BEHAVIOR RESEARCH METHODS, 2024,
  • [10] A Novel Guided Deep Reinforcement Learning Tracking Control Strategy for Multirotors
    Hua, Hean
    Wang, Yaonan
    Zhong, Hang
    Zhang, Hui
    Fang, Yongchun
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 13