A Novel Adaptive Sampling Strategy for Deep Reinforcement Learning

Cited by: 1

Authors:

Liang, Xingxing [1]

Chen, Li [1]

Feng, Yanghe [1]

Liu, Zhong [1]

Ma, Yang [1]

Huang, Kuihua [1]

Affiliations:

[1] Natl Univ Def Technol, Coll Syst Engn, Changsha, Peoples R China

Source:

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS | 2021 / Vol. 20 / No. 02

Keywords:

Deep reinforcement learning; an adaptive factor; DQN; Actor-Critic (AC) algorithm; GAME; GO;

DOI:

10.1142/S1469026821500115

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Reinforcement learning is an effective method for solving complex sequential decision-making problems and plays an important role in areas such as intelligent decision-making and behavioral cognition. Experience replay has contributed to the development of deep reinforcement learning by reusing past samples to improve sample efficiency. However, the existing prioritized experience replay mechanism changes the sample distribution in the sample set, because it assigns a higher sampling frequency to specific transitions, and it cannot be applied to actor-critic and other on-policy reinforcement learning algorithms. To address this, we propose an adaptive factor based on the TD-error, which further increases sample utilization by giving more attention weight to samples with larger TD-error, and we embed it flexibly into the original Deep Q Network (DQN) and Advantage Actor-Critic (A2C) algorithms to improve their performance. We evaluate the proposed architecture on CartPole-V1 and on six Atari game environments. Under both fixed-temperature and annealing-temperature conditions, the improved algorithms show advantages over vanilla DQN and the original A2C in cumulative reward and climb speed.
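
The abstract does not give the formula for the adaptive factor, so the following is only an illustrative sketch of one plausible form, not the authors' definition. It assumes a temperature-scaled softmax over absolute TD-errors within a batch, rescaled so the weights average to 1. The names `adaptive_weights`, `weighted_td_loss`, and `annealed_temperature`, the linear annealing schedule, and its `start`/`end` values are all hypothetical.

```python
import numpy as np


def adaptive_weights(td_errors, temperature=1.0):
    """Hypothetical adaptive factor: softmax over |TD-error| / temperature.

    Weights are rescaled to average 1 over the batch, so samples with larger
    |TD-error| get more weight without changing the overall loss scale much.
    """
    abs_td = np.abs(np.asarray(td_errors, dtype=np.float64))
    logits = abs_td / temperature
    logits = logits - logits.max()  # subtract the max for numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    return probs * abs_td.size


def weighted_td_loss(td_errors, temperature=1.0):
    """Mean squared TD-error with each sample scaled by its adaptive weight.

    In a gradient-based implementation the weights would normally be treated
    as constants (no gradient flows through them); that is an assumption here.
    """
    td = np.asarray(td_errors, dtype=np.float64)
    weights = adaptive_weights(td, temperature)
    return float(np.mean(weights * td ** 2))


def annealed_temperature(step, total_steps, start=1.0, end=10.0):
    """Assumed linear schedule from `start` to `end`; the paper's may differ."""
    frac = min(max(step / total_steps, 0.0), 1.0)
    return start + frac * (end - start)


if __name__ == "__main__":
    batch_td = [0.1, -2.0, 0.5]
    print(adaptive_weights(batch_td, temperature=1.0))
    print(weighted_td_loss(batch_td, temperature=annealed_temperature(50, 100)))
```

Because the weights are applied to the samples already in the batch instead of changing which samples are drawn, a scheme of this kind leaves the sampling distribution untouched and could in principle be used with on-policy methods such as A2C, which is the motivation stated in the abstract. A very large temperature makes the weights nearly uniform, recovering the unweighted loss.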


Pages: 20

