Adaptive deep reinforcement learning for non-stationary environments

被引：0

作者：

Jin ZHU ^{[1
]}

Yutong WEI ^{[1
]}

Yu KANG ^{[1
]}

Xiaofeng JIANG ^{[1
]}

Geir E.DULLERUD ^{[2
]}

机构：

[1] Department of Automation, University of Science and Technology of China

[2] Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign

来源：

Science China(Information Sciences) | 2022年 / 10期

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep reinforcement learning(DRL) is currently used to solve Markov decision process problems for which the environment is typically assumed to be stationary. In this paper, we propose an adaptive DRL method for non-stationary environments. First, we introduce model uncertainty and propose the self-adjusting deep Q-learning algorithm, which can achieve the rebalance of exploration and exploitation automatically as the environment changes. Second, we propose a feasible criterion to judge the appropriateness of parameter setting of deep Q-networks and minimize the misjudgment probability based on the large deviation principle(LDP). The effectiveness of the proposed adaptive DRL method is illustrated in terms of an advanced persistent threat(APT) attack simulation game. Experimental results show that compared with the classic deep Q-learning algorithms in non-stationary and stationary environments, the adaptive DRL method improves performance by at least 14.28% and 30.56%, respectively.

引用

页码：225 / 241

页数：17

共 50 条

[1] Adaptive deep reinforcement learning for non-stationary environments
Zhu, Jin
Wei, Yutong
Kang, Yu
Jiang, Xiaofeng
Dullerud, Geir E.
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
[2] Adaptive deep reinforcement learning for non-stationary environments
Jin Zhu
Yutong Wei
Yu Kang
Xiaofeng Jiang
Geir E. Dullerud
[J]. Science China Information Sciences, 2022, 65
[3] Towards Reinforcement Learning for Non-stationary Environments
Dal Toe, Sebastian Gregory
Tiddeman, Bernard
Mac Parthalain, Neil
[J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 41 - 52
[4] Reinforcement learning algorithm for non-stationary environments
Padakandla, Sindhu
Prabuchandran, K. J.
Bhatnagar, Shalabh
[J]. APPLIED INTELLIGENCE, 2020, 50 (11) : 3590 - 3606
[5] Reinforcement learning algorithm for non-stationary environments
Sindhu Padakandla
Prabuchandran K. J.
Shalabh Bhatnagar
[J]. Applied Intelligence, 2020, 50 : 3590 - 3606
[6] Reinforcement learning in episodic non-stationary Markovian environments
Choi, SPM
Zhang, NL
Yeung, DY
[J]. IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 752 - 758
[7] Adaptive and on-line learning in non-stationary environments
Lughofer, Edwin
Sayed-Mouchaweh, Moamar
[J]. EVOLVING SYSTEMS, 2015, 6 (02) : 75 - 77
[8] Meta-Reinforcement Learning in Non-Stationary and Dynamic Environments
Bing, Zhenshan
Lerch, David
Huang, Kai
Knoll, Alois
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3476 - 3491
[9] An adaptable fuzzy reinforcement learning method for non-stationary environments
Haighton, Rachel
Asgharnia, Amirhossein
Schwartz, Howard
Givigi, Sidney
[J]. NEUROCOMPUTING, 2024, 604
[10] Adaptive Learning With Extreme Verification Latency in Non-Stationary Environments
Idrees, Mobin M. M.
Stahl, Frederic
Badii, Atta
[J]. IEEE ACCESS, 2022, 10 : 127345 - 127364

← 1 2 3 4 5 →