Adaptive deep reinforcement learning for non-stationary environments

被引:0
|
作者
Jin ZHU [1 ]
Yutong WEI [1 ]
Yu KANG [1 ]
Xiaofeng JIANG [1 ]
Geir E.DULLERUD [2 ]
机构
[1] Department of Automation, University of Science and Technology of China
[2] Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning(DRL) is currently used to solve Markov decision process problems for which the environment is typically assumed to be stationary. In this paper, we propose an adaptive DRL method for non-stationary environments. First, we introduce model uncertainty and propose the self-adjusting deep Q-learning algorithm, which can achieve the rebalance of exploration and exploitation automatically as the environment changes. Second, we propose a feasible criterion to judge the appropriateness of parameter setting of deep Q-networks and minimize the misjudgment probability based on the large deviation principle(LDP). The effectiveness of the proposed adaptive DRL method is illustrated in terms of an advanced persistent threat(APT) attack simulation game. Experimental results show that compared with the classic deep Q-learning algorithms in non-stationary and stationary environments, the adaptive DRL method improves performance by at least 14.28% and 30.56%, respectively.
引用
收藏
页码:225 / 241
页数:17
相关论文
共 50 条
  • [1] Adaptive deep reinforcement learning for non-stationary environments
    Zhu, Jin
    Wei, Yutong
    Kang, Yu
    Jiang, Xiaofeng
    Dullerud, Geir E.
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
  • [2] Adaptive deep reinforcement learning for non-stationary environments
    Jin Zhu
    Yutong Wei
    Yu Kang
    Xiaofeng Jiang
    Geir E. Dullerud
    [J]. Science China Information Sciences, 2022, 65
  • [3] Towards Reinforcement Learning for Non-stationary Environments
    Dal Toe, Sebastian Gregory
    Tiddeman, Bernard
    Mac Parthalain, Neil
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 41 - 52
  • [4] Reinforcement learning algorithm for non-stationary environments
    Padakandla, Sindhu
    Prabuchandran, K. J.
    Bhatnagar, Shalabh
    [J]. APPLIED INTELLIGENCE, 2020, 50 (11) : 3590 - 3606
  • [5] Reinforcement learning algorithm for non-stationary environments
    Sindhu Padakandla
    Prabuchandran K. J.
    Shalabh Bhatnagar
    [J]. Applied Intelligence, 2020, 50 : 3590 - 3606
  • [6] Reinforcement learning in episodic non-stationary Markovian environments
    Choi, SPM
    Zhang, NL
    Yeung, DY
    [J]. IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 752 - 758
  • [7] Adaptive and on-line learning in non-stationary environments
    Lughofer, Edwin
    Sayed-Mouchaweh, Moamar
    [J]. EVOLVING SYSTEMS, 2015, 6 (02) : 75 - 77
  • [8] Meta-Reinforcement Learning in Non-Stationary and Dynamic Environments
    Bing, Zhenshan
    Lerch, David
    Huang, Kai
    Knoll, Alois
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3476 - 3491
  • [9] An adaptable fuzzy reinforcement learning method for non-stationary environments
    Haighton, Rachel
    Asgharnia, Amirhossein
    Schwartz, Howard
    Givigi, Sidney
    [J]. NEUROCOMPUTING, 2024, 604
  • [10] Adaptive Learning With Extreme Verification Latency in Non-Stationary Environments
    Idrees, Mobin M. M.
    Stahl, Frederic
    Badii, Atta
    [J]. IEEE ACCESS, 2022, 10 : 127345 - 127364