Incremental Reinforcement Learning with Dual-Adaptive ∈-Greedy Exploration

被引:0
|
作者
Ding, Wei [1 ]
Jiang, Siyang [1 ]
Chen, Hsi-Wen [1 ]
Chen, Ming-Syan [1 ]
机构
[1] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei, Taiwan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has achieved impressive performance in various domains. However, most RL frameworks oversimplify the problem by assuming a fixed-yet-known environment and often have difficulty being generalized to real-world scenarios. In this paper, we address a new challenge with a more realistic setting, Incremental Reinforcement Learning, where the search space of the Markov Decision Process continually expands. While previous methods usually suffer from the lack of efficiency in exploring the unseen transitions, especially with increasing search space, we present a new exploration framework named Dual-Adaptive is an element of-greedy Exploration (DAE) to address the challenge of Incremental RL. Specifically, DAE employs a Meta Policy and an Explorer to avoid redundant computation on those sufficiently learned samples. Furthermore, we release a testbed based on a synthetic environment and the Atari benchmark to validate the effectiveness of any exploration algorithms under Incremental RL. Experimental results demonstrate that the proposed framework can efficiently learn the unseen transitions in new environments, leading to notable performance improvement, i.e., an average of more than 80%, over eight baselines examined.
引用
收藏
页码:7387 / 7395
页数:9
相关论文
共 50 条
  • [1] Adaptive ε-Greedy Exploration in Reinforcement Learning Based on Value Differences
    Tokic, Michel
    KI 2010: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2010, 6359 : 203 - 210
  • [2] An Adaptive Implementation of ε-Greedy in Reinforcement Learning
    Mignon, Alexandre dos Santos
    de Azevedo da Rocha, Ricardo Luis
    8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 1146 - 1151
  • [3] Adaptive Exploration Strategies for Reinforcement Learning
    Hwang, Kao-Shing
    Li, Chih-Wen
    Jiang, Wei-Cheng
    2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 16 - 19
  • [4] Adaptive Exploration for Continual Reinforcement Learning
    Stulp, Freek
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1631 - 1636
  • [5] A dual-adaptive stochastic reinforcement chimp optimization algorithm for fire detection and multidimensional problem solving
    Zhang, Ziyang
    Tan, Lingye
    Martin, Diego
    Qian, Leren
    Khishe, Mohammad
    Jangir, Pradeep
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [6] Incremental Reinforcement Learning Flight Control with Adaptive Learning Rate
    Liu J.-H.
    Shan J.-Y.
    Rong J.-L.
    Zheng X.
    Yuhang Xuebao/Journal of Astronautics, 2022, 43 (01): : 111 - 121
  • [7] DUAL-ADAPTIVE CONTROL AND ITS PRACTICAL APPLICATIONS
    BADR, O
    REY, D
    LADET, P
    AUTOMATICA, 1979, 15 (01) : 91 - 96
  • [8] Understanding Deep Neural Function Approximation in Reinforcement Learning via ε-Greedy Exploration
    Liu, Fanghui
    Viano, Luca
    Cevher, Volkan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [9] Dual-Adaptive Linear Prediction for Radio Channel with Abrupt Change
    Lv, Changwei
    Hou, Shujuan
    Mei, Wenbo
    2012 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2012,
  • [10] A dual-adaptive directed genetic algorithm for construction scheduling
    Shen, Zhaozheng
    Wu, Jie
    Cao, Yilun
    JOURNAL OF BUILDING ENGINEERING, 2024, 96