Transfer reinforcement learning via meta-knowledge extraction using auto-pruned decision trees

Cited by: 8
Authors
Lan, Yixing [1]
Xu, Xin [1]
Fang, Qiang [1]
Zeng, Yujun [1]
Liu, Xinwang [2]
Zhang, Xianjian [1]
Affiliations
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Reinforcement learning; Transfer reinforcement learning; Meta-knowledge; Explainable artificial intelligence; POLICIES;
DOI
10.1016/j.knosys.2022.108221
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Transfer reinforcement learning (RL) has recently received increasing attention as a way to improve the learning performance of RL agents in target Markov decision problems (MDPs) by using knowledge learned in source MDPs. However, improving the transfer capability and interpretability of RL algorithms remains an open and challenging problem. In this paper, we propose a novel transfer reinforcement learning approach via meta-knowledge extraction using auto-pruned decision trees. In source MDPs, pre-trained policies are first learned via RL algorithms using general function approximators. Then, a meta-knowledge extraction algorithm is designed with an auto-pruned decision tree model, where the meta-knowledge is learned by re-training the auto-pruned decision tree on data samples generated from the pre-trained policies. The state spaces of meta-knowledge are determined by estimating the uncertainty of state-action pairs in pre-trained policies based on the entropy value of leaf nodes. In target MDPs, a hybrid policy is generated by integrating the meta-knowledge with the policies learned on the target MDPs, according to whether the state is in the state set of meta-knowledge. Based on the proposed transfer RL approach, two meta-knowledge-based transfer reinforcement learning (MKRL) algorithms are developed, one for MDPs with discrete action spaces and one for MDPs with continuous action spaces. Experimental results on several benchmark tasks show that the MKRL algorithms outperform other baselines in terms of learning efficiency and interpretability in target MDPs with generic cases of task similarity. © 2022 Elsevier B.V. All rights reserved.
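The entropy-based selection and the hybrid policy described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the auto-pruned decision tree is replaced here by a hand-written three-leaf partition of a 1-D state, and the names `leaf_entropy`, `extract_meta_knowledge`, `hybrid_policy`, `leaf_of`, `LEAVES`, and the threshold `max_entropy=0.5` are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def leaf_entropy(actions):
    """Shannon entropy (in bits) of the action labels collected in one leaf."""
    counts = Counter(actions)
    total = len(actions)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def extract_meta_knowledge(leaves, max_entropy=0.5):
    """Keep low-entropy leaves only; map each kept leaf to its majority action.

    The threshold is an assumed value, not one taken from the paper.
    """
    meta = {}
    for leaf_id, actions in leaves.items():
        if leaf_entropy(actions) <= max_entropy:
            meta[leaf_id] = Counter(actions).most_common(1)[0][0]
    return meta


def hybrid_policy(state, leaf_of, meta, target_policy):
    """Use meta-knowledge when the state falls in a kept leaf, else the target policy."""
    leaf_id = leaf_of(state)
    if leaf_id in meta:
        return meta[leaf_id]
    return target_policy(state)


def leaf_of(state):
    """Stand-in for a fitted tree: a fixed partition of a 1-D state into three leaves."""
    if state < -1.0:
        return "low"
    if state < 1.0:
        return "mid"
    return "high"


# Hypothetical action samples that a pre-trained source policy produced in each leaf.
LEAVES = {
    "low": ["left"] * 10,                  # fully consistent, entropy 0
    "mid": ["left"] * 5 + ["right"] * 5,   # maximally uncertain, entropy 1 bit
    "high": ["right"] * 9 + ["left"],      # mostly consistent
}

meta = extract_meta_knowledge(LEAVES)
print(meta)
print(hybrid_policy(-2.0, leaf_of, meta, lambda s: "explore"))
print(hybrid_policy(0.0, leaf_of, meta, lambda s: "explore"))
```

In this toy setup the "mid" leaf is dropped because the source policy's actions there are evenly split, so states in that region fall back to the policy being learned on the target task.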
Pages: 14
Related papers
50 in total
  • [31] Adaptive Jamming Decision-Making Against FHSS Communications via Inexpert Demonstrations Assisted Meta Reinforcement Learning
    Rao, Ning
    Xu, Hua
    Qi, Zisen
    Wang, Dan
    Peng, Xiang
    Jiang, Lei
    IEEE COMMUNICATIONS LETTERS, 2025, 29 (01) : 105 - 109
  • [32] A Dynamic and Context-aware Model of Knowledge Transfer and Learning using a Decision Making Perspective
    Giacchi, Evelina
    La Corte, Aurelio
    Di Pietro, Eleonora
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON COMPLEX INFORMATION SYSTEMS (COMPLEXIS), 2016 : 66 - 73
  • [33] Semantic decision Trees: A new learning system for the ID3-Based algorithm using a knowledge base
    Chanmee, Sirichanya
    Kesorn, Kraisak
    ADVANCED ENGINEERING INFORMATICS, 2023, 58
  • [34] Supervised Learning Approach for Knowledge Extraction & Decision-Making Process using Genome Sequence in Bioinformatics
    Almutairi, May Abdullah
    Baig, Abdul Rauf
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (12) : 149 - 158
  • [35] Driving Tasks Transfer Using Deep Reinforcement Learning for Decision-Making of Autonomous Vehicles in Unsignalized Intersection
    Shu, Hong
    Liu, Teng
    Mu, Xingyu
    Cao, Dongpu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 41 - 52
  • [36] Adaptive target localization under uncertainty using Multi-Agent Deep Reinforcement Learning with knowledge transfer
    Alagha, Ahmed
    Mizouni, Rabeb
    Singh, Shakti
    Bentahar, Jamal
    Otrok, Hadi
    INTERNET OF THINGS, 2025, 29
  • [37] Probabilistic Wind Power Forecasting Approach via Instance-Based Transfer Learning Embedded Gradient Boosting Decision Trees
    Cai, Long
    Gu, Jie
    Ma, Jinghuan
    Jin, Zhijian
    ENERGIES, 2019, 12 (01)
  • [38] Offshore Petroleum Leaking Source Detection Method From Remote Sensing Data via Deep Reinforcement Learning With Knowledge Transfer
    Wang, Yuewei
    Wang, Lizhe
    Chen, Xiaodao
    Liang, Dong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 5826 - 5840
  • [39] Human-Inspired Meta-Reinforcement Learning Using Bayesian Knowledge and Enhanced Deep Q-Network
    Ho, Joshua
    Wang, Chien-Min
    King, Chung-Ta
    You, Yi-Hsin
    Feng, Chi-Wei
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2024, 18 (04) : 547 - 569
  • [40] A multi-robot path-planning algorithm for autonomous navigation using meta-reinforcement learning based on transfer learning
    Wen, Shuhuan
    Wen, Zeteng
    Zhang, Di
    Zhang, Hong
    Wang, Tao
    APPLIED SOFT COMPUTING, 2021, 110 (110)