Transfer reinforcement learning via meta-knowledge extraction using auto-pruned decision trees

被引:8
|
作者
Lan, Yixing [1 ]
Xu, Xin [1 ]
Fang, Qiang [1 ]
Zeng, Yujun [1 ]
Liu, Xinwang [2 ]
Zhang, Xianjian [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Transfer reinforcement learning; Meta-knowledge; Explainable artificial intelligence; POLICIES;
D O I
10.1016/j.knosys.2022.108221
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transfer reinforcement learning (RL) has recently received increasing attention to make RL agents have better learning performance in target Markov decision problems (MDPs) by using the knowledge learned in source MDPs. However, it is still an open and challenging problem to improve the transfer capability and interpretability of RL algorithms. In this paper, we propose a novel transfer reinforcement learning approach via meta-knowledge extraction using auto-pruned decision trees. In source MDPs, pre-trained policies are firstly learned via RL algorithms using general function approximators. Then, a meta-knowledge extraction algorithm is designed with an auto-pruned decision tree model, where the meta-knowledge is learned by re-training the auto-pruned decision tree based on the data samples generated from the pre-trained policies. The state spaces of meta-knowledge are determined by estimating the uncertainty of state-action pairs in pre-trained policies based on the entropy value of leaf nodes. In target MDPs, according to whether the state is in the state set of meta-knowledge, a hybrid policy is generated by integrating the meta-knowledge and the policies learned on the target MDPs. Based on the proposed transfer RL approach, two meta-knowledge-based transfer reinforcement learning (MKRL) algorithms are developed for MDPs with discrete action spaces and continuous action spaces, respectively. Experimental results in several benchmark tasks show that the MKRL algorithm outperforms other baselines in terms of learning efficiency and interpretability in the target MDPs with generic cases of task similarity. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Knowledge transfer for adaptive maintenance policy optimization in engineering fleets based on meta-reinforcement learning
    Cheng, Jianda
    Cheng, Minghui
    Liu, Yan
    Wu, Jun
    Li, Wei
    Frangopol, Dan M.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 247
  • [22] Improved reinforcement learning in cooperative multi-agent environments using knowledge transfer
    Mahnoosh Mahdavimoghadam
    Amin Nikanjam
    Monireh Abdoos
    The Journal of Supercomputing, 2022, 78 : 10455 - 10479
  • [23] Improved reinforcement learning in cooperative multi-agent environments using knowledge transfer
    Mahdavimoghadam, Mahnoosh
    Nikanjam, Amin
    Abdoos, Monireh
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (08): : 10455 - 10479
  • [24] A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
    Hu, Chunyang
    SYMMETRY-BASEL, 2020, 12 (04):
  • [25] Autonomous Navigation via Deep Reinforcement Learning for Resource Constraint Edge Nodes Using Transfer Learning
    Anwar, Aqeel
    Raychowdhury, Arijit
    IEEE ACCESS, 2020, 8 : 26549 - 26560
  • [26] Knowledge transfer from simple to complex: A safe and efficient reinforcement learning framework for autonomous driving decision-making
    Zhou, Rongliang
    Huang, Jiakun
    Li, Mingjun
    Li, Hepeng
    Cao, Haotian
    Song, Xiaolin
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [27] Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning
    Song, Luona
    Wen, Zhigang
    Teng, Junjie
    Zhang, Jian
    Nicolas, Merveille
    SUSTAINABILITY, 2023, 15 (24)
  • [28] Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
    Ruman, Marko
    Guy, Tatiana V.
    IEEE ACCESS, 2024, 12 : 177204 - 177218
  • [29] Knowledge Transfer in Multi-Objective Multi-Agent Reinforcement Learning via Generalized Policy Improvement
    de Almeida, Vicente N.
    Alegre, Lucas N.
    Bazzan, Ana L. C.
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 21 (01) : 335 - 362
  • [30] A Dual-channel Semi-supervised Learning Framework on Graphs via Knowledge Transfer and Meta-learning
    Qiao, Ziyue
    Wang, Pengyang
    Wang, Pengfei
    Ning, Zhiyuan
    Fu, Yanjie
    Du, Yi
    Zhou, Yuanchun
    Huang, Jianqiang
    Hua, Xian-Sheng
    Xiong, Hui
    ACM TRANSACTIONS ON THE WEB, 2024, 18 (02)