REPAINT: Knowledge Transfer in Deep Reinforcement Learning

Cited by: 0
Authors
Tao, Yunzhe [1]
Genc, Sahika [1]
Chung, Jonathan [1]
Sun, Tao [1]
Mallya, Sunil [1]
Affiliations
[1] Amazon Web Services, AI Labs, Seattle, WA 98121, USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Accelerating learning processes for complex tasks by leveraging previously learned tasks has been one of the most challenging problems in reinforcement learning, especially when the similarity between source and target tasks is low. This work proposes the REPresentation And INstance Transfer (REPAINT) algorithm for knowledge transfer in deep reinforcement learning. REPAINT not only transfers the representation of a pre-trained teacher policy in on-policy learning, but also uses an advantage-based experience selection approach to transfer useful samples collected following the teacher policy in off-policy learning. Our experimental results on several benchmark tasks show that REPAINT significantly reduces the total training time in generic cases of task similarity. In particular, when the source tasks are dissimilar to, or sub-tasks of, the target tasks, REPAINT outperforms other baselines in both training-time reduction and asymptotic performance of return scores.
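The abstract does not spell out how the advantage-based experience selection works, so the following is only a minimal illustrative sketch, not the authors' implementation. It assumes (hypothetically) that each teacher-collected transition carries critic value estimates, that the advantage is a one-step temporal-difference estimate, and that a sample is kept when its advantage exceeds a fixed threshold. The names `Transition`, `one_step_advantage`, `select_by_advantage`, `threshold`, and `gamma` are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Transition:
    """One teacher-collected sample (hypothetical structure)."""
    reward: float
    value: float       # critic estimate V(s) for the current state
    next_value: float  # critic estimate V(s') for the successor state
    done: bool         # True if the episode ended at this step


def one_step_advantage(t: Transition, gamma: float = 0.99) -> float:
    """One-step TD advantage: r + gamma * V(s') - V(s), with no bootstrap at episode end."""
    bootstrap = 0.0 if t.done else gamma * t.next_value
    return t.reward + bootstrap - t.value


def select_by_advantage(
    batch: List[Transition], threshold: float = 0.0, gamma: float = 0.99
) -> List[Transition]:
    """Keep only transitions whose estimated advantage exceeds the threshold (assumed rule)."""
    return [t for t in batch if one_step_advantage(t, gamma) > threshold]


if __name__ == "__main__":
    batch = [
        Transition(reward=1.0, value=0.5, next_value=0.0, done=True),
        Transition(reward=0.0, value=1.0, next_value=0.0, done=True),
        Transition(reward=0.0, value=0.0, next_value=1.0, done=False),
    ]
    kept = select_by_advantage(batch, threshold=0.0)
    print(f"kept {len(kept)} of {len(batch)} transitions")
```

Under these assumptions, samples that the current critic judges to be no better than expected are discarded before the off-policy update, which is one plausible way to keep only "useful" teacher experience when source and target tasks differ.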
Pages: 7145-7155
Page count: 11
Related papers
50 items in total
  • [41] Leveraging Domain Knowledge for Robust Deep Reinforcement Learning in Networking
    Zheng, Ying
    Chen, Haoyu
    Duan, Qingyang
    Lin, Lixiang
    Shao, Yiyang
    Wang, Wei
    Wang, Xin
    Xu, Yuedong
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
  • [42] Deep Reinforcement Learning with IoT System Characterization and Knowledge Adaptation
    Zou, Jiadao
    Zhang, Qingxue
    [J]. 2022 IEEE 13TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2022, : 24 - 27
  • [43] Sentiment and Knowledge Based Algorithmic Trading with Deep Reinforcement Learning
    Nan, Abhishek
    Perumal, Anandh
    Zaiane, Osmar R.
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT I, 2022, 13426 : 167 - 180
  • [44] Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation
    Chen, Xiaocong
    Huang, Chaoran
    Yao, Lina
    Wang, Xianzhi
    Liu, Wei
    Zhang, Wenjie
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [45] Routing Optimization With Deep Reinforcement Learning in Knowledge Defined Networking
    He, Qiang
    Wang, Yu
    Wang, Xingwei
    Xu, Weiqiang
    Li, Fuliang
    Yang, Kaiqi
    Ma, Lianbo
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (02) : 1444 - 1455
  • [46] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
    Liu, Jiayi
    Wang, Gang
    Guo, Xiangke
    Wang, Siyuan
    Fu, Qiang
    [J]. IEEE ACCESS, 2022, 10 : 114402 - 114413
  • [47] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Morales, Eduardo F.
    Murrieta-Cid, Rafael
    Becerra, Israel
    Esquivel-Basaldua, Marco A.
    [J]. INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
  • [48] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Eduardo F. Morales
    Rafael Murrieta-Cid
    Israel Becerra
    Marco A. Esquivel-Basaldua
    [J]. Intelligent Service Robotics, 2021, 14 : 773 - 805
  • [49] An Optimal Transfer of Knowledge in Reinforcement Learning through Greedy Approach
    Kumari, Deepika
    Chaudhary, Mahima
    Mishra, Ashish Kumar
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [50] Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer
    Zhou, Luowei
    Yang, Pei
    Chen, Chunlin
    Gao, Yang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) : 1238 - 1250