Transferring Domain Knowledge with an Adviser in Continuous Tasks

被引:1
|
作者
Wijesinghe, Rukshan [1 ,2 ]
Vithanage, Kasun [2 ]
Tissera, Dumindu [1 ,2 ]
Xavier, Alex [2 ]
Fernando, Subha [2 ]
Samarawickrama, Jayathu [1 ,2 ]
机构
[1] Univ Moratuwa, Dept Elect & Telecommun Engn, Moratuwa, Sri Lanka
[2] Univ Moratuwa, CODEGEN QBITS Lab, Moratuwa, Sri Lanka
关键词
Actor-critic architecture; Deterministic policy gradient; Reinforcement learning; Transferring domain knowledge;
D O I
10.1007/978-3-030-75768-7_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in Reinforcement Learning (RL) have surpassed human-level performance in many simulated environments. However, existing reinforcement learning techniques are incapable of explicitly incorporating already known domain-specific knowledge into the learning process. Therefore, the agents have to explore and learn the domain knowledge independently through a trial and error approach, which consumes both time and resources to make valid responses. Hence, we adapt the Deep Deterministic Policy Gradient (DDPG) algorithm to incorporate an adviser, which allows integrating domain knowledge in the form of pre-learned policies or pre-defined relationships to enhance the agent's learning process. Our experiments on OpenAi Gym benchmark tasks show that integrating domain knowledge through advisers expedites the learning and improves the policy towards better optima.
引用
收藏
页码:194 / 205
页数:12
相关论文
共 50 条
  • [1] TASKS FOR A BRITISH SCIENCE ADVISER
    不详
    [J]. NATURE, 1995, 376 (6542) : 623 - 624
  • [2] Knowledge Mining and Transferring for Domain Adaptive Object Detection
    Tian, Kun
    Zhang, Chenghao
    Wang, Ying
    Xiang, Shiming
    Pan, Chunhong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9113 - 9122
  • [3] Transferring Knowledge from Another Domain for Learning Action Models
    Zhuo, Hankui
    Yang, Qiang
    Hu, Derek Hao
    Li, Lei
    [J]. PRICAI 2008: TRENDS IN ARTIFICIAL INTELLIGENCE, 2008, 5351 : 1110 - +
  • [4] Techniques for transferring host-pathogen protein interactions knowledge to new tasks
    Kshirsagar, Meghana
    Schleker, Sylvia
    Carbonell, Jaime
    Klein-Seetharaman, Judith
    [J]. FRONTIERS IN MICROBIOLOGY, 2015, 6
  • [5] Transferring knowledge
    Graham, PJ
    [J]. NOUS, 2000, 34 (01): : 131 - 152
  • [6] PREVENTIVE TASKS OF DIETARY ASSISTANT AND NUTRITIONAL ADVISER
    KRANHOLDT, U
    RUEF, V
    [J]. SOZIAL-UND PRAVENTIVMEDIZIN, 1978, 23 (03): : 208 - 210
  • [7] Transferring Cross-domain Knowledge for Video Sign Language Recognition
    Li, Dongxu
    Yu, Xin
    Xu, Chenchen
    Petersson, Lars
    Li, Hongdong
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6204 - 6213
  • [8] Transferring Structured Knowledge in Unsupervised Domain Adaptation of a Sleep Staging Network
    Yoo, Chaehwa
    Lee, Hyang Woon
    Kang, Je-Won
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (03) : 1273 - 1284
  • [9] Transferring Knowledge Fragments for Learning Distance Metric from a Heterogeneous Domain
    Luo, Yong
    Wen, Yonggang
    Liu, Tongliang
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (04) : 1013 - 1026
  • [10] Domain adaptive object detection with model-agnostic knowledge transferring
    Tian, Kun
    Zhang, Chenghao
    Wang, Ying
    Xiang, Shiming
    [J]. NEURAL NETWORKS, 2023, 161 : 213 - 227