Transferring Domain Knowledge with an Adviser in Continuous Tasks

被引:1
|
作者
Wijesinghe, Rukshan [1 ,2 ]
Vithanage, Kasun [2 ]
Tissera, Dumindu [1 ,2 ]
Xavier, Alex [2 ]
Fernando, Subha [2 ]
Samarawickrama, Jayathu [1 ,2 ]
机构
[1] Univ Moratuwa, Dept Elect & Telecommun Engn, Moratuwa, Sri Lanka
[2] Univ Moratuwa, CODEGEN QBITS Lab, Moratuwa, Sri Lanka
关键词
Actor-critic architecture; Deterministic policy gradient; Reinforcement learning; Transferring domain knowledge;
D O I
10.1007/978-3-030-75768-7_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in Reinforcement Learning (RL) have surpassed human-level performance in many simulated environments. However, existing reinforcement learning techniques are incapable of explicitly incorporating already known domain-specific knowledge into the learning process. Therefore, the agents have to explore and learn the domain knowledge independently through a trial and error approach, which consumes both time and resources to make valid responses. Hence, we adapt the Deep Deterministic Policy Gradient (DDPG) algorithm to incorporate an adviser, which allows integrating domain knowledge in the form of pre-learned policies or pre-defined relationships to enhance the agent's learning process. Our experiments on OpenAi Gym benchmark tasks show that integrating domain knowledge through advisers expedites the learning and improves the policy towards better optima.
引用
收藏
页码:194 / 205
页数:12
相关论文
共 50 条
  • [31] Transferring Tacit Knowledge in Process Control
    Osvalder, Anna-Lisa
    Colmsjo, Anders
    [J]. ADVANCES IN HUMAN FACTORS, BUSINESS MANAGEMENT, TRAINING AND EDUCATION, 2017, 498 : 485 - 491
  • [32] Transferring Design Knowledge Challenges and Opportunities
    Hu, Jun
    Chen, Wei
    Bartneck, Christoph
    Rauterberg, Matthias
    [J]. ENTERTAINMENT FOR EDUCATION: DIGITAL TECHNIQUES AND SYSTEMS, 2010, 6249 : 165 - 172
  • [33] Domain Knowledge Transferring for Pre-trained Language Model via Calibrated Activation Boundary Distillation
    Choi, Dongha
    Choi, HongSeok
    Lee, Hyunju
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1658 - 1669
  • [34] Transferring knowledge between learning systems
    Paplinski, Andrew P.
    Mount, William M.
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2014), 2014,
  • [35] SVBRDF Reconstruction by Transferring Lighting Knowledge
    Zhu, Pengfei
    Lai, Shuichang
    Chen, Mufan
    Guo, Jie
    Liu, Yifan
    Guo, Yanwen
    [J]. COMPUTER GRAPHICS FORUM, 2023, 42 (07)
  • [36] TRANSFERRING EXPERT KNOWLEDGE TO THE MACHINE OPERATOR
    FORRER, MG
    [J]. KUNSTSTOFFE-GERMAN PLASTICS, 1990, 80 (05): : CA111 - +
  • [37] TRANSFERRING EXPERT KNOWLEDGE TO THE MACHINE OPERATOR
    FORRER, MG
    [J]. F&M-FEINWERKTECHNIK & MESSTECHNIK, 1990, 98 (05): : CA111 - +
  • [38] TRANSFERRING PERSONALIZED KNOWLEDGE - GURUS DILEMMA
    MARJIT, S
    [J]. ECONOMIC AND POLITICAL WEEKLY, 1995, 30 (27) : 1663 - 1665
  • [39] Transferring Knowledge from a RNN to a DNN
    Chan, William
    Ke, Nan Rosemary
    Lane, Ian
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3264 - 3268
  • [40] Restructuring information for managing and transferring knowledge
    Le Vie, DS
    [J]. IEEE INTERNATIONAL PROFESSIONAL COMMUNICATION CONFERENCE - PROCEEDINGS, VOL 2: TECHNICAL PAPERS, 1998, : 321 - 328