Transferring Domain Knowledge with an Adviser in Continuous Tasks

Cited by: 1

Authors:

Wijesinghe, Rukshan [1,2]

Vithanage, Kasun [2]

Tissera, Dumindu [1,2]

Xavier, Alex [2]

Fernando, Subha [2]

Samarawickrama, Jayathu [1,2]

Affiliations:

[1] Univ Moratuwa, Dept Elect & Telecommun Engn, Moratuwa, Sri Lanka

[2] Univ Moratuwa, CODEGEN QBITS Lab, Moratuwa, Sri Lanka

Source:

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III | 2021 / Vol. 12714

Keywords:

Actor-critic architecture; Deterministic policy gradient; Reinforcement learning; Transferring domain knowledge;

DOI:

10.1007/978-3-030-75768-7_16

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405;

Abstract:

Recent advances in Reinforcement Learning (RL) have surpassed human-level performance in many simulated environments. However, existing reinforcement learning techniques are incapable of explicitly incorporating already known domain-specific knowledge into the learning process. Therefore, the agents have to explore and learn the domain knowledge independently through a trial-and-error approach, which consumes both time and resources to make valid responses. Hence, we adapt the Deep Deterministic Policy Gradient (DDPG) algorithm to incorporate an adviser, which allows integrating domain knowledge in the form of pre-learned policies or pre-defined relationships to enhance the agent's learning process. Our experiments on OpenAI Gym benchmark tasks show that integrating domain knowledge through advisers expedites the learning and improves the policy towards better optima.

Pages: 194-205

Number of pages: 12
