Knowledge-based Exploration for Reinforcement Learning in Self-Organizing Neural Networks

被引：10

作者：

Teng, Teck-Hou ^{[1
]}

Tan, Ah-Hwee ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore

来源：

2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 2 | 2012年

关键词：

Reinforcement Learning; Self-Organizing Neural Network; Directed Exploration; Rule-Based System; ARCHITECTURE; PURSUIT; EVASION;

D O I：

10.1109/WI-IAT.2012.154

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Exploration is necessary during reinforcement learning to discover new solutions in a given problem space. Most reinforcement learning systems, however, adopt a simple strategy, by randomly selecting an action among all the available actions. This paper proposes a novel exploration strategy, known as Knowledge-based Exploration, for guiding the exploration of a family of self-organizing neural networks in reinforcement learning. Specifically, exploration is directed towards unexplored and favorable action choices while steering away from those negative action choices that are likely to fail. This is achieved by using the learned knowledge of the agent to identify prior action choices leading to low Q-values in similar situations. Consequently, the agent is expected to learn the right solutions in a shorter time, improving overall learning efficiency. Using a Pursuit-Evasion problem domain, we evaluate the efficacy of the knowledge-based exploration strategy, in terms of task performance, rate of learning and model complexity. Comparison with random exploration and three other heuristic-based directed exploration strategies show that Knowledge-based Exploration is significantly more effective and robust for reinforcement learning in real time.

引用

页码：332 / 339

页数：8

共 50 条

[21] A self-organizing fuzzy neural networks
Lin, Haisheng
Gao, X. Z.
Huang, Xianlin
Song, Zhuoyue
SOFT COMPUTING IN INDUSTRIAL APPLICATIONS: RECENT AND EMERGING METHODS AND TECHNIQUES, 2007, 39 : 200 - +
[22] Self-organizing neural systems based on predictive learning
Rao, RPN
Sejnowski, TJ
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2003, 361 (1807): : 1149 - 1175
[23] Self-organizing neural tree networks
Milone, DH
Sáez, JC
Simón, G
Rufiner, HL
PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 20, PTS 1-6: BIOMEDICAL ENGINEERING TOWARDS THE YEAR 2000 AND BEYOND, 1998, 20 : 1348 - 1351
[24] Self-Organizing Fusion Neural Networks
Wang, Jung-Hua
Tseng, Chun-Shun
Shen, Sih-Yin
Jheng, Ya-Yun
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2007, 11 (06) : 610 - 619
[25] Self-organizing neural networks in chemistry
Gasteiger, Johann
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
[26] Knowledge-Based Grasp Planning Using Dynamic Self-Organizing Network
Yang, Shiyi
Jeon, Soo
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9369 - 9376
[27] Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback
Tan, Ah-Hwee
Lu, Ning
Xiao, Dan
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (02): : 230 - 244
[28] Chimaera networks: Temporal self-organizing artificial neural networks for sequence learning
Jansen, Peter
CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2008, 62 (04): : 276 - 276
[29] Self-organizing visual servo system based on neural networks
Hashimoto, Hideki
Kubota, Takashi
Kudou, Masaaki
Harashima, Fumio
IEEE Control Systems Magazine, 1992, 12 (02): : 31 - 36
[30] A prediction algorithm based on self-organizing fuzzy neural networks
Liu, M
Gu, YD
Chai, YC
2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1688 - 1690

← 1 2 3 4 5 →