A teaching strategy for memory-based control

被引：12

作者：

Sheppard, JW

Salzberg, SL

机构：

[1] The Johns Hopkins University,Department of Computer Science

来源：

ARTIFICIAL INTELLIGENCE REVIEW | 1997年 / 11卷 / 1-5期

关键词：

lazy learning; nearest neighbor; genetic algorithms; differential games; pursuit games; teaching; reinforcement learning;

D O I：

10.1023/A:1006597715165

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Combining different machine learning algorithms in the same system can produce benefits above and beyond what either method could achieve alone. This paper demonstrates that genetic algorithms can be used in conjunction with lazy learning to solve examples of a difficult class of delayed reinforcement learning problems better than either method alone. This class, the class of differential games, includes numerous important control problems that arise in robotics, planning, game playing, and other areas, and solutions for differential games suggest solution strategies for the general class of planning and control problems. We conducted a series of experiments applying three learning approaches - lazy Q-learning, k-nearest neighbor (k-NN), and a genetic algorithm - to a particular differential game called a pursuit game. Our experiments demonstrate that Ic-NN had great difficulty solving the problem, while a lazy version of Q-learning performed moderately well and the genetic algorithm performed even better. These results motivated the next step in the experiments, where we hypothesized Ic-NN was having difficulty because it did not have good examples - a common source of difficulty for lazy learning. Therefore, we used the genetic algorithm as a bootstrapping method for Ic-NN to create a system to provide these examples. Our experiments demonstrate that the resulting joint system learned to solve the pursuit games with a high degree of accuracy outperforming either method alone - and with relatively small memory requirements.

引用

页码：343 / 370

页数：28

共 50 条

[1] A Teaching Strategy for Memory-Based Control
John W. Sheppard
Steven L. Salzberg
[J]. Artificial Intelligence Review, 1997, 11 : 343 - 370
[2] Quantized feedback fuzzy sliding mode control design via memory-based strategy
Ran, Suzhen
Xue, Yanmei
Zheng, Bo-Chao
Wang, Zhenyou
[J]. APPLIED MATHEMATICS AND COMPUTATION, 2017, 298 : 283 - 295
[3] Hybrid memory-based control of robotic manipulators
Lee, CY
Lee, JJ
[J]. IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 433 - 438
[4] Memory-Based Logic Control for Embedded Systems
Dvorak, Vaclav
Mikusek, Petr
[J]. INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, 2015, 325 : 367 - 379
[5] Control of Memory Retrieval Alters Memory-Based Eye Movements
Kulkarni, Mrinmayi
Nickel, Allison E.
Minor, Greta N.
Hannula, Deborah E.
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2023,
[6] Improving the effectiveness of an interruption lag by inducing a memory-based strategy
Morgan, Phillip L.
Patrick, John
Tiley, Leyanne
[J]. ACTA PSYCHOLOGICA, 2013, 142 (01) : 87 - 95
[7] Role of strategy update rules in the spatial memory-based mixed strategy games
Fan Zhang
Juan Wang
Hongyu Gao
Xiaopeng Li
Chengyi Xia
[J]. The European Physical Journal B, 2021, 94
[8] Role of strategy update rules in the spatial memory-based mixed strategy games
Zhang, Fan
Wang, Juan
Gao, Hongyu
Li, Xiaopeng
Xia, Chengyi
[J]. EUROPEAN PHYSICAL JOURNAL B, 2021, 94 (01):
[9] Short Memory-Based Human Strategy Modeling in Social Dilemmas
Yang, Xiang-Hao
Huang, Hui-Yun
Zhang, Yi-Chao
Wang, Jia-Sheng
Guan, Ji-Hong
Zhou, Shui-Geng
[J]. MATHEMATICS, 2023, 11 (12)
[10] Efficient memory-based neural network for control application
Li, CK
[J]. COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION - NEURAL NETWORKS & ADVANCED CONTROL STRATEGIES, 1999, 54 : 81 - 86

← 1 2 3 4 5 →