A teaching strategy for memory-based control

被引:12
|
作者
Sheppard, JW
Salzberg, SL
机构
[1] The Johns Hopkins University,Department of Computer Science
关键词
lazy learning; nearest neighbor; genetic algorithms; differential games; pursuit games; teaching; reinforcement learning;
D O I
10.1023/A:1006597715165
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Combining different machine learning algorithms in the same system can produce benefits above and beyond what either method could achieve alone. This paper demonstrates that genetic algorithms can be used in conjunction with lazy learning to solve examples of a difficult class of delayed reinforcement learning problems better than either method alone. This class, the class of differential games, includes numerous important control problems that arise in robotics, planning, game playing, and other areas, and solutions for differential games suggest solution strategies for the general class of planning and control problems. We conducted a series of experiments applying three learning approaches - lazy Q-learning, k-nearest neighbor (k-NN), and a genetic algorithm - to a particular differential game called a pursuit game. Our experiments demonstrate that Ic-NN had great difficulty solving the problem, while a lazy version of Q-learning performed moderately well and the genetic algorithm performed even better. These results motivated the next step in the experiments, where we hypothesized Ic-NN was having difficulty because it did not have good examples - a common source of difficulty for lazy learning. Therefore, we used the genetic algorithm as a bootstrapping method for Ic-NN to create a system to provide these examples. Our experiments demonstrate that the resulting joint system learned to solve the pursuit games with a high degree of accuracy outperforming either method alone - and with relatively small memory requirements.
引用
收藏
页码:343 / 370
页数:28
相关论文
共 50 条
  • [1] A Teaching Strategy for Memory-Based Control
    John W. Sheppard
    Steven L. Salzberg
    [J]. Artificial Intelligence Review, 1997, 11 : 343 - 370
  • [2] Quantized feedback fuzzy sliding mode control design via memory-based strategy
    Ran, Suzhen
    Xue, Yanmei
    Zheng, Bo-Chao
    Wang, Zhenyou
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2017, 298 : 283 - 295
  • [3] Hybrid memory-based control of robotic manipulators
    Lee, CY
    Lee, JJ
    [J]. IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 433 - 438
  • [4] Memory-Based Logic Control for Embedded Systems
    Dvorak, Vaclav
    Mikusek, Petr
    [J]. INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, 2015, 325 : 367 - 379
  • [5] Control of Memory Retrieval Alters Memory-Based Eye Movements
    Kulkarni, Mrinmayi
    Nickel, Allison E.
    Minor, Greta N.
    Hannula, Deborah E.
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2023,
  • [6] Improving the effectiveness of an interruption lag by inducing a memory-based strategy
    Morgan, Phillip L.
    Patrick, John
    Tiley, Leyanne
    [J]. ACTA PSYCHOLOGICA, 2013, 142 (01) : 87 - 95
  • [7] Role of strategy update rules in the spatial memory-based mixed strategy games
    Fan Zhang
    Juan Wang
    Hongyu Gao
    Xiaopeng Li
    Chengyi Xia
    [J]. The European Physical Journal B, 2021, 94
  • [8] Role of strategy update rules in the spatial memory-based mixed strategy games
    Zhang, Fan
    Wang, Juan
    Gao, Hongyu
    Li, Xiaopeng
    Xia, Chengyi
    [J]. EUROPEAN PHYSICAL JOURNAL B, 2021, 94 (01):
  • [9] Short Memory-Based Human Strategy Modeling in Social Dilemmas
    Yang, Xiang-Hao
    Huang, Hui-Yun
    Zhang, Yi-Chao
    Wang, Jia-Sheng
    Guan, Ji-Hong
    Zhou, Shui-Geng
    [J]. MATHEMATICS, 2023, 11 (12)
  • [10] Efficient memory-based neural network for control application
    Li, CK
    [J]. COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION - NEURAL NETWORKS & ADVANCED CONTROL STRATEGIES, 1999, 54 : 81 - 86