Hidden state and reinforcement learning with instance-based state identification

被引:33
|
作者
McCallum, RA
机构
[1] Department of Computer Science, University of Rochester, Rochester
基金
美国国家科学基金会;
关键词
D O I
10.1109/3477.499796
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real robots with real sensors are not omniscient, When a robot's next course of action depends on information that is hidden from the sensors because of problems such as occlusion, restricted range, bounded field of view and limited attention, we say the robot suffers from the hidden state problem, State identification techniques use history information to uncover hidden state, Some previous approaches to encoding history include: finite state machines [12], [28], recurrent neural networks [25] and genetic programming with indexed memory [49]. A chief disadvantage of all these techniques is their long training time, This paper presents instance-based state identification, a new approach to reinforcement learning with state identification that learns with much fewer training steps. Noting that learning with history and learning in continuous spaces both share the property that they begin without knowing the granularity of the state space, the approach applies instance-based (or ''memory-based'') learning to history sequences-instead of recording instances in a continuous geometrical space, we record instances in action-percept-reward sequence space. The first implementation of this approach, called Nearest Sequence Memory, learns with an order of magnitude fewer steps than several previous approaches.
引用
收藏
页码:464 / 473
页数:10
相关论文
共 50 条
  • [1] Instance-based Generalization in Reinforcement Learning
    Bertran, Martin
    Martinez, Natalia
    Phielipp, Mariano
    Sapiro, Guillermo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Local instance-based transfer learning for reinforcement learning
    Li, Xiaoguang
    Ji, Wanting
    Huang, Jidong
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [3] An instance-based state representation for network repair
    Littman, ML
    Ravi, N
    Fenson, E
    Howard, R
    [J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 287 - 292
  • [4] Instance-Based Ensemble Selection Using Deep Reinforcement Learning
    Liu, Zhengshang
    Ramamohanarao, Kotagiri
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] Improving the Robustness of Instance-Based Reinforcement Learning Robots by Metalearning
    Yasuda, Toshiyuki
    Araki, Kousuke
    Ohkura, Kazuhiro
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (08) : 1065 - 1072
  • [6] Making Instance-based Learning Theory usable and understandable: The Instance-based Learning Tool
    Dutt, Varun
    Gonzalez, Cleotilde
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2012, 28 (04) : 1227 - 1240
  • [7] Instance-based learning by searching
    Fuchs, M
    [J]. INTELLIGENT INFORMATION SYSTEMS, (IIS'97) PROCEEDINGS, 1997, : 189 - 193
  • [8] Instance-based defense against adversarial attacks in Deep Reinforcement Learning
    Garcia, Javier
    Sagredo, Ismael
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
  • [9] Instance-based reinforcement learning for robot path finding in continuous space
    Nakamura, J
    Ohnishi, S
    Ohkura, K
    Ueda, K
    [J]. SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 1229 - 1234
  • [10] INSTANCE-BASED LEARNING ALGORITHMS
    AHA, DW
    KIBLER, D
    ALBERT, MK
    [J]. MACHINE LEARNING, 1991, 6 (01) : 37 - 66