Hidden state and reinforcement learning with instance-based state identification

Cited by: 33

Authors:

McCallum, RA

Affiliation:

[1] Department of Computer Science, University of Rochester, Rochester

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1996 / Vol. 26 / No. 03

Funding:

National Science Foundation (USA);

Keywords:

DOI:

10.1109/3477.499796

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Real robots with real sensors are not omniscient. When a robot's next course of action depends on information that is hidden from the sensors because of problems such as occlusion, restricted range, bounded field of view and limited attention, we say the robot suffers from the hidden state problem. State identification techniques use history information to uncover hidden state. Some previous approaches to encoding history include finite state machines [12], [28], recurrent neural networks [25] and genetic programming with indexed memory [49]. A chief disadvantage of all these techniques is their long training time. This paper presents instance-based state identification, a new approach to reinforcement learning with state identification that learns with far fewer training steps. Noting that learning with history and learning in continuous spaces both begin without knowing the granularity of the state space, the approach applies instance-based (or "memory-based") learning to history sequences: instead of recording instances in a continuous geometrical space, we record instances in action-percept-reward sequence space. The first implementation of this approach, called Nearest Sequence Memory, learns with an order of magnitude fewer steps than several previous approaches.
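
The idea of recording instances in action-percept-reward sequence space can be illustrated with a small sketch. The abstract does not specify the distance measure or the lookup procedure, so everything below is an assumption made for illustration: the names `Step`, `match_length` and `nearest_sequences` are hypothetical, and similarity is taken to be the number of consecutive identical steps shared by two points in the recorded history. It is not the paper's implementation of Nearest Sequence Memory.

```python
from typing import List, Tuple

# One recorded instance: (action, percept, reward). Hypothetical layout.
Step = Tuple[str, str, float]


def match_length(history: List[Step], i: int, j: int) -> int:
    """Count how many consecutive steps ending at i and at j are identical."""
    n = 0
    while i - n >= 0 and j - n >= 0 and history[i - n] == history[j - n]:
        n += 1
    return n


def nearest_sequences(history: List[Step], k: int) -> List[int]:
    """Return indices of the k earlier steps whose recent history
    best matches the history ending at the latest step."""
    t = len(history) - 1
    scored = [(match_length(history, i, t), i) for i in range(t)]
    # Longest match first; ties go to the more recent instance (an assumption).
    scored.sort(key=lambda pair: (-pair[0], -pair[1]))
    return [i for _, i in scored[:k]]
```

In a learner built along these lines, the instances returned by `nearest_sequences` would play the role that geometric nearest neighbours play in continuous-space instance-based learning: two moments with identical current percepts can still be told apart if the steps leading up to them differ.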


Pages: 464-473

Page count: 10

50 items in total

[1] Instance-based Generalization in Reinforcement Learning
Bertran, Martin
Martinez, Natalia
Phielipp, Mariano
Sapiro, Guillermo
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[2] Local instance-based transfer learning for reinforcement learning
Li, Xiaoguang
Ji, Wanting
Huang, Jidong
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[3] An instance-based state representation for network repair
Littman, ML
Ravi, N
Fenson, E
Howard, R
[J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 287 - 292
[4] Instance-Based Ensemble Selection Using Deep Reinforcement Learning
Liu, Zhengshang
Ramamohanarao, Kotagiri
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[5] Improving the Robustness of Instance-Based Reinforcement Learning Robots by Metalearning
Yasuda, Toshiyuki
Araki, Kousuke
Ohkura, Kazuhiro
[J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (08) : 1065 - 1072
[6] Making Instance-based Learning Theory usable and understandable: The Instance-based Learning Tool
Dutt, Varun
Gonzalez, Cleotilde
[J]. COMPUTERS IN HUMAN BEHAVIOR, 2012, 28 (04) : 1227 - 1240
[7] Instance-based learning by searching
Fuchs, M
[J]. INTELLIGENT INFORMATION SYSTEMS, (IIS'97) PROCEEDINGS, 1997, : 189 - 193
[8] Instance-based defense against adversarial attacks in Deep Reinforcement Learning
Garcia, Javier
Sagredo, Ismael
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
[9] Instance-based reinforcement learning for robot path finding in continuous space
Nakamura, J
Ohnishi, S
Ohkura, K
Ueda, K
[J]. SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 1229 - 1234
[10] INSTANCE-BASED LEARNING ALGORITHMS
AHA, DW
KIBLER, D
ALBERT, MK
[J]. MACHINE LEARNING, 1991, 6 (01) : 37 - 66
