Learning skills in reinforcement learning using relative novelty

被引：0

作者：

Simsek, Ö ^{[1
]}

Barto, AG ^{[1
]}

机构：

[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA

来源：

ABSTRACTION, REFORMULATION AND APPROXIMATION, PROCEEDINGS | 2005年 / 3607卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a method for automatically creating a set of useful temporally-extended actions, or skills, in reinforcement learning. Our method identifies states that allow the agent to transition to a different region of the state space for example, a doorway between two rooms-and generates temporally-extended actions that efficiently take the agent to these states. In identifying such states we use the concept of relative novelty, a measure of how much short-term novelty a state introduces to the agent. The resulting algorithm is simple, has low computational complexity, and is shown to improve performance in a number of problems.

引用

页码：367 / 374

页数：8

共 50 条

[1] The effect of novelty on reinforcement learning
Houillon, A.
Lorenz, R. C.
Boehmer, W.
Rapp, M. A.
Heinz, A.
Gallinat, J.
Obermayer, K.
[J]. DECISION MAKING: NEURAL AND BEHAVIOURAL APPROACHES, 2013, 202 : 415 - 439
[2] Learning Pushing Skills Using Object Detection and Deep Reinforcement Learning
Guo, Wei
Dong, Guantao
Chen, Chen
Li, Mantian
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 469 - 474
[3] Dynamic Terrain Traversal Skills Using Reinforcement Learning
Peng, Xue Bin
Berseth, Glen
van de Panne, Michiel
[J]. ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04):
[4] Novelty Seeking Multiagent Evolutionary Reinforcement Learning
Aydeniz, Ayhan Alp
Loftin, Robert
Tumer, Kagan
[J]. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 402 - 410
[5] Novelty Detector for Reinforcement Learning Based on Forecasting
Gregor, Michal
Spalek, Juraj
[J]. 2014 IEEE 12TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2014, : 73 - 78
[6] Novelty and Inductive Generalization in Human Reinforcement Learning
Gershman, Samuel J.
Niv, Yael
[J]. TOPICS IN COGNITIVE SCIENCE, 2015, 7 (03) : 391 - 415
[7] Learning Basketball Dribbling Skills Using Trajectory Optimization and Deep Reinforcement Learning
Liu, Libin
Hodgins, Jessica
[J]. ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
[8] Attention and Relative Novelty in Human Perceptual Learning
Wang, Tony
Mitchell, Chris J.
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL BEHAVIOR PROCESSES, 2011, 37 (04): : 436 - 445
[9] Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning
Li, Tianyu
Lambert, Nathan
Calandra, Roberto
Meier, Franziska
Rai, Akshara
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 413 - 419
[10] Learning visual path–following skills for industrial robot using deep reinforcement learning
Guoliang Liu
Wenlei Sun
Wenxian Xie
Yangyang Xu
[J]. The International Journal of Advanced Manufacturing Technology, 2022, 122 : 1099 - 1111

← 1 2 3 4 5 →