Learning skills in reinforcement learning using relative novelty

Authors
Simsek, Ö [1]
Barto, AG [1]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present a method for automatically creating a set of useful temporally-extended actions, or skills, in reinforcement learning. Our method identifies states that allow the agent to transition to a different region of the state space (for example, a doorway between two rooms) and generates temporally-extended actions that efficiently take the agent to these states. To identify such states, we use the concept of relative novelty, a measure of how much short-term novelty a state introduces to the agent. The resulting algorithm is simple, has low computational complexity, and is shown to improve performance in a number of problems.
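
The abstract does not spell out how relative novelty is computed, so the following is only a minimal sketch under stated assumptions: the novelty of a set of states is taken to be the inverse square root of their mean visit count, and the relative novelty of a state is the ratio of the novelty of the states that follow it (itself included) to the novelty of the states that precede it. The function names `novelty` and `relative_novelty`, the `window` parameter, and the toy trajectory are hypothetical and chosen for illustration.

```python
from collections import Counter
from math import sqrt


def novelty(states, visits):
    """Assumed definition: 1 / sqrt(mean visit count) over a set of states."""
    mean_visits = sum(visits[s] for s in states) / len(states)
    return 1.0 / sqrt(mean_visits)


def relative_novelty(trajectory, window=3):
    """Score each state that has a full window of states on both sides.

    Visit counts are updated online, so a score uses only the counts
    known once the forward window of that state has been observed.
    Returns a list of (state, score) pairs in trajectory order.
    """
    visits = Counter()
    scores = []
    for t, state in enumerate(trajectory):
        visits[state] += 1
        center = t - window + 1  # position whose forward window just completed
        if center >= window:
            before = trajectory[center - window:center]
            after = trajectory[center:center + window]  # includes the state itself
            score = novelty(after, visits) / novelty(before, visits)
            scores.append((trajectory[center], score))
    return scores


if __name__ == "__main__":
    # Hypothetical trajectory: the agent wanders in room A, then passes a doorway.
    trajectory = ["a1", "a2"] * 3 + ["door", "b1", "b2"]
    for state, score in relative_novelty(trajectory, window=3):
        print(f"{state}: {score:.3f}")
```

In this toy run the doorway state receives the highest score, because the states before it are well visited while the states after it are new. A threshold on such scores is one plausible way to pick candidate target states for skills, though the abstract does not confirm the selection rule.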
Pages: 367-374
Number of pages: 8