The successor representation in human reinforcement learning

被引:180
|
作者
Momennejad, I. [1 ]
Russek, E. M. [2 ]
Cheong, J. H. [3 ]
Botvinick, M. M. [4 ,5 ]
Daw, N. D. [1 ]
Gershman, S. J. [6 ]
机构
[1] Princeton Univ, Dept Psychol, Princeton Neurosci Inst, Princeton, NJ 08544 USA
[2] NYU, Ctr Neural Sci, New York, NY 10003 USA
[3] Dartmouth Coll, Dept Psychol & Brain Sci, Hanover, NH 03755 USA
[4] DeepMind, London, England
[5] UCL, Gatsby Computat Neurosci Unit, London, England
[6] Harvard Univ, Dept Psychol, Ctr Brain Sci, 33 Kirkland St, Cambridge, MA 02138 USA
来源
NATURE HUMAN BEHAVIOUR | 2017年 / 1卷 / 09期
基金
美国国家卫生研究院;
关键词
PREFRONTAL CORTEX; HIPPOCAMPAL REPLAY; COGNITIVE MAPS; MEMORY; PREDICTION; MECHANISMS; CHOICES;
D O I
10.1038/s41562-017-0180-8
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Theories of reward learning in neuroscience have focused on two families of algorithms thought to capture deliberative versus habitual choice. 'Model-based' algorithms compute the value of candidate actions from scratch, whereas 'model-free' algorithms make choice more efficient but less flexible by storing pre-computed action values. We examine an intermediate algorithmic family, the successor representation, which balances flexibility and efficiency by storing partially computed action values: predictions about future events. These pre-computation strategies differ in how they update their choices following changes in a task. The successor representation's reliance on stored predictions about future states predicts a unique signature of insensitivity to changes in the task's sequence of events, but flexible adjustment following changes to rewards. We provide evidence for such differential sensitivity in two behavioural studies with humans. These results suggest that the successor representation is a computational substrate for semi-flexible choice in humans, introducing a subtler, more cognitive notion of habit.
引用
收藏
页码:680 / 692
页数:13
相关论文
共 50 条
  • [1] The successor representation in human reinforcement learning
    I. Momennejad
    E. M. Russek
    J. H. Cheong
    M. M. Botvinick
    N. D. Daw
    S. J. Gershman
    [J]. Nature Human Behaviour, 2017, 1 : 680 - 692
  • [2] Temporal Abstraction in Reinforcement Learning with the Successor Representation
    Machado, Marlos C.
    Barreto, Andre
    Precup, Doina
    Bowling, Michael
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [3] A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
    Fujimoto, Scott
    Meger, David
    Precup, Doina
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [4] Successor Features for Transfer in Reinforcement Learning
    Barreto, Andre
    Dabney, Will
    Munos, Remi
    Hunt, Jonathan J.
    Schaul, Tom
    van Hasselt, Hado
    Silver, David
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [5] Associative Learning of an Unnormalized Successor Representation
    Verosky, Niels J.
    [J]. NEURAL COMPUTATION, 2024, 36 (07) : 1410 - 1423
  • [6] Black-Box Reward Attacks Against Deep Reinforcement Learning Based on Successor Representation
    Cai, Kanting
    Zhu, Xiangbin
    Hu, Zhao-Long
    [J]. IEEE ACCESS, 2022, 10 : 51548 - 51560
  • [7] Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
    Salimibeni, Mohammad
    Mohammadi, Arash
    Malekzadeh, Parvin
    Plataniotis, Konstantinos N.
    [J]. SENSORS, 2022, 22 (04)
  • [8] Transformed Successor Features for Transfer Reinforcement Learning
    Garces, Kiyoshige
    Xuan, Junyu
    Zuo, Hua
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 298 - 309
  • [9] Successor Options: An Option Discovery Framework for Reinforcement Learning
    Ramesh, Rahul
    Tomar, Manan
    Ravindran, Balaraman
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3304 - 3310
  • [10] Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation
    Sherstan, Craig
    Machado, Marlos C.
    Pilarski, Patrick M.
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 2997 - 3003