The Successor Representation and Temporal Context

被引:72
|
作者
Gershman, Samuel J. [1 ,2 ]
Moore, Christopher D. [1 ,2 ]
Todd, Michael T. [1 ,2 ]
Norman, Kenneth A. [1 ,2 ]
Sederberg, Per B. [3 ]
机构
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA
[2] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08540 USA
[3] Ohio State Univ, Dept Psychol, Columbus, OH 43210 USA
基金
美国国家科学基金会;
关键词
RETRIEVAL-PROCESSES; EPISODIC MEMORY; WORKING-MEMORY; TIME-COURSE; MODEL; FUTURE; HIPPOCAMPUS; PREDICTION; RECALL; CONSTRUCTION;
D O I
10.1162/NECO_a_00282
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means of facilitating generalization between states with similar successors. Although reinforcement learning in general has been used extensively as a model of psychological and neural processes, the psychological validity of the successor representation has yet to be explored. An interesting possibility is that the successor representation can be used not only for reinforcement learning but for episodic learning as well. Our main contribution is to show that a variant of the temporal context model (TCM; Howard & Kahana, 2002), an influential model of episodic memory, can be understood as directly estimating the successor representation using the temporal difference learning algorithm (Sutton & Barto, 1998). This insight leads to a generalization of TCM and new experimental predictions. In addition to casting a new normative light on TCM, this equivalence suggests a previously unexplored point of contact between different learning systems.
引用
收藏
页码:1553 / 1568
页数:16
相关论文
共 50 条
  • [1] Temporal Abstraction in Reinforcement Learning with the Successor Representation
    Machado, Marlos C.
    Barreto, Andre
    Precup, Doina
    Bowling, Michael
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [2] A Probabilistic Successor Representation for Context-Dependent Learning
    Geerts, Jesse P. P.
    Gershman, Samuel J. J.
    Burgess, Neil
    Stachenfeld, Kimberly L. L.
    [J]. PSYCHOLOGICAL REVIEW, 2023, : 578 - 597
  • [4] A distributed representation of temporal context
    Howard, MW
    Kahana, MJ
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2002, 46 (03) : 269 - 299
  • [5] Temporal Context Representation and Reasoning
    Moldovan, Dan
    Clark, Christine
    Harabagiu, Sanda
    [J]. 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1099 - 1104
  • [6] Event Representation in Temporal and Geographic Context
    Shaw, Ryan
    Larson, Ray R.
    [J]. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2008, 5173 : 415 - 418
  • [7] Task and context dependence in the representation of temporal information
    Jantzen, KJ
    Sedita, S
    Kelso, JAS
    [J]. JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2006, 28 : S19 - S20
  • [8] Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
    Salimibeni, Mohammad
    Mohammadi, Arash
    Malekzadeh, Parvin
    Plataniotis, Konstantinos N.
    [J]. SENSORS, 2022, 22 (04)
  • [9] Associative Learning of an Unnormalized Successor Representation
    Verosky, Niels J.
    [J]. NEURAL COMPUTATION, 2024, 36 (07) : 1410 - 1423
  • [10] The successor representation in human reinforcement learning
    Momennejad, I.
    Russek, E. M.
    Cheong, J. H.
    Botvinick, M. M.
    Daw, N. D.
    Gershman, S. J.
    [J]. NATURE HUMAN BEHAVIOUR, 2017, 1 (09): : 680 - 692