The Successor Representation and Temporal Context

被引:72
|
作者
Gershman, Samuel J. [1 ,2 ]
Moore, Christopher D. [1 ,2 ]
Todd, Michael T. [1 ,2 ]
Norman, Kenneth A. [1 ,2 ]
Sederberg, Per B. [3 ]
机构
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA
[2] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08540 USA
[3] Ohio State Univ, Dept Psychol, Columbus, OH 43210 USA
基金
美国国家科学基金会;
关键词
RETRIEVAL-PROCESSES; EPISODIC MEMORY; WORKING-MEMORY; TIME-COURSE; MODEL; FUTURE; HIPPOCAMPUS; PREDICTION; RECALL; CONSTRUCTION;
D O I
10.1162/NECO_a_00282
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means of facilitating generalization between states with similar successors. Although reinforcement learning in general has been used extensively as a model of psychological and neural processes, the psychological validity of the successor representation has yet to be explored. An interesting possibility is that the successor representation can be used not only for reinforcement learning but for episodic learning as well. Our main contribution is to show that a variant of the temporal context model (TCM; Howard & Kahana, 2002), an influential model of episodic memory, can be understood as directly estimating the successor representation using the temporal difference learning algorithm (Sutton & Barto, 1998). This insight leads to a generalization of TCM and new experimental predictions. In addition to casting a new normative light on TCM, this equivalence suggests a previously unexplored point of contact between different learning systems.
引用
收藏
页码:1553 / 1568
页数:16
相关论文
共 50 条
  • [41] Temporal representation and dynamics
    Grush, Rick
    [J]. NEW IDEAS IN PSYCHOLOGY, 2008, 26 (02) : 146 - 157
  • [42] Representation of temporal unawareness
    Chountas, P
    Petrounias, I
    Atanassov, K
    Kodogiannis, V
    El-Darzi, E
    [J]. ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 21 - 30
  • [43] Temporal representation and reasoning
    Morris, R
    Khatib, L
    [J]. KNOWLEDGE ENGINEERING REVIEW, 1997, 12 (04): : 411 - 412
  • [44] Temporal Ventriloquism in a Purely Temporal Context
    Hartcher-O'Brien, Jessica
    Alais, David
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2011, 37 (05) : 1383 - 1395
  • [45] TAN: a temporal-aware attention network with context-rich representation for boosting proposal generation
    Jiao, Yanyan
    Yang, Wenzhu
    Xing, Wenjie
    Zeng, Shuang
    Geng, Lei
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3691 - 3708
  • [46] TAN: a temporal-aware attention network with context-rich representation for boosting proposal generation
    Yanyan Jiao
    Wenzhu Yang
    Wenjie Xing
    Shuang Zeng
    Lei Geng
    [J]. Complex & Intelligent Systems, 2024, 10 : 3691 - 3708
  • [47] LEARN A ROBUST REPRESENTATION FOR COVER SONG IDENTIFICATION VIA AGGREGATING LOCAL AND GLOBAL MUSIC TEMPORAL CONTEXT
    Jiang, Chaoya
    Yang, Deshun
    Chen, Xiaoou
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [48] Context-aware temporal network representation of event logs: Model and methods for process performance analysis
    Senderovich, Arik
    Weidlich, Matthias
    Gal, Avigdor
    [J]. INFORMATION SYSTEMS, 2019, 84 : 240 - 254
  • [49] AKF-SR: Adaptive Kalman filtering-based successor representation q
    Malekzadeh, Parvin
    Salimibeni, Mohammad
    Hou, Ming
    Mohammadi, Arash
    Plataniotis, Konstantinos N.
    [J]. NEUROCOMPUTING, 2022, 467 : 476 - 490
  • [50] Temporal Context Manager
    Kvet, Michal
    Matiasko, Karol
    [J]. PROCEEDINGS OF THE 2015 FEDERATED CONFERENCE ON SOFTWARE DEVELOPMENT AND OBJECT TECHNOLOGIES, 2017, 511 : 169 - 192