One-shot learning and behavioral eligibility traces in sequential decision making

被引:11
|
作者
Lehmann, Marco P. [1 ,2 ]
Xu, He A. [3 ]
Liakoni, Vasiliki [1 ,2 ]
Herzog, Michael H. [3 ]
Gerstner, Wulfram [1 ,2 ]
Preuschoff, Kerstin [4 ]
机构
[1] Ecole Polytech Fed Lausanne, Sch Life Sci, Brain Mind Inst, Lausanne, Switzerland
[2] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, Lausanne, Switzerland
[3] Ecole Polytech Fed Lausanne, Sch Life Sci, Lab Psychophys, Lausanne, Switzerland
[4] Univ Geneva, Swiss Ctr Affect Sci, Geneva, Switzerland
来源
ELIFE | 2019年 / 8卷
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
PUPIL DIAMETER; TERM-MEMORY; REINFORCEMENT; REWARD; PLASTICITY; SIGNALS; STATES; LOAD;
D O I
10.7554/eLife.47463
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In many daily tasks, we make multiple decisions before reaching a goal. In order to learn such sequences of decisions, a mechanism to link earlier actions to later reward is necessary. Reinforcement learning (RL) theory suggests two classes of algorithms solving this credit assignment problem: In classic temporal-difference learning, earlier actions receive reward information only after multiple repetitions of the task, whereas models with eligibility traces reinforce entire sequences of actions from a single experience (one-shot). Here, we show one-shot learning of sequences. We developed a novel paradigm to directly observe which actions and states along a multi-step sequence are reinforced after a single reward. By focusing our analysis on those states for which RL with and without eligibility trace make qualitatively distinct predictions, we find direct behavioral (choice probability) and physiological (pupil dilation) signatures of reinforcement learning with eligibility trace across multiple sensory modalities.
引用
收藏
页码:1 / 32
页数:25
相关论文
共 50 条
  • [21] Generative One-Shot Learning (GOL): A Semi-Parametric Approach to One-Shot Learning in Autonomous Vision
    Grigorescu, Sorin M.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 7127 - 7134
  • [22] One-Shot Learning in the Road Sign Problem
    Pinto, Rafael C.
    Engel, Paulo M.
    Heinen, Milton R.
    [J]. 2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [23] Learning in one-shot strategic form games
    Altman, Alon
    Bercovici-Boden, Avivit
    Tennenholtz, Moshe
    [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 6 - 17
  • [24] Order Optimal One-Shot Distributed Learning
    Sharifnassab, Arsalan
    Salehkaleybar, Saber
    Golestani, S. Jamaloddin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] One-shot Transfer Learning for Population Mapping
    Shao, Erzhuo
    Feng, Jie
    Wang, Yingheng
    Xia, Tong
    Li, Yong
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1588 - 1597
  • [26] How One-Shot Learning Unfolds in the Brain
    Weaver, Janelle
    [J]. PLOS BIOLOGY, 2015, 13 (04)
  • [27] Image Block Augmentation for One-Shot Learning
    Chen, Zitian
    Fu, Yanwei
    Chen, Kaiyu
    Jiang, Yu-Gang
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3379 - 3386
  • [28] One-Shot Ontogenetic Learning in Biomedical Datastreams
    Kalantari, John
    Mackey, Michael A.
    [J]. ARTIFICIAL GENERAL INTELLIGENCE: 10TH INTERNATIONAL CONFERENCE, AGI 2017, 2017, 10414 : 143 - 153
  • [29] Optimizing one-shot learning with binary Synapses
    Romani, Sandro
    Amit, Daniel J.
    Amit, Yali
    [J]. NEURAL COMPUTATION, 2008, 20 (08) : 1928 - 1950
  • [30] One-Shot Tuner for Deep Learning Compilers
    Ryu, Jaehun
    Park, Eunhyeok
    Sung, Hyojin
    [J]. CC'22: PROCEEDINGS OF THE 31ST ACM SIGPLAN INTERNATIONAL CONFERENCE ON COMPILER CONSTRUCTION, 2022, : 89 - 103