Trial-by-trial dynamics of reward prediction error-associated signals during extinction learning and renewal

被引:16
|
作者
Packheiser, Julian [1 ]
Donoso, Jose R. [2 ]
Cheng, Sen [2 ]
Guentuerkuen, Onur [1 ]
Pusch, Roland [1 ]
机构
[1] Ruhr Univ Bochum, Fac Psychol, Dept Biopsychol, Univ Str 150, D-44780 Bochum, Germany
[2] Ruhr Univ Bochum, Inst Neural Computat, Univ Str 150, D-44780 Bochum, Germany
来源
PROGRESS IN NEUROBIOLOGY | 2021年 / 197卷
关键词
Reward prediction error; Extinction learning; Renewal; Trial-by-trial learning; Electrophysiology; DOPAMINE NEURONS ENCODE; PIGEON COLUMBA-LIVIA; PREFRONTAL CORTEX; NIDOPALLIUM CAUDOLATERALE; VARIABILITY; MICRODRIVE; RESPONSES; CONTEXT;
D O I
10.1016/j.pneurobio.2020.101901
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Reward prediction errors (RPEs) have been suggested to drive associative learning processes, but their precise temporal dynamics at the single-neuron level remain elusive. Here, we studied the neural correlates of RPEs, focusing on their trial-by-trial dynamics during an operant extinction learning paradigm. Within a single behavioral session, pigeons went through acquisition, extinction and renewal the context-dependent response recovery after extinction. We recorded single units from the avian prefrontal cortex analogue, the nidopallium caudolaterale (NCL) and found that the omission of reward during extinction led to a peak of population activity that moved backwards in time as trials progressed. The chronological order of these signal changes during the progress of learning was indicative of temporal shifts of RPE signals that started during reward omission and then moved backwards to the presentation of the conditioned stimulus. Switches from operant choices to avoidance behavior (and vice versa) coincided with changes in population activity during the animals' decision-making. On the single unit level, we found more diverse patterns where some neurons' activity correlated with RPE signals whereas others correlated with the absolute value during the outcome period. Finally, we demonstrated that mere sensory contextual changes during the renewal test were sufficient to elicit signals likely associated with RPEs. Thus, RPEs are truly expectancy-driven since they can be elicited by changes in reward expectation, without an actual change in the quality or quantity of reward.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Does reward positivity encode trial-by-trial reward prediction error? A model- based EEG analysis
    Wu, Ka Chun
    Ip, Isaac
    Ching, Fiona
    Chiu, Heytou
    Chan, Rosa
    Chau, Bolton K. H.
    Wong, Savio W. H.
    Wong, Yetta Kwailing
    [J]. JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2021, 49 (SUPPL 1) : S197 - S198
  • [2] Feedback-driven Trial-by-trial Reward Learning in Schizophrenia
    Fervaha, Gagan
    Agid, Ofer
    Foussias, George
    Remington, Gary
    [J]. BIOLOGICAL PSYCHIATRY, 2015, 77 (09)
  • [3] Trial-by-Trial Modulation of Associative Memory Formation by Reward Prediction Error and Reward Anticipationas Revealed by a Biologically Plausible Computational Model
    Aberg, Kristoffer C.
    Mueller, Julia
    Schwartz, Sophie
    [J]. FRONTIERS IN HUMAN NEUROSCIENCE, 2017, 11
  • [4] Dynamics of Oddball Sound Processing: Trial-by-Trial Modeling of ECoG Signals
    Lecaignard, Francoise
    Bertrand, Raphaelle
    Brunner, Peter
    Caclin, Anne
    Schalk, Gerwin
    Mattout, Jeremie
    [J]. FRONTIERS IN HUMAN NEUROSCIENCE, 2022, 15
  • [5] Trial-by-trial transformation of error into sensorimotor adaptation changes with environmental dynamics
    Fine, Michael S.
    Thoroughman, Kurt A.
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2007, 98 (03) : 1392 - 1404
  • [6] Trial-by-trial modeling of electrophysiological signals during inverse Bayesian inference
    Antonio Kolossa
    Bruno Kopp
    Tim Fingscheidt
    [J]. BMC Neuroscience, 15 (Suppl 1)
  • [7] Reward and loss incentives improve spatial working memory by shaping trial-by-trial posterior frontoparietal signals
    Cho, Youngsun T.
    Moujaes, Flora
    Schleifer, Charles H.
    Starc, Martina
    Ji, Jie Lisa
    Santamauro, Nicole
    Adkinson, Brendan
    Kolobaric, Antonija
    Flynn, Morgan
    Krystal, John H.
    Murray, John D.
    Repovs, Grega
    Anticevic, Alan
    [J]. NEUROIMAGE, 2022, 254
  • [8] Blocking trial-by-trial error correction does not interfere with motor learning in human walking
    Long, Andrew W.
    Roemmich, Ryan T.
    Bastian, Amy J.
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2016, 115 (05) : 2341 - 2348
  • [9] Motor Learning Without Doing: Trial-by-Trial Improvement in Motor Performance During Mental Training
    Gentili, Rodolphe
    Han, Cheol E.
    Schweighofer, Nicolas
    Papaxanthis, Charalambos
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2010, 104 (02) : 774 - 783
  • [10] Reward prediction error signals associated with a modified time estimation task
    Holroyd, Clay B.
    Krigolson, Olave E.
    [J]. PSYCHOPHYSIOLOGY, 2007, 44 (06) : 913 - 917