Stimulus-Dependent Adjustment of Reward Prediction Error in the Midbrain

被引:11
|
作者
Takemura, Hiromasa [1 ]
Samejima, Kazuyuki [2 ]
Vogels, Rufin [3 ]
Sakagami, Masamichi [2 ]
Okuda, Jiro [2 ,4 ]
机构
[1] Univ Tokyo, Dept Life Sci, Tokyo, Japan
[2] Tamagawa Univ, Brain Sci Inst, Brain Sci Res Ctr, Machida, Tokyo, Japan
[3] Katholieke Univ Leuven, Sch Med, Lab Neuroen Psychofysiol, Louvain, Belgium
[4] Kyoto Sangyo Univ, Fac Comp Sci & Engn, Dept Intelligent Syst, Khoyama Ctr Neurosci, Kyoto 603, Japan
来源
PLOS ONE | 2011年 / 6卷 / 12期
基金
日本学术振兴会;
关键词
TEMPORAL DIFFERENCE MODELS; DOPAMINE RESPONSES; HUMAN BRAIN; DECISION; REPRESENTATION; SENSITIVITY; INFERENCE; NEURONS; HUMANS; CORTEX;
D O I
10.1371/journal.pone.0028337
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Previous reports have described that neural activities in midbrain dopamine areas are sensitive to unexpected reward delivery and omission. These activities are correlated with reward prediction error in reinforcement learning models, the difference between predicted reward values and the obtained reward outcome. These findings suggest that the reward prediction error signal in the brain updates reward prediction through stimulus-reward experiences. It remains unknown, however, how sensory processing of reward-predicting stimuli contributes to the computation of reward prediction error. To elucidate this issue, we examined the relation between stimulus discriminability of the reward-predicting stimuli and the reward prediction error signal in the brain using functional magnetic resonance imaging (fMRI). Before main experiments, subjects learned an association between the orientation of a perceptually salient (high-contrast) Gabor patch and a juice reward. The subjects were then presented with lower-contrast Gabor patch stimuli to predict a reward. We calculated the correlation between fMRI signals and reward prediction error in two reinforcement learning models: a model including the modulation of reward prediction by stimulus discriminability and a model excluding this modulation. Results showed that fMRI signals in the midbrain are more highly correlated with reward prediction error in the model that includes stimulus discriminability than in the model that excludes stimulus discriminability. No regions showed higher correlation with the model that excludes stimulus discriminability. Moreover, results show that the difference in correlation between the two models was significant from the first session of the experiment, suggesting that the reward computation in the midbrain was modulated based on stimulus discriminability before learning a new contingency between perceptually ambiguous stimuli and a reward. These results suggest that the human reward system can incorporate the level of the stimulus discriminability flexibly into reward computations by modulating previously acquired reward values for a typical stimulus.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] HALOPERIDOL AFFECTS STIMULUS-DEPENDENT STRATEGIES AND NOT REWARD-DEPENDENT STRATEGIES
    COENDERS, CJH
    KERBUSCH, SML
    VOSSEN, JMH
    BRAIN RESEARCH BULLETIN, 1993, 32 (01) : 7 - 10
  • [2] Different neural correlates of stimulus-action-dependent and stimulus-dependent reward predictions revealed by fMRI
    Haruno, Masahiko
    Kansaku, Kenji
    Aramaki, Yu
    Kawato, Mitsuo
    NEUROSCIENCE RESEARCH, 2006, 55 : S262 - S262
  • [3] Is Perception Stimulus-Dependent?
    Sergio Cermeño-Aínsa
    Review of Philosophy and Psychology, 2022, 13 : 735 - 754
  • [4] Is Perception Stimulus-Dependent?
    Cermeno-Ainsa, Sergio
    REVIEW OF PHILOSOPHY AND PSYCHOLOGY, 2022, 13 (03) : 735 - 754
  • [5] Stimulus-dependent auditory tuning results in synchronous population coding of vocalizations in the songbird midbrain
    Woolley, SMN
    Gill, PR
    Theunissen, FE
    JOURNAL OF NEUROSCIENCE, 2006, 26 (09): : 2499 - 2512
  • [6] Midbrain dopamine neurons encode a quantitative reward prediction error signal
    Bayer, HM
    Glimcher, PW
    NEURON, 2005, 47 (01) : 129 - 141
  • [7] The effect of effort on reward prediction error signals in midbrain dopamine neurons
    Tanaka, Shingo
    Taylor, Jessica E.
    Sakagami, Masamichi
    CURRENT OPINION IN BEHAVIORAL SCIENCES, 2021, 41 : 152 - 159
  • [8] Discrete coding of stimulus value, reward expectation, and reward prediction error in the dorsal striatum
    Oyama, Kei
    Tateyama, Yukina
    Hernadi, Istvan
    Tobler, Philippe N.
    Iijima, Toshio
    Tsutsui, Ken-Ichiro
    JOURNAL OF NEUROPHYSIOLOGY, 2015, 114 (05) : 2600 - 2615
  • [9] Stimulus-dependent processing of temporal order
    Fink, M
    Ulbrich, P
    Churan, J
    Wittmann, M
    BEHAVIOURAL PROCESSES, 2006, 71 (2-3) : 344 - 352
  • [10] Stimulus-dependent spiking relationships with the EEG
    Snyder, Adam C.
    Smith, Matthew A.
    JOURNAL OF NEUROPHYSIOLOGY, 2015, 114 (03) : 1468 - 1482