Reinforcement learning and the reward positivity with aversive outcomes

被引:0
|
作者
Bauer, Elizabeth A. [1 ,2 ]
Watanabe, Brandon K. [1 ]
Macnamara, Annmarie [1 ]
机构
[1] Texas A&M Univ, Dept Psychol & Brain Sci, College Stn, TX USA
[2] Texas A&M Univ, Dept Psychol & Brain Sci, 4235 TAMU, College Stn, TX 77843 USA
关键词
ERP; punishment; reinforcement learning; reward positivity (RewP); PREDICTION ERROR; DOPAMINE; FEEDBACK; ERP; METAANALYSIS; POTENTIALS; NEGATIVITY; P300; PCA;
D O I
10.1111/psyp.14460
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The reinforcement learning (RL) theory of the reward positivity (RewP), an event-related potential (ERP) component that measures reward responsivity, suggests that the RewP should be largest when positive outcomes are unexpected and has been supported by work using appetitive outcomes (e.g., money). However, the RewP can also be elicited by the absence of aversive outcomes (e.g., shock). The limited work to-date that has manipulated expectancy while using aversive outcomes has not supported the predictions of RL theory. Nonetheless, this work has been difficult to reconcile with the appetitive literature because the RewP was not observed as a reward signal in these studies, which used passive tasks that did not involve participant choice. Here, we tested the predictions of the RL theory by manipulating expectancy in an active/choice-based threat-of-shock doors task that was previously found to elicit the RewP as a reward signal. Moreover, we used principal components analysis to isolate the RewP from overlapping ERP components. Eighty participants viewed pairs of doors surrounded by a red or green border; shock delivery was expected (80%) following red-bordered doors and unexpected (20%) following green-bordered doors. The RewP was observed as a reward signal (i.e., no shock > shock) that was not potentiated for unexpected feedback. In addition, the RewP was larger overall for unexpected (vs expected) feedback. Therefore, the RewP appears to reflect the additive (not interactive) effects of reward and expectancy, challenging the RL theory of the RewP, at least when reward is defined as the absence of an aversive outcome.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance
    Knox, W. Bradley
    Stone, Peter
    [J]. ARTIFICIAL INTELLIGENCE, 2015, 225 : 24 - 50
  • [2] DRIVE AND REWARD IN AVERSIVE LEARNING
    MCALLISTER, WR
    MCALLIST.DE
    [J]. AMERICAN JOURNAL OF PSYCHOLOGY, 1967, 80 (03): : 377 - +
  • [3] Feedback delay impaired reinforcement learning: Principal components analysis of Reward Positivity
    Yin, Hang
    Wang, Yu
    Zhang, Xukai
    Li, Peng
    [J]. NEUROSCIENCE LETTERS, 2018, 685 : 179 - 184
  • [4] LEARNING RELATED CHANGES IN THE REWARD POSITIVITY
    Krigolson, Olav
    [J]. PSYCHOPHYSIOLOGY, 2016, 53 : S10 - S10
  • [5] Reward Reports for Reinforcement Learning
    Gilbert, Thomas Krendl
    Lambert, Nathan
    Dean, Sarah
    Zick, Tom
    Snoswell, Aaron
    Mehta, Soham
    [J]. PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 84 - 130
  • [6] Reward, motivation, and reinforcement learning
    Dayan, P
    Balleine, BW
    [J]. NEURON, 2002, 36 (02) : 285 - 298
  • [7] MAGNITUDE AND SHIFT OF REWARD IN INSTRUMENTAL AVERSIVE LEARNING IN RATS
    MCALLISTER, DE
    MCALLIST.WR
    GOLDMAN, JA
    BROOKS, CI
    [J]. JOURNAL OF COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1972, 80 (03) : 490 - +
  • [8] Information Directed Reward Learning for Reinforcement Learning
    Lindner, David
    Turchetta, Matteo
    Tschiatschek, Sebastian
    Ciosek, Kamil
    Krause, Andreas
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Reinforcement learning reward functions for unsupervised learning
    Fyfe, Colin
    Lai, Pei Ling
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 397 - +
  • [10] Reinforcement learning models of aversive learning and their translation to anxiety disorders
    Seymour, Ben
    Norbury, Agnes
    [J]. JOURNAL OF NEURAL TRANSMISSION, 2017, 124 (10) : 1283 - 1284