Prefrontal solution to the bias-variance tradeoff during reinforcement learning

被引:3
|
作者
Kim, Dongjae [1 ,2 ]
Jeong, Jaeseung [3 ,4 ]
Lee, Sang Wan [3 ,4 ,5 ,6 ,7 ]
机构
[1] NYU, Ctr Neural Sci, New York, NY 10003 USA
[2] NYU, Dept Psychol, 6 Washington Pl, New York, NY 10003 USA
[3] Korea Adv Inst Sci & Technol KAIST, Dept Bio & Brain Engn, Daejeon 34141, South Korea
[4] Korea Adv Inst Sci & Technol KAIST, Program Brain & Cognit Engn, Daejeon 34141, South Korea
[5] Korea Adv Inst Sci & Technol KAIST, KAIST Ctr Neurosci Inspired AI, Daejeon 34141, South Korea
[6] Korea Adv Inst Sci & Technol KAIST, KI Hlth Sci & Technol, Daejeon 34141, South Korea
[7] Korea Adv Inst Sci & Technol KAIST, KI Artificial Intelligence, Daejeon 34141, South Korea
来源
CELL REPORTS | 2021年 / 37卷 / 13期
基金
新加坡国家研究基金会;
关键词
PREDICTION ERRORS; DOPAMINE NEURONS; DECISION-MAKING; BAYESIAN MODEL; ARBITRATION; MIXTURE; COMPUTATIONS; UNCERTAINTY; SELECTION; STRIATUM;
D O I
10.1016/j.celrep.2021.110185
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Evidence that the brain combines different value learning strategies to minimize prediction error is accumulating. However, the tradeoff between bias and variance error, which imposes different constraints on each learning strategy's performance, poses a challenge for value learning. While this tradeoff specifies the requirements for optimal learning, little has been known about how the brain deals with this issue. Here, we hypothesize that the brain adaptively resolves the bias-variance tradeoff during reinforcement learning. Our theory suggests that the solution necessitates baseline correction for prediction error, which offsets the adverse effects of irreducible error on value learning. We show behavioral evidence of adaptive control using a Markov decision task with context changes. The prediction error baseline seemingly signals context changes to improve adaptability. Critically, we identify multiplexed representations of prediction error baseline within the ventrolateral and ventromedial prefrontal cortex, key brain regions known to guide model based and model-free reinforcement learning.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] The bias-variance tradeoff and the randomized GACV
    Wahba, G
    Lin, XW
    Gao, FY
    Xiang, D
    Klein, R
    Klein, B
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 620 - 626
  • [2] The Bias-Variance Tradeoff in Cognitive Science
    Doroudi, Shayan
    Rastegar, Seyed Ali
    [J]. COGNITIVE SCIENCE, 2023, 47 (01)
  • [3] Bias-Variance Tradeoff of Graph Laplacian Regularizer
    Chen, Pin-Yu
    Liu, Sijia
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1118 - 1122
  • [4] Controlling the Bias-Variance Tradeoff via Coherent Risk for Robust Learning with Kernels
    Koppel, Alec
    Bedi, Amrit S.
    Rajawat, Ketan
    [J]. 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 3519 - 3525
  • [5] Ensemble Learning in Hyperspectral Image Classification: Toward Selecting a Favorable Bias-Variance Tradeoff
    Merentitis, Andreas
    Debes, Christian
    Heremans, Roel
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (04) : 1089 - 1102
  • [6] Robust multi-layer extreme learning machine using bias-variance tradeoff
    Yu, Tian-jun
    Yan, Xue-feng
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2020, 27 (12) : 3744 - 3753
  • [7] Bias-variance tradeoff in machine learning: Theoretical formulation and implications to structural engineering applications
    Guan, Xingquan
    Burton, Henry
    [J]. STRUCTURES, 2022, 46 : 17 - 30
  • [8] Managing the Uncertainty of Bias-Variance Tradeoff in Software Predictive Analytics
    Mittas, Nikolaos
    Angelis, Lefteris
    [J]. 2016 42ND EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA), 2016, : 351 - 358
  • [9] Bias-variance tradeoff in anticorrelated noise reduction for spectral CT
    Persson, Mats
    Gronberg, Fredrik
    [J]. MEDICAL PHYSICS, 2017, 44 (09) : E242 - E254
  • [10] Bias-variance tradeoff in Hybrid Generative-Discriminative models
    Bouchard, Guillaume
    [J]. ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 124 - 129