Retrospective model-based inference guides model-free credit assignment

被引:0
|
作者
Rani Moran
Mehdi Keramati
Peter Dayan
Raymond J. Dolan
机构
[1] University College London,Max Planck UCL Centre for Computational Psychiatry and Ageing Research
[2] 10-12 Russell Square,Wellcome Centre for Human Neuroimaging
[3] University College London,Department of Psychology, City
[4] University of London,Gatsby Computational Neuroscience Unit
[5] University College London,undefined
[6] Max Planck Institute for Biological Cybernetics,undefined
[7] Max Plank-Ring 8,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
An extensive reinforcement learning literature shows that organisms assign credit efficiently, even under conditions of state uncertainty. However, little is known about credit-assignment when state uncertainty is subsequently resolved. Here, we address this problem within the framework of an interaction between model-free (MF) and model-based (MB) control systems. We present and support experimentally a theory of MB retrospective-inference. Within this framework, a MB system resolves uncertainty that prevailed when actions were taken thus guiding an MF credit-assignment. Using a task in which there was initial uncertainty about the lotteries that were chosen, we found that when participants’ momentary uncertainty about which lottery had generated an outcome was resolved by provision of subsequent information, participants preferentially assigned credit within a MF system to the lottery they retrospectively inferred was responsible for this outcome. These findings extend our knowledge about the range of MB functions and the scope of system interactions.
引用
收藏
相关论文
共 50 条
  • [1] Retrospective model-based inference guides model-free credit assignment
    Moran, Rani
    Keramati, Mehdi
    Dayan, Peter
    Dolan, Raymond J.
    [J]. NATURE COMMUNICATIONS, 2019, 10 (1)
  • [2] Dopamine Enhances Model-Free Credit Assignment Through Boosting of Retrospective Model-Based Inference
    Deserno, Lorenz
    Moran, Rani
    Lee, Ying
    Michely, Jochen
    Dayan, Peter
    Dolan, Raymond
    [J]. BIOLOGICAL PSYCHIATRY, 2021, 89 (09) : S94 - S94
  • [3] Dopamine enhances model-free credit assignment through boosting of retrospective model-based inference
    Deserno, Lorenz
    Moran, Rani
    Michely, Jochen
    Lee, Ying
    Dayan, Peter
    Dolan, Raymond J.
    [J]. ELIFE, 2021, 10
  • [4] Counterfactual Credit Assignment in Model-Free Reinforcement Learning
    Mesnard, Thomas
    Weber, Theophane
    Viola, Fabio
    Thakoor, Shantanu
    Saade, Alaa
    Harutyunyan, Anna
    Dabney, Will
    Stepleton, Tom
    Heess, Nicolas
    Guez, Arthur
    Moulines, Eric
    Hutter, Marcus
    Buesing, Lars
    Munos, Remi
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] Model-Free Versus Model-Based Methods
    不详
    [J]. IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (05): : 40 - 40
  • [6] Prosocial learning: Model-based or model-free?
    Navidi, Parisa
    Saeedpour, Sepehr
    Ershadmanesh, Sara
    Hossein, Mostafa Miandari
    Bahrami, Bahador
    [J]. PLOS ONE, 2023, 18 (06):
  • [7] Model-free, Model-based, and General Intelligence
    Geffner, Hector
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 10 - 17
  • [8] Model-based and model-free filtering of genomic data
    Nounou M.N.
    Nounou H.N.
    Mansouri M.
    [J]. Network Modeling and Analysis in Health Informatics and Bioinformatics, 2013, 2 (03): : 109 - 121
  • [9] Model-free versus model-based volatility prediction
    Politis, Dimitris N.
    [J]. JOURNAL OF FINANCIAL ECONOMETRICS, 2007, 5 (03) : 358 - 389
  • [10] MODEL-BASED AND MODEL-FREE CONTROL OF AUTOCORRELATED PROCESSES
    RUNGER, GC
    WILLEMAIN, TR
    [J]. JOURNAL OF QUALITY TECHNOLOGY, 1995, 27 (04) : 283 - 292