Cognitive mechanisms of learning in sequential decision-making under uncertainty: an experimental and theoretical approach

被引:0
|
作者
Cecchini, Gloria [1 ,2 ]
DePass, Michael [2 ]
Baspinar, Emre [3 ]
Andujar, Marta [4 ]
Ramawat, Surabhi [4 ]
Pani, Pierpaolo [4 ]
Ferraina, Stefano [4 ]
Destexhe, Alain [3 ]
Moreno-Bote, Ruben [2 ,5 ]
Cos, Ignasi [1 ,5 ]
机构
[1] Univ Barcelona, Fac Matemat & Informat, Barcelona, Spain
[2] Univ Pompeu Fabra, Ctr Brain & Cognit, DTIC, Barcelona, Spain
[3] Paris Saclay Univ, Inst Neurosci NeuroPSI, CNRS, Saclay, France
[4] Sapienza Univ Rome, Dept Physiol & Pharmacol, Rome, Italy
[5] Serra Hunter Fellow Programme, Barcelona, Spain
来源
基金
欧盟地平线“2020”;
关键词
decision-making; learning; cognition; computational modeling; consequence; uncertanty; neural dynamics; behavior; PERCEPTUAL DECISION; RISK SENSITIVITY; VISUAL FIXATIONS; PREMOTOR CORTEX; NETWORK MODEL; NEURAL BASIS; CHOICE; SELECTION; PREDICTS; NEUROBIOLOGY;
D O I
10.3389/fnbeh.2024.1399394
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Learning to make adaptive decisions involves making choices, assessing their consequence, and leveraging this assessment to attain higher rewarding states. Despite vast literature on value-based decision-making, relatively little is known about the cognitive processes underlying decisions in highly uncertain contexts. Real world decisions are rarely accompanied by immediate feedback, explicit rewards, or complete knowledge of the environment. Being able to make informed decisions in such contexts requires significant knowledge about the environment, which can only be gained via exploration. Here we aim at understanding and formalizing the brain mechanisms underlying these processes. To this end, we first designed and performed an experimental task. Human participants had to learn to maximize reward while making sequences of decisions with only basic knowledge of the environment, and in the absence of explicit performance cues. Participants had to rely on their own internal assessment of performance to reveal a covert relationship between their choices and their subsequent consequences to find a strategy leading to the highest cumulative reward. Our results show that the participants' reaction times were longer whenever the decision involved a future consequence, suggesting greater introspection whenever a delayed value had to be considered. The learning time varied significantly across participants. Second, we formalized the neurocognitive processes underlying decision-making within this task, combining mean-field representations of competing neural populations with a reinforcement learning mechanism. This model provided a plausible characterization of the brain dynamics underlying these processes, and reproduced each aspect of the participants' behavior, from their reaction times and choices to their learning rates. In summary, both the experimental results and the model provide a principled explanation to how delayed value may be computed and incorporated into the neural dynamics of decision-making, and to how learning occurs in these uncertain scenarios.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] An Experimental Study of Advice in Sequential Decision-Making under Uncertainty
    Benavent, Florian
    Zanuttini, Bruno
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6319 - 6326
  • [2] Learning Structural Weight Uncertainty for Sequential Decision-Making
    Zhang, Ruiyi
    Li, Chunyuan
    Chen, Changyou
    Carin, Lawrence
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [3] UNCERTAINTY, INFORMATION AND ORGANIZATIONAL DECISION-MAKING - COGNITIVE APPROACH
    NYSTROM, H
    [J]. SWEDISH JOURNAL OF ECONOMICS, 1974, 76 (01): : 131 - 139
  • [4] Reinforcement learning for decision-making under deep uncertainty
    Pei, Zhihao
    Rojas-Arevalo, Angela M.
    de Haan, Fjalar J.
    Lipovetzky, Nir
    Moallemi, Enayat A.
    [J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 359
  • [5] DECISION-MAKING UNDER UNCERTAINTY - DIRECT MEASUREMENT APPROACH
    BONOMA, TV
    JOHNSTON, WJ
    [J]. JOURNAL OF CONSUMER RESEARCH, 1979, 6 (02) : 177 - 191
  • [6] A new approach to modeling decision-making under uncertainty
    Edi Karni
    [J]. Economic Theory, 2007, 33 : 225 - 242
  • [7] A new approach to modeling decision-making under uncertainty
    Karni, Edi
    [J]. ECONOMIC THEORY, 2007, 33 (02) : 225 - 242
  • [8] Learning and Reasoning for Robot Sequential Decision Making under Uncertainty
    Amiri, Saeid
    Shirazi, Mohammad Shokrolah
    Zhang, Shiqi
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 2726 - 2733
  • [9] DECISION-MAKING UNDER UNCERTAINTY
    Nistor, Rozalia
    Nistor, Costel
    Muntean, Mihaela-Carmen
    [J]. KNOWLEDGE MANAGEMENT AND INNOVATION: A BUSINESS COMPETITIVE EDGE PERSPECTIVE, VOLS 1-3, 2010, : 187 - 195
  • [10] Decision-making under uncertainty
    Esker, P.
    [J]. PHYTOPATHOLOGY, 2021, 111 (10) : 181 - 182