Conflict and competition between model-based and model-free control

被引:0
|
作者
Lei, Yuqing [1 ]
Solway, Alec [1 ,2 ]
机构
[1] Univ Maryland College Pk, Dept Psychol, College Pk, MD 20742 USA
[2] Univ Maryland College Pk, Program Neurosci & Cognit Sci, College Pk, MD 20742 USA
关键词
DECISION-MAKING; CHOICES; PREDICTION; MEMORY; ACCOUNT; SYSTEMS;
D O I
10.1371/journal.pcbi.1010047
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A large literature has accumulated suggesting that human and animal decision making is driven by at least two systems, and that important functions of these systems can be captured by reinforcement learning algorithms. The "model-free" system caches and uses stimulus-value or stimulus-response associations, and the "model-based" system implements more flexible planning using a model of the world. However, it is not clear how the two systems interact during deliberation and how a single decision emerges from this process, especially when they disagree. Most previous work has assumed that while the systems operate in parallel, they do so independently, and they combine linearly to influence decisions. Using an integrated reinforcement learning/drift-diffusion model, we tested the hypothesis that the two systems interact in a non-linear fashion similar to other situations with cognitive conflict. We differentiated two forms of conflict: action conflict, a binary state representing whether the systems disagreed on the best action, and value conflict, a continuous measure of the extent to which the two systems disagreed on the difference in value between the available options. We found that decisions with greater value conflict were characterized by reduced model-based control and increased caution both with and without action conflict. Action conflict itself (the binary state) acted in the opposite direction, although its effects were less prominent. We also found that between-system conflict was highly correlated with within-system conflict, and although it is less clear a priori why the latter might influence the strength of each system above its standard linear contribution, we could not rule it out. Our work highlights the importance of non-linear conflict effects, and provides new constraints for more detailed process models of decision making. It also presents new avenues to explore with relation to disorders of compulsivity, where an imbalance between systems has been implicated. Author summaryA number of studies have framed goal-directed and habitual decision making from the perspective of different reinforcement learning algorithms ("model-based" and "model-free"), and further suggested that they are supported by separate though potentially overlapping systems. However, there has been little work to understand how the different systems work together. By design, they will sometimes disagree on the identity of the best action, and even when they agree, they will assign different values to the actions. Despite this, the end result is a single behavioral output. The issue of how the two systems interact and compete draws parallels to the existing literature on cognitive control, where a central question has been how more 'costly' cognitive resources should be deployed in the presence of decision conflict (here, the goal-directed system is more computationally 'expensive'). Across four datasets, we found that the influence of the goal-directed system was reduced as a function of conflict between systems, and in addition, responses overall were more cautious. Our results provide new constraints for process models of decision making, and suggest new research directions for questions related to psychopathology and disorders of compulsivity in particular, where an imbalance between the two systems has previously been implicated.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Model-based and Model-free Reinforcement Learning for Visual Servoing
    Farahmand, Amir Massoud
    Shademan, Azad
    Jagersand, Martin
    Szepesvari, Csaba
    [J]. ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 4135 - 4142
  • [42] An unified approach to model-based and model-free visual servoing
    Malis, E
    [J]. COMPUTER VISION - ECCV 2002, PT IV, 2002, 2353 : 433 - 447
  • [43] Model-free and model-based strategy for rats' action selection
    Funamizu, Akihiro
    Ito, Makoto
    Doya, Kenji
    Kanzaki, Ryohei
    Takahashi, Hirokazu
    [J]. NEUROSCIENCE RESEARCH, 2010, 68 : E186 - E187
  • [44] Model-based and model-free mechanisms in methamphetamine use disorder
    Robinson, Alex H.
    Mahlberg, Justin
    Chong, Trevor T. -J.
    Verdejo-Garcia, Antonio
    [J]. ADDICTION BIOLOGY, 2024, 29 (01)
  • [45] Language acquisition is model-based rather than model-free
    Wang, Felix Hao
    Mintz, Toben H.
    [J]. BEHAVIORAL AND BRAIN SCIENCES, 2016, 39
  • [46] Model-based learning retrospectively updates model-free values
    Doody, Max
    Van Swieten, Maaike M. H.
    Manohar, Sanjay G.
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [47] Biases in estimating the balance between model-free and model-based learning systems due to model misspecification
    Toyama, Asako
    Katahira, Kentaro
    Ohira, Hideki
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2019, 91 : 88 - 102
  • [48] Curious Meta-Controller: Adaptive Alternation between Model-Based and Model-Free Control in Deep Reinforcement Learning
    Hafez, Muhammad Burhan
    Weber, Cornelius
    Kerzel, Matthias
    Wermter, Stefan
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [49] No substantial change in the balance between model-free and model-based control via training on the two-step task
    Grosskurth, Elmar D.
    Bach, Dominik R.
    Economides, Marcos
    Huys, Quentin J. M.
    Holper, Lisa
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (11)
  • [50] Disruption of Dorsolateral Prefrontal Cortex Decreases Model-Based in Favor of Model-free Control in Humans
    Smittenaar, Peter
    FitzGerald, Thomas H. B.
    Romei, Vincenzo
    Wright, Nicholas D.
    Dolan, Raymond J.
    [J]. NEURON, 2013, 80 (04) : 914 - 919