The role of training variability for model-based and model-free learning of an arbitrary visuomotor mapping

被引:0
|
作者
Velazquez-Vargas, Carlos A. [1 ]
Daw, Nathaniel D. [1 ,2 ]
Taylor, Jordan A. [1 ,2 ]
机构
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08544 USA
[2] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ USA
基金
美国国家卫生研究院;
关键词
SENSORY PREDICTION; SCHEMA THEORY; MOTOR; MOVEMENT; DYNAMICS; ADAPTATION; EXPLICIT; IMPLICIT; REPRESENTATIONS; ACQUISITION;
D O I
10.1371/journal.pcbi.1012471
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A fundamental feature of the human brain is its capacity to learn novel motor skills. This capacity requires the formation of vastly different visuomotor mappings. Using a grid navigation task, we investigated whether training variability would enhance the flexible use of a visuomotor mapping (key-to-direction rule), leading to better generalization performance. Experiments 1 and 2 show that participants trained to move between multiple start-target pairs exhibited greater generalization to both distal and proximal targets compared to participants trained to move between a single pair. This finding suggests that limited variability can impair decisions even in simple tasks without planning. In addition, during the training phase, participants exposed to higher variability were more inclined to choose options that, counterintuitively, moved the cursor away from the target while minimizing its actual distance under the constrained mapping, suggesting a greater engagement in model-based computations. In Experiments 3 and 4, we showed that the limited generalization performance in participants trained with a single pair can be enhanced by a short period of variability introduced early in learning or by incorporating stochasticity into the visuomotor mapping. Our computational modeling analyses revealed that a hybrid model between model-free and model-based computations with different mixing weights for the training and generalization phases, best described participants' data. Importantly, the differences in the model-based weights between our experimental groups, paralleled the behavioral findings during training and generalization. Taken together, our results suggest that training variability enables the flexible use of the visuomotor mapping, potentially by preventing the consolidation of habits due to the continuous demand to change responses. The development of new motor skills often requires the learning of novel associations between actions and outcomes. These novel mappings can be flexible and generalize to new situations, or more local with narrow generalization, similar to stimulus-action associations. In a series of experiments using a navigation task, we showed that generalizable mappings are favored under a training variability regime, while local mappings with narrow generalization are developed in the absence of variability. Training variability was generated in our experiments either with multiple goals or with stochasticity in the action-outcome mapping, with both regimes leading to successful generalization. In addition, we showed that the benefits in generalization from training variability can be observed even when participants are subsequently exposed to no variability for a prolonged period of time. These results were best described by a mixture of model-free and model-based reinforcement learning algorithms, with different mixture weights for the training and generalization phases.
引用
收藏
页数:43
相关论文
共 50 条
  • [41] Benchmarking model-free and model-based optimal control
    Koryakovskiy, Ivan
    Kudruss, Manuel
    Babuska, Robert
    Caarls, Wouter
    Kirches, Christian
    Mombaur, Katja
    Schloeder, Johannes P.
    Vallery, Heike
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2017, 92 : 81 - 90
  • [42] LEARNING UNDER UNCERTAINTY: NEURAL MARKERS OF MODEL-FREE AND MODEL-BASED LEARNING IN PROBABILISTIC ENVIRONMENTS
    Wurm, Franz
    Ernst, Benjamin
    Steinhauser, Marco
    PSYCHOPHYSIOLOGY, 2017, 54 : S127 - S127
  • [43] Granger Causality in Cardiovascular Variability Series: Comparison between Model-based and Model-free Approaches
    Porta, Alberto
    Bassani, Tito
    Bari, Vlasta
    Guzzetti, Stefano
    2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 3684 - 3687
  • [44] Predictive representations can link model-based reinforcement learning to model-free mechanisms
    Russek, Evan M.
    Momennejad, Ida
    Botvinick, Matthew M.
    Gershman, Samuel J.
    Daw, Nathaniel D.
    PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (09)
  • [45] Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
    Chebotar, Yevgen
    Hausman, Karol
    Zhang, Marvin
    Sukhatme, Gaurav
    Schaal, Stefan
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [46] Extraversion differentiates between model-based and model-free strategies in a reinforcement learning task
    Skatova, Anya
    Chan, Patricia A.
    Daw, Nathaniel D.
    FRONTIERS IN HUMAN NEUROSCIENCE, 2013, 7
  • [47] Dyna-style Model-based reinforcement learning with Model-Free Policy Optimization
    Dong, Kun
    Luo, Yongle
    Wang, Yuxin
    Liu, Yu
    Qu, Chengeng
    Zhang, Qiang
    Cheng, Erkang
    Sun, Zhiyong
    Song, Bo
    KNOWLEDGE-BASED SYSTEMS, 2024, 287
  • [48] The modulation of acute stress on model-free and model-based reinforcement learning in gambling disorder
    Wyckmans, Florent
    Banerjee, Nilosmita
    Saeremans, Melanie
    Otto, Ross
    Kornreich, Charles
    Vanderijst, Laetitia
    Gruson, Damien
    Carbone, Vincenzo
    Bechara, Antoine
    Buchanan, Tony
    Noel, Xavier
    JOURNAL OF BEHAVIORAL ADDICTIONS, 2022, 11 (03) : 831 - 844
  • [49] Effects of subclinical depression on prefrontal-striatal model-based and model-free learning
    Heo, Suyeon
    Sung, Yoondo
    Lee, Sang Wan
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (05)
  • [50] Alcohol Hangover Does Not Alter the Application of Model-Based and Model-Free Learning Strategies
    Berghaeuser, Julia
    Bensmann, Wiebke
    Zink, Nicolas
    Endrass, Tanja
    Beste, Christian
    Stock, Ann-Kathrin
    JOURNAL OF CLINICAL MEDICINE, 2020, 9 (05)