Neurornodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control

被引:20
|
作者
Dasgupta, Sakyasingha [1 ,2 ]
Woergoetter, Florentin [1 ,2 ]
Manoonpong, Poramate [2 ,3 ]
机构
[1] Univ Gottingen, Inst Phys Biophys, D-37077 Gottingen, Germany
[2] Univ Gottingen, Bernstein Ctr Computat Neurosci, D-37077 Gottingen, Germany
[3] Univ Southern Denmark, Maersk Mc Kinney Moller Inst, Ctr Biorobot, Odense, Denmark
关键词
decision making; recurrent neural networks; basal ganglia; cerebellum; operant conditioning; classical conditioning; neuromodulation; correlation learning; SUPPLEMENTARY MOTOR AREA; ACTION SELECTION; HETEROSYNAPTIC MODULATION; MODEL; TIME; MECHANISMS; PLASTICITY; OPERANT; CORTEX; MEMORY;
D O I
10.3389/fncir.2014.00126
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Goal-directed decision making in biological systems is broadly based on associations between conditional and unconditional stimuli. This can be further classified as classical conditioning (correlation-based learning) and operant conditioning (reward-based learning). A number of computational and experimental studies have well established the role of the basal ganglia in reward-based learning, where as the cerebellum plays an important role in developing specific conditioned responses. Although viewed as distinct learning systems, recent animal experiments point toward their complementary role in behavioral learning, and also show the existence of substantial two-way communication between these two brain structures. Based on this notion of co-operative learning, in this paper we hypothesize that the basal ganglia and cerebellar learning systems work in parallel and interact with each other. We envision that such an interaction is influenced by reward modulated heterosynaptic plasticity (RMHP) rule at the thalamus, guiding the overall goal directed behavior. Using a recurrent neural network actor-critic model of the basal ganglia and a feed-forward correlation-based learning model of the cerebellum, we demonstrate that the RMHP rule can effectively balance the outcomes of the two learning systems. This is tested using simulated environments of increasing complexity with a four-wheeled robot in a foraging task in both static and dynamic configurations. Although modeled with a simplified level of biological abstraction, we clearly demonstrate that such a RMHP induced combinatorial learning mechanism, leads to stabler and faster learning of goal-directed behaviors, in comparison to the individual systems. Thus, in this paper we provide a computational model for adaptive combination of the basal ganglia and cerebellum learning systems by way of neuromodulated plasticity for goal-directed decision making in biological and bio-mimetic organisms.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Vascular risk factors and associated diseases, not focal brain lesions, determine deficits of reward-based reversal learning in acute basal ganglia stroke
    Hermann, D. M.
    Seidel, U. K.
    Gronewold, J.
    Wicking, M.
    Bellebaun, C.
    CEREBROVASCULAR DISEASES, 2014, 37 : 185 - 185
  • [32] Towards Goal-Directed Navigation Through Combining Learning Based Global and Local Planners
    Zhou, Xiaomao
    Gao, Yanbin
    Guan, Lianwu
    SENSORS, 2019, 19 (01)
  • [33] Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments
    Tan, John Chong Min
    Motani, Mehul
    2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL, 2023, : 1 - 6
  • [34] DISTINCT DOPAMINERGIC CONTROL OF THE DIRECT AND INDIRECT PATHWAYS IN REWARD-BASED AND AVOIDANCE LEARNING BEHAVIORS
    Nakanishi, S.
    Hikida, T.
    Yawata, S.
    NEUROSCIENCE, 2014, 282 : 49 - 59
  • [35] Functional integration processes underlying the instruction-based learning of novel goal-directed behaviors
    Ruge, Hannes
    Wolfensteller, Uta
    NEUROIMAGE, 2013, 68 : 162 - 172
  • [36] Learning Model-based F0 Production through Goal-directed Babbling
    Liu, Hao
    Xu, Yi
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 284 - 288
  • [37] Adaptive Reward Shifting Based on Behavior Proximity for Offline Reinforcement Learning
    Zhang, Zhe
    Tan, Xiaoyang
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4620 - 4628
  • [38] Safe reward-based deep reinforcement learning control for an electro-hydraulic servo system
    Wu, Minling
    Liu, Lijun
    Yu, Zhen
    Li, Weizhou
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (13) : 7646 - 7662
  • [39] Rapid learning of spatial representations for goal-directed navigation based on a novel model of hippocampal place fields
    Alabi, Adedapo
    Vanderelst, Dieter
    Minai, Ali A.
    NEURAL NETWORKS, 2023, 161 : 116 - 128
  • [40] From Creatures of Habit to Goal-Directed Learners: Tracking the Developmental Emergence of Model-Based Reinforcement Learning
    Decker, Johannes H.
    Otto, A. Ross
    Daw, Nathaniel D.
    Hartley, Catherine A.
    PSYCHOLOGICAL SCIENCE, 2016, 27 (06) : 848 - 858