Neuromodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control

Cited by: 20
Authors
Dasgupta, Sakyasingha [1 ,2 ]
Woergoetter, Florentin [1 ,2 ]
Manoonpong, Poramate [2 ,3 ]
Affiliations
[1] Univ Gottingen, Inst Phys Biophys, D-37077 Gottingen, Germany
[2] Univ Gottingen, Bernstein Ctr Computat Neurosci, D-37077 Gottingen, Germany
[3] Univ Southern Denmark, Maersk Mc Kinney Moller Inst, Ctr Biorobot, Odense, Denmark
Keywords
decision making; recurrent neural networks; basal ganglia; cerebellum; operant conditioning; classical conditioning; neuromodulation; correlation learning; SUPPLEMENTARY MOTOR AREA; ACTION SELECTION; HETEROSYNAPTIC MODULATION; MODEL; TIME; MECHANISMS; PLASTICITY; OPERANT; CORTEX; MEMORY;
DOI
10.3389/fncir.2014.00126
Chinese Library Classification number
Q189 [Neuroscience]
Subject classification code
071006
Abstract
Goal-directed decision making in biological systems is broadly based on associations between conditional and unconditional stimuli. This can be further classified as classical conditioning (correlation-based learning) and operant conditioning (reward-based learning). A number of computational and experimental studies have firmly established the role of the basal ganglia in reward-based learning, whereas the cerebellum plays an important role in developing specific conditioned responses. Although the two are viewed as distinct learning systems, recent animal experiments point toward their complementary roles in behavioral learning and also show substantial two-way communication between these two brain structures. Based on this notion of co-operative learning, we hypothesize in this paper that the basal ganglia and cerebellar learning systems work in parallel and interact with each other. We envision that such an interaction is influenced by a reward-modulated heterosynaptic plasticity (RMHP) rule at the thalamus, guiding the overall goal-directed behavior. Using a recurrent neural network actor-critic model of the basal ganglia and a feed-forward correlation-based learning model of the cerebellum, we demonstrate that the RMHP rule can effectively balance the outcomes of the two learning systems. This is tested using simulated environments of increasing complexity with a four-wheeled robot in a foraging task in both static and dynamic configurations. Although the model uses a simplified level of biological abstraction, we clearly demonstrate that such an RMHP-induced combinatorial learning mechanism leads to more stable and faster learning of goal-directed behaviors than either individual system. Thus, in this paper we provide a computational model for adaptive combination of the basal ganglia and cerebellum learning systems by way of neuromodulated plasticity for goal-directed decision making in biological and bio-mimetic organisms.
Pages: 21