Neural Combinatorial Learning of Goal-directed Behavior with Reservoir Critic and Reward Modulated Hebbian Plasticity

被引:3
|
作者
Dasgupta, Sakyasingha [1 ]
Woergoetter, Florentin [1 ]
Morimoto, Jun [2 ]
Manoonpong, Poramate [1 ]
机构
[1] Univ Gottingen, BCCN, Friedrich Hund Pl 1, D-37077 Gottingen, Germany
[2] ATR Computat Neurosci Lab, Kyoto 6190288, Japan
关键词
Re-inforcement learning; Reservoir networks; Correlation learning; Temporal memory;
D O I
10.1109/SMC.2013.174
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Learning of goal-directed behaviors in biological systems is broadly based on associations between conditional and unconditional stimuli. This can be further classified as classical conditioning (correlation-based learning) and operant conditioning (reward-based learning). Although traditionally modeled as separate learning systems in artificial agents, numerous animal experiments point towards their co-operative role in behavioral learning. Based on this concept, the recently introduced framework of neural combinatorial learning combines the two systems where both the systems run in parallel to guide the overall learned behavior. Such a combinatorial learning demonstrates a faster and efficient learner. In this work, we further improve the framework by applying a reservoir computing network (RC) as an adaptive critic unit and reward modulated Hebbian plasticity. Using a mobile robot system for goal-directed behavior learning, we clearly demonstrate that the reservoir critic outperforms traditional radial basis function (RBF) critics in terms of stability of convergence and learning time. Furthermore the temporal memory in RC allows the system to learn partially observable markov decision process scenario, in contrast to a memoryless RBF critic.
引用
收藏
页码:993 / 1000
页数:8
相关论文
共 50 条
  • [21] The hippocampal-striatal axis in learning, prediction and goal-directed behavior
    Pennartz, C. M. A.
    Ito, R.
    Verschure, P. F. M. J.
    Battaglia, F. P.
    Robbins, T. W.
    TRENDS IN NEUROSCIENCES, 2011, 34 (10) : 548 - 559
  • [22] Neuron as a reward-modulated combinatorial switch and a model of learning behavior
    Rvachev, Marat M.
    NEURAL NETWORKS, 2013, 46 : 62 - 74
  • [23] Astrocyte adenosine signaling and neural mechanisms of goal-directed and habitual reward-seeking behaviors
    Kang, Seungwoo
    Choi, Doo-Sup
    NEUROPSYCHOPHARMACOLOGY, 2021, 46 (01) : 227 - 228
  • [24] Astrocyte adenosine signaling and neural mechanisms of goal-directed and habitual reward-seeking behaviors
    Seungwoo Kang
    Doo-Sup Choi
    Neuropsychopharmacology, 2021, 46 : 227 - 228
  • [25] From modulated Hebbian plasticity to simple behavior learning through noise and weight saturation
    Soltoggio, Andrea
    Stanley, Kenneth O.
    NEURAL NETWORKS, 2012, 34 : 28 - 41
  • [26] PREFRONTAL CORTEX AND GOAL-DIRECTED BEHAVIOR IN SCHIZOPHRENIA: NEURAL MECHANISMS UNDERLYING INFLEXIBILITY
    Sitnikova, Tatiana
    Gramfort, Alexandre
    Boyle, Robert W.
    Cather, Corinne
    Freudenreich, Oliver
    Hamalainen, Matti S.
    Rosen, Bruce R.
    Goff, Donald C.
    SCHIZOPHRENIA RESEARCH, 2012, 136 : S104 - S104
  • [27] Coordinated accumbal dopamine release and neural activity drive goal-directed behavior
    Cheer, Joseph F.
    Aragona, Brandon J.
    Heien, Michael L. A. V.
    Seipel, Andrew T.
    Carelli, Regina M.
    Wightman, R. Mark
    NEURON, 2007, 54 (02) : 237 - 244
  • [28] Neurornodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control
    Dasgupta, Sakyasingha
    Woergoetter, Florentin
    Manoonpong, Poramate
    FRONTIERS IN NEURAL CIRCUITS, 2014, 8
  • [29] Spatial learning correlates with decreased hippocampal activity in the goal-directed behavior of pigeons
    Fan, Jiantao
    Li, Mengmeng
    Cheng, Shuguan
    Shang, Zhigang
    Wan, Hong
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 558 - 561
  • [30] Involvement of the basal ganglia and dopamine system in learning and execution of goal-directed behavior
    Kimura, M
    Matsumoto, N
    Ueda, Y
    Satoh, T
    Minamimoto, T
    Yamada, H
    CATECHOLAMINE RESEARCH: FROM MOLECULAR INSIGHTS TO CLINICAL MEDICINE, 2002, 53 : 377 - 380