A Reservoir Computing Model of Reward-Modulated Motor Learning and Automaticity

Cited by: 10
Authors
Pyle, Ryan [1]
Rosenbaum, Robert [1, 2]
Affiliations
[1] Univ Notre Dame, Dept Appl & Computat Math & Stat, Notre Dame, IN 46556 USA
[2] Univ Notre Dame, Interdisciplinary Ctr Network Sci & Applicat, Notre Dame, IN 46556 USA
Funding
US National Science Foundation
Keywords
GANGLIA-FOREBRAIN CIRCUIT; NEURAL-NETWORKS; STRIATAL NEURONS; REINFORCEMENT; DYNAMICS; CORTEX; CHAOS; COMPUTATION; SHOULDER; PATTERNS;
DOI
10.1162/neco_a_01198
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Reservoir computing is a biologically inspired class of learning algorithms in which the intrinsic dynamics of a recurrent neural network are mined to produce target time series. Most existing reservoir computing algorithms rely on fully supervised learning rules, which require access to an exact copy of the target response, greatly reducing the utility of the system. Reinforcement learning rules have been developed for reservoir computing, but we find that they fail to converge on complex motor tasks. Current theories of biological motor learning pose that early learning is controlled by dopamine-modulated plasticity in the basal ganglia that trains parallel cortical pathways through unsupervised plasticity as a motor task becomes well learned. We developed a novel learning algorithm for reservoir computing that models the interaction between reinforcement and unsupervised learning observed in experiments. This novel learning algorithm converges on simulated motor tasks on which previous reservoir computing algorithms fail and reproduces experimental findings that relate Parkinson's disease and its treatments to motor learning. Hence, incorporating biological theories of motor learning improves the effectiveness and biological relevance of reservoir computing models.
Pages: 1430-1461
Number of pages: 32
Related papers
50 in total
  • [21] Functional Requirements for Reward-Modulated Spike-Timing-Dependent Plasticity
    Fremaux, Nicolas
    Sprekeler, Henning
    Gerstner, Wulfram
    JOURNAL OF NEUROSCIENCE, 2010, 30 (40): 13326-13337
  • [22] Biologically Realizable Reward-Modulated Hebbian Training for Spiking Neural Networks
    Ferrari, Silvia
    Mehta, Bhavesh
    Di Muro, Gianluca
    VanDongen, Antonius M. J.
    Henriquez, Craig
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008: 1780-1786
  • [23] A Reward-Modulated Hebbian Learning Rule Can Explain Experimentally Observed Network Reorganization in a Brain Control Task
    Legenstein, Robert
    Chase, Steven M.
    Schwartz, Andrew B.
    Maass, Wolfgang
    JOURNAL OF NEUROSCIENCE, 2010, 30 (25): 8400-8410
  • [24] First-Spike-Based Visual Categorization Using Reward-Modulated STDP
    Mozafari, Milad
    Kheradpisheh, Saeed Reza
    Masquelier, Timothee
    Nowzari-Dalini, Abbas
    Ganjtabesh, Mohammad
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12): 6178-6190
  • [25] Reward-modulated Hebbian plasticity as leverage for partially embodied control in compliant robotics
    Burms, Jeroen
    Caluwaerts, Ken
    Dambre, Joni
    FRONTIERS IN NEUROROBOTICS, 2015, 9
  • [26] Perceptual learning, motor learning, and automaticity
    Fecteau, Jillian H.
    Roelfsema, Pieter
    De Zeeuw, Chris I.
    Kousta, Stavroula
    TRENDS IN COGNITIVE SCIENCES, 2010, 14 (01): 1-1
  • [27] Tiny Reservoir Computing for Extreme Learning of Motor Control
    Federici, Niccolo
    Pau, Danilo
    Adami, Nicola
    Benini, Sergio
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
  • [28] Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
    Bing, Zhenshan
    Baumann, Ivan
    Jiang, Zhuangyi
    Huang, Kai
    Cai, Caixia
    Knoll, Alois
    FRONTIERS IN NEUROROBOTICS, 2019, 13
  • [29] Brain Inspired Sequences Production by Spiking Neural Networks With Reward-Modulated STDP
    Fang, Hongjian
    Zeng, Yi
    Zhao, Feifei
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15
  • [30] A Bio-Inspired Hierarchical Spiking Neural Network With Reward-Modulated STDP Learning Rule for AER Object Recognition
    Zhou, Qian
    Li, Xiaohu
    IEEE SENSORS JOURNAL, 2022, 22 (16): 16323-16338