Meta-learning in spiking neural networks with reward-modulated STDP

Cited by: 3
Authors
Khoee, Arsham Gholamzadeh [1]
Javaheri, Alireza [2]
Kheradpisheh, Saeed Reza [2]
Ganjtabesh, Mohammad [1]
Affiliations
[1] Univ Tehran, Coll Sci, Sch Math Stat & Comp Sci, Dept Comp Sci, Tehran, Iran
[2] Shahid Beheshti Univ, Fac Math Sci, Dept Comp & Data Sci, Tehran, Iran
Keywords
Meta-learning; Few-shot learning; Learning to learn; Spiking neurons; STDP; Reward-modulated STDP; PFC; Hippocampus; TIMING-DEPENDENT PLASTICITY; PREFRONTAL CORTEX; NEURONS; DECISION; SELECTION; PROGRESS; CHOICES;
DOI
10.1016/j.neucom.2024.128173
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The human brain constantly learns and rapidly adapts to new situations by integrating acquired knowledge and experiences into memory. Developing this capability in machine learning models is considered an important goal of AI research, since deep neural networks perform poorly when data is limited or when they need to adapt quickly to new, unseen tasks. Meta-learning models have been proposed to facilitate quick learning in low-data regimes by employing information absorbed from the past. Although some recently introduced models have reached high performance levels, they are not biologically plausible. In our research, we propose a bio-plausible meta-learning model inspired by the hippocampus and the prefrontal cortex, using spiking neural networks with a reward-based learning system. The major contribution of our work lies in the design of a bio-plausible meta-learning framework that incorporates learning rules such as Spike-Timing-Dependent Plasticity (STDP) and Reward-Modulated STDP (R-STDP). This framework not only reflects biological learning mechanisms more accurately but also attains results comparable to those achieved by traditional gradient descent-based approaches to meta-learning. Our proposed model includes a memory designed to prevent catastrophic forgetting, a phenomenon that occurs when meta-learning models forget what they have learned so far once learning a new task begins. Furthermore, our model can easily be applied to spike-based neuromorphic devices and enables fast learning in neuromorphic hardware. The implications and predictions of various models for solving few-shot classification tasks are extensively analyzed. Based on the results, our model has demonstrated the ability to compete with existing state-of-the-art meta-learning techniques, representing a significant step towards creating AI systems that emulate the human brain's ability to learn quickly and efficiently from limited data.
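The abstract names STDP and R-STDP without stating their update rules. The following is a minimal, generic sketch of one common simplified form of reward-modulated STDP, given only to illustrate the idea; it is not taken from the paper. The function names, the learning rates a_plus and a_minus, the soft-bound term w * (1 - w), and the +1/-1 reward coding are all assumptions made for illustration.

```python
def stdp_delta(w, pre_before_post, a_plus=0.004, a_minus=0.003):
    """Simplified STDP step for one synapse (hypothetical parameters).

    pre_before_post=True means the presynaptic spike preceded the
    postsynaptic spike (causal pairing), which potentiates the synapse;
    otherwise the synapse is depressed. The factor w * (1 - w) is a soft
    bound that shrinks updates as w approaches 0 or 1.
    """
    bound = w * (1.0 - w)
    return a_plus * bound if pre_before_post else -a_minus * bound


def r_stdp_update(w, pre_before_post, reward, a_plus=0.004, a_minus=0.003):
    """Reward-modulated STDP (assumed form): the STDP step is scaled by a
    reward signal, so reward=+1 applies plain STDP and reward=-1 reverses
    the sign of the update. The result is clipped to [0, 1]."""
    dw = stdp_delta(w, pre_before_post, a_plus, a_minus)
    return min(1.0, max(0.0, w + reward * dw))


# Causal and anti-causal pairings under reward and under punishment.
w0 = 0.5
causal_rewarded = r_stdp_update(w0, True, reward=+1)       # strengthened
anticausal_rewarded = r_stdp_update(w0, False, reward=+1)  # weakened
causal_punished = r_stdp_update(w0, True, reward=-1)       # weakened
anticausal_punished = r_stdp_update(w0, False, reward=-1)  # strengthened
```

In this sketch a rewarded decision reinforces the synapses whose spike timing supported it, while a punished decision pushes the same synapses in the opposite direction.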
Pages: 10
Related papers
50 in total
  • [1] Brain Inspired Sequences Production by Spiking Neural Networks With Reward-Modulated STDP
    Fang, Hongjian
    Zeng, Yi
    Zhao, Feifei
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15
  • [2] BioLCNet: Reward-Modulated Locally Connected Spiking Neural Networks
    Ghaemi, Hafez
    Mirzaei, Erfan
    Nouri, Mahbod
    Kheradpisheh, Saeed Reza
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2022, PT II, 2023, 13811 : 564 - 578
  • [3] A Bio-Inspired Hierarchical Spiking Neural Network With Reward-Modulated STDP Learning Rule for AER Object Recognition
    Zhou, Qian
    Li, Xiaohu
    IEEE SENSORS JOURNAL, 2022, 22 (16) : 16323 - 16338
  • [4] Biologically Realizable Reward-Modulated Hebbian Training for Spiking Neural Networks
    Ferrari, Silvia
    Mehta, Bhavesh
    Di Muro, Gianluca
    VanDongen, Antonius M. J.
    Henriquez, Craig
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1780 - 1786
  • [5] A Low-Cost FPGA Implementation of Spiking Extreme Learning Machine With On-Chip Reward-Modulated STDP Learning
    He, Zhen
    Shi, Cong
    Wang, Tengxiao
    Wang, Ying
    Tian, Min
    Zhou, Xichuan
    Li, Ping
    Liu, Liyuan
    Wu, Nanjian
    Luo, Gang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1657 - 1661
  • [6] Mapping Spatio-temporally Encoded Patterns by Reward-Modulated STDP in Spiking Neurons
    Ozturk, Ibrahim
    Halliday, David M.
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [7] Spiking Neural Network Actor–Critic Reinforcement Learning with Temporal Coding and Reward-Modulated Plasticity
    D. S. Vlasov
    R. B. Rybka
    A. V. Serenko
    A. G. Sboev
    Moscow University Physics Bulletin, 2024, 79 (Suppl 2) : S944 - S952
  • [8] Mechanisms of Reward-Modulated STDP and Winner-Take-All in Bayesian Spiking Decision-Making Circuit
    Yan, Hui
    Liu, Xinle
    Huo, Hong
    Fang, Tao
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 162 - 172
  • [9] Meta-learning spiking neural networks with surrogate gradient descent
    Stewart, Kenneth M.
    Neftci, Emre O.
    NEUROMORPHIC COMPUTING AND ENGINEERING, 2022, 2 (04):
  • [10] Statistical Mechanics of Reward-Modulated Learning in Decision-Making Networks
    Katahira, Kentaro
    Okanoya, Kazuo
    Okada, Masato
    NEURAL COMPUTATION, 2012, 24 (05) : 1230 - 1270