Meta-learning in spiking neural networks with reward-modulated STDP

Cited by: 3
Authors
Khoee, Arsham Gholamzadeh [1]
Javaheri, Alireza [2]
Kheradpisheh, Saeed Reza [2]
Ganjtabesh, Mohammad [1]
Affiliations
[1] Univ Tehran, Coll Sci, Sch Math Stat & Comp Sci, Dept Comp Sci, Tehran, Iran
[2] Shahid Beheshti Univ, Fac Math Sci, Dept Comp & Data Sci, Tehran, Iran
Keywords
Meta-learning; Few-shot learning; Learning to learn; Spiking neurons; STDP; Reward-modulated STDP; PFC; Hippocampus; TIMING-DEPENDENT PLASTICITY; PREFRONTAL CORTEX; NEURONS; DECISION; SELECTION; PROGRESS; CHOICES
DOI
10.1016/j.neucom.2024.128173
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The human brain constantly learns and rapidly adapts to new situations by integrating acquired knowledge and experiences into memory. Developing this capability in machine learning models is considered an important goal of AI research, since deep neural networks perform poorly when data is limited or when they must adapt quickly to new, unseen tasks. Meta-learning models have been proposed to facilitate quick learning in low-data regimes by drawing on previously acquired knowledge. Although several recently introduced models reach high performance, they are not biologically plausible. We propose a bio-plausible meta-learning model inspired by the hippocampus and the prefrontal cortex, built from spiking neural networks with a reward-based learning system. The major contribution of our work lies in the design of a bio-plausible meta-learning framework that incorporates learning rules such as Spike-Timing-Dependent Plasticity (STDP) and Reward-Modulated STDP (R-STDP). This framework not only reflects biological learning mechanisms more accurately but also attains results competitive with those of traditional gradient descent-based approaches to meta-learning. Our proposed model includes a memory designed to prevent catastrophic forgetting, a phenomenon in which a meta-learning model forgets what it has learned so far once it begins learning a new task. Furthermore, our model can easily be applied to spike-based neuromorphic devices and enables fast learning in neuromorphic hardware. The implications and predictions of various models for solving few-shot classification tasks are extensively analyzed. Based on the results, our model has demonstrated the ability to compete with existing state-of-the-art meta-learning techniques, representing a significant step towards creating AI systems that emulate the human brain's ability to learn quickly and efficiently from limited data.
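For readers unfamiliar with the learning rules named in the abstract, the following is a minimal, generic sketch of a pair-based STDP window whose sign is modulated by a scalar reward. It is not the authors' model: the function names, the exponential window, and all parameter values (`a_plus`, `a_minus`, `tau`, the weight bounds) are assumptions chosen only for illustration.

```python
import math


def stdp_delta(dt, a_plus=0.01, a_minus=0.012, tau=20.0):
    """Pair-based STDP window (illustrative parameters, not from the paper).

    dt is t_post - t_pre in milliseconds. Pre-before-post (dt >= 0)
    gives potentiation; post-before-pre (dt < 0) gives depression.
    """
    if dt >= 0:
        return a_plus * math.exp(-dt / tau)
    return -a_minus * math.exp(dt / tau)


def r_stdp_update(w, dt, reward, w_min=0.0, w_max=1.0):
    """Reward-modulated update: the STDP change is scaled by a scalar reward.

    reward = +1 keeps the plain STDP change, reward = -1 reverses it
    (an assumed, simplified modulation). The result is clipped to
    [w_min, w_max].
    """
    w_new = w + reward * stdp_delta(dt)
    return min(w_max, max(w_min, w_new))
```

Under these assumptions, a causal spike pair (`dt > 0`) strengthens the synapse when the outcome is rewarded and weakens it when the outcome is punished, which is the basic idea behind using a reward signal to steer an otherwise unsupervised plasticity rule.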
Pages: 10