Meta-learning in spiking neural networks with reward-modulated STDP

Cited by: 3
Authors
Khoee, Arsham Gholamzadeh [1]
Javaheri, Alireza [2]
Kheradpisheh, Saeed Reza [2]
Ganjtabesh, Mohammad [1]
Affiliations
[1] Univ Tehran, Coll Sci, Sch Math Stat & Comp Sci, Dept Comp Sci, Tehran, Iran
[2] Shahid Beheshti Univ, Fac Math Sci, Dept Comp & Data Sci, Tehran, Iran
Keywords
Meta-learning; Few-shot learning; Learning to learn; Spiking neurons; STDP; Reward-modulated STDP; PFC; Hippocampus; TIMING-DEPENDENT PLASTICITY; PREFRONTAL CORTEX; NEURONS; DECISION; SELECTION; PROGRESS; CHOICES;
DOI
10.1016/j.neucom.2024.128173
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The human brain constantly learns and rapidly adapts to new situations by integrating acquired knowledge and experiences into memory. Developing this capability in machine learning models is considered an important goal of AI research, since deep neural networks perform poorly when data is limited or when they need to adapt quickly to new, unseen tasks. Meta-learning models have been proposed to facilitate quick learning in low-data regimes by reusing information absorbed from past experience. Although some recently introduced models reach high performance levels, they are not biologically plausible. In our research, we propose a bio-plausible meta-learning model inspired by the hippocampus and the prefrontal cortex, using spiking neural networks with a reward-based learning system. The major contribution of our work lies in the design of a bio-plausible meta-learning framework that incorporates learning rules such as Spike-Timing-Dependent Plasticity (STDP) and Reward-Modulated STDP (R-STDP). This framework not only reflects biological learning mechanisms more accurately but also attains results competitive with those achieved by traditional gradient descent-based approaches to meta-learning. Our proposed model includes a memory designed to prevent catastrophic forgetting, a phenomenon in which meta-learning models forget what they have learned so far once learning a new task begins. Furthermore, our model can easily be applied to spike-based neuromorphic devices and enables fast learning in neuromorphic hardware. The implications and predictions of various models for solving few-shot classification tasks are extensively analyzed. Based on the results, our model has demonstrated the ability to compete with existing state-of-the-art meta-learning techniques, representing a significant step towards creating AI systems that emulate the human brain's ability to learn quickly and efficiently from limited data.
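As a rough illustration of the reward-modulated STDP idea named in the abstract, the following is a minimal sketch of one common simplified formulation; it is not taken from the paper. The function name `r_stdp_update`, the learning rates `a_plus` and `a_minus`, and the soft-bound term `w * (1 - w)` are all assumptions chosen for illustration. With a reward signal the update follows ordinary STDP (a presynaptic spike before the postsynaptic spike strengthens the synapse), and with a punishment signal the sign is flipped.

```python
def r_stdp_update(w, t_pre, t_post, reward, a_plus=0.05, a_minus=0.04):
    """Return an updated synaptic weight for one pre/post spike pair.

    Hypothetical, simplified rule (not the paper's exact formulation):
    - reward=True:  causal pairs (pre before post) potentiate, others depress.
    - reward=False: the sign is flipped (anti-STDP).
    The factor w * (1 - w) keeps a weight that starts in (0, 1) inside (0, 1).
    """
    causal = t_pre <= t_post
    if reward:
        delta = a_plus if causal else -a_minus
    else:
        delta = -a_minus if causal else a_plus
    return w + delta * w * (1.0 - w)


# Example: the same causal spike pair under reward and under punishment.
w_rewarded = r_stdp_update(0.5, t_pre=3.0, t_post=5.0, reward=True)
w_punished = r_stdp_update(0.5, t_pre=3.0, t_post=5.0, reward=False)
```

In this sketch, the rewarded causal pair raises the weight above its starting value and the punished one lowers it, which is the qualitative behavior that distinguishes R-STDP from unsupervised STDP.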
Pages: 10
Related papers
50 in total
  • [21] A Computational Model of Match Decision-Making Problem Using Spiking SHESN with Reward-Modulated Reinforcement Learning
    Deng, Zhidong
    Yang, Guorun
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 512 - 521
  • [22] Biologically Plausible Models of Homeostasis and STDP: Stability and Learning in Spiking Neural Networks
    Carlson, Kristofor D.
    Richert, Micah
    Dutt, Nikil
    Krichmar, Jeffrey L.
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [23] Unsupervised Learning Based on Temporal Coding Using STDP in Spiking Neural Networks
    Sun, Congyi
    Chen, Qinyu
    Chen, Kai
    He, Guoqiang
    Fu, Yuxiang
    Li, Li
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2142 - 2146
  • [24] Optimizing Generic Neural Microcircuits through Reward Modulated STDP
    Joshi, Prashant
    Triesch, Jochen
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 239 - 248
  • [25] A Reservoir Computing Model of Reward-Modulated Motor Learning and Automaticity
    Pyle, Ryan
    Rosenbaum, Robert
    NEURAL COMPUTATION, 2019, 31 (07) : 1430 - 1461
  • [26] Neuron as a reward-modulated combinatorial switch and a model of learning behavior
    Rvachev, Marat M.
    NEURAL NETWORKS, 2013, 46 : 62 - 74
  • [27] Population coding for a reward-modulated Hebbian learning of vergence control
    Gibaldi, Agostino
    Canessa, Andrea
    Chessa, Manuela
    Solari, Fabio
    Sabatini, Silvio P.
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [28] Memristive Spiking Neural Networks Trained with Unsupervised STDP
    Zhou, Errui
    Fang, Liang
    Yang, Binbin
    ELECTRONICS, 2018, 7 (12)
  • [29] Paired competing neurons improving STDP supervised local learning in Spiking Neural Networks
    Goupy, Gaspard
    Tirilly, Pierre
    Bilasco, Ioan Marius
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [30] Touch Modality Classification using Spiking Neural Networks and Supervised-STDP Learning
    Dabbous, Ali
    Ibrahim, Ali
    Valle, Maurizio
    Bartolozzi, Chiara
    2021 28TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (IEEE ICECS 2021), 2021,