Spiking Neural Networks with Different Reinforcement Learning (RL) Schemes in a Multiagent Setting

被引:2
|
作者
Christodoulou, Chris [1 ]
Cleanthous, Aristodemos [1 ]
机构
[1] Univ Cyprus, Dept Comp Sci, CY-1678 Nicosia, Cyprus
来源
CHINESE JOURNAL OF PHYSIOLOGY | 2010年 / 53卷 / 06期
关键词
spiking neural networks; multiagent reinforcement learning; reward-modulated spike timing-dependent plasticity; TEMPORAL INTEGRATION; MODEL;
D O I
10.4077/CJP.2010.AMM030
中图分类号
Q4 [生理学];
学科分类号
071003 ;
摘要
This paper investigates the effectiveness of spiking agents when trained with reinforcement learning (RL) in a challenging multiagent task. In particular, it explores learning through reward-modulated spike-timing dependent plasticity (STDP) and compares it to reinforcement of stochastic synaptic transmission in the general-sum game of the Iterated Prisoner's Dilemma (IPD). More specifically. a computational model is developed where we implement two spiking neural networks as two "selfish" agents learning simultaneously but independently, competing in the IPD game. The purpose of our system (or collective) is to maximise its accumulated reward in the presence of reward-driven competing agents within the collective. This can only be achieved when the agents engage in a behaviour of mutual cooperation during the IPD. Previously, we successfully applied reinforcement of stochastic synaptic transmission to the IPD game. The current study utilises reward-modulated STDP with eligibility trace and results show that the system managed to exhibit the desired behaviour by establishing mutual cooperation between the agents. It is noted that the cooperative outcome was attained after a relatively short learning period which enhanced the accumulation of reward by the system. As in our previous implementation, the successful application of the learning algorithm to the IPD becomes possible only after we extended it with additional global reinforcement signals in order to enhance competition at the neuronal level. Moreover it is also shown that learning is enhanced (as indicated by an increased IPD cooperative outcome) through: (i) strong memory for each agent (regulated by a high eligibility trace time constant) and (ii) firing irregularity produced by equipping the agents' LIF neurons with a partial somatic reset mechanism.
引用
收藏
页码:447 / 453
页数:7
相关论文
共 50 条
  • [1] A reinforcement learning algorithm for spiking neural networks
    Florian, RV
    [J]. Seventh International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, Proceedings, 2005, : 299 - 306
  • [2] Learning in neural networks by reinforcement of irregular spiking
    Xie, XH
    Seung, HS
    [J]. PHYSICAL REVIEW E, 2004, 69 (04): : 10
  • [3] Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks
    Iranfar, Arman
    Zapater, Marina
    Atienza, David
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (04) : 1034 - 1047
  • [4] Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses
    Yuan, Mengwen
    Wu, Xi
    Yan, Rui
    Tang, Huajin
    [J]. NEURAL COMPUTATION, 2019, 31 (12) : 2368 - 2389
  • [5] Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
    Seung, HS
    [J]. NEURON, 2003, 40 (06) : 1063 - 1073
  • [6] Unsupervised Learning and Clustered Connectivity Enhance Reinforcement Learning in Spiking Neural Networks
    Weidel, Philipp
    Duarte, Renato
    Morrison, Abigail
    [J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15
  • [7] Reinforcement Learning in Memristive Spiking Neural Networks through Modulation of ReSuMe
    Ji, Xun
    Zhang, Yaozhong
    Li, Chuxi
    Wu, Tanghong
    Hu, Xiaofang
    [J]. ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS III, 2019, 2073
  • [8] Soft-Reward Based Reinforcement Learning by Spiking Neural Networks
    Shi, Weiya
    [J]. ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 770 - 773
  • [9] BrainQN: Enhancing the Robustness of Deep Reinforcement Learning with Spiking Neural Networks
    Feng, Shuo
    Cao, Jian
    Ou, Zehong
    Chen, Guang
    Zhong, Yi
    Wang, Zilin
    Yan, Juntong
    Chen, Jue
    Wang, Bingsen
    Zou, Chenglong
    Feng, Zebang
    Wang, Yuan
    [J]. ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (09)
  • [10] Distributed Neural Learning Algorithms for Multiagent Reinforcement Learning
    Dai, Pengcheng
    Liu, Hongzhe
    Yu, Wenwu
    Wang, He
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 21039 - 21060