Multiagent Reinforcement Learning with Spiking and Non-Spiking Agents in the Iterated Prisoner's Dilemma

被引:0
|
作者
Vassiliades, Vassilis [1 ]
Cleanthous, Aristodemos [1 ]
Christodoulou, Chris [1 ]
机构
[1] Univ Cyprus, Dept Comp Sci, CY-1678 Nicosia, Cyprus
关键词
ANSWER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates Multiagent Reinforcement Learning (MARL) in a general-sum game where the payoffs' structure is such that the agents are required to exploit each other in a way that benefits all agents. The contradictory nature of these games makes their study in multiagent systems quite challenging. In particular, we investigate MARL with spiking and non-spiking agents in the Iterated Prisoner's Dilemma by exploring the conditions required to enhance its cooperative outcome. According to the results, this is enhanced by: (i) a mixture of positive and negative payoff values and a high discount factor in the case of non-spiking agents and (ii) having longer eligibility trace time constant in the case of spiking agents. Moreover, it is shown that spiking and non-spiking agents have similar behaviour and therefore they can equally well be used in any multiagent interaction setting. For training the spiking agents, a novel and necessary modification enhances competition to an existing learning rule based on stochastic synaptic transmission.
引用
收藏
页码:737 / 746
页数:10
相关论文
共 50 条
  • [1] Multiagent Reinforcement Learning: Spiking and Nonspiking Agents in the Iterated Prisoner's Dilemma
    Vassiliades, Vassilis
    Cleanthous, Aristodemos
    Christodoulou, Chris
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (04): : 639 - 653
  • [2] Multiagent reinforcement learning in the Iterated Prisoner's Dilemma
    Sandholm, TW
    Crites, RH
    BIOSYSTEMS, 1996, 37 (1-2) : 147 - 166
  • [3] Causal Reinforcement Learning in Iterated Prisoner's Dilemma
    Kazemi, Yosra
    Chanel, Caroline P. C.
    Givigi, Sidney
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2523 - 2534
  • [4] Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma: Fast Cooperation through Evolved Payoffs
    Vassiliades, Vassilis
    Christodoulou, Chris
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [5] Integrating Non-spiking Interneurons in Spiking Neural Networks
    Strohmer, Beck
    Stagsted, Rasmus Karnoe
    Manoonpong, Poramate
    Larsen, Leon Bonde
    FRONTIERS IN NEUROSCIENCE, 2021, 15
  • [6] PREMOTOR NON-SPIKING INTERNEURONS
    WILSON, JA
    PHILLIPS, CE
    PROGRESS IN NEUROBIOLOGY, 1983, 20 (1-2) : 89 - 107
  • [7] Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma
    Harper, Marc
    Knight, Vincent
    Jones, Martin
    Koutsovoulos, Georgios
    Glynatsi, Nikoleta E.
    Campbell, Owen
    PLOS ONE, 2017, 12 (12):
  • [8] Augmenting Reinforcement Learning to Enhance Cooperation in the Iterated Prisoner's Dilemma
    Feehan, Grace
    Fatima, Shaheen
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 146 - 157
  • [9] INTERGANGLIONIC COMMUNICATION BY SPIKING AND NON-SPIKING FIBERS IN SAME NEURON
    DICKINSON, PS
    NAGY, F
    MOULINS, M
    JOURNAL OF NEUROPHYSIOLOGY, 1981, 45 (06) : 1125 - 1138
  • [10] Connection Strategies in Associative Memory Models with Spiking and Non-spiking Neurons
    Chen, Weiliang
    Maex, Reinoud
    Adams, Rod
    Steuber, Volker
    Calcraft, Lee
    Davey, Neil
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2009, 5495 : 42 - 51