Multiagent Reinforcement Learning with Spiking and Non-Spiking Agents in the Iterated Prisoner's Dilemma

被引:0
|
作者
Vassiliades, Vassilis [1 ]
Cleanthous, Aristodemos [1 ]
Christodoulou, Chris [1 ]
机构
[1] Univ Cyprus, Dept Comp Sci, CY-1678 Nicosia, Cyprus
关键词
ANSWER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates Multiagent Reinforcement Learning (MARL) in a general-sum game where the payoffs' structure is such that the agents are required to exploit each other in a way that benefits all agents. The contradictory nature of these games makes their study in multiagent systems quite challenging. In particular, we investigate MARL with spiking and non-spiking agents in the Iterated Prisoner's Dilemma by exploring the conditions required to enhance its cooperative outcome. According to the results, this is enhanced by: (i) a mixture of positive and negative payoff values and a high discount factor in the case of non-spiking agents and (ii) having longer eligibility trace time constant in the case of spiking agents. Moreover, it is shown that spiking and non-spiking agents have similar behaviour and therefore they can equally well be used in any multiagent interaction setting. For training the spiking agents, a novel and necessary modification enhances competition to an existing learning rule based on stochastic synaptic transmission.
引用
收藏
页码:737 / 746
页数:10
相关论文
共 50 条
  • [21] Learning versus evolution in iterated prisoner's dilemma
    Hingston, P
    Kendall, G
    CEC2004: PROCEEDINGS OF THE 2004 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2004, : 364 - 372
  • [22] Response properties of spiking and non-spiking brain neurons mirror pulse interval selectivity
    Zhang, Xinyang
    Hedwig, Berthold
    FRONTIERS IN CELLULAR NEUROSCIENCE, 2022, 16
  • [23] Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's dilemma
    Masuda, Naoki
    Nakamura, Mitsuhiro
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 278 (01) : 55 - 62
  • [24] Cooperation-Eliciting Prisoner's Dilemma Payoffs for Reinforcement Learning Agents
    Moriyama, Koichi
    Kurihara, Satoshi
    Numao, Masayuki
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1619 - 1620
  • [25] NONLINEAR MECHANICAL MODEL OF A NON-SPIKING MUSCLE RECEPTOR
    BERGER, CS
    BUSH, BMH
    JOURNAL OF EXPERIMENTAL BIOLOGY, 1979, 83 (DEC): : 339 - 343
  • [26] Spiking Neural Networks with Different Reinforcement Learning (RL) Schemes in a Multiagent Setting
    Christodoulou, Chris
    Cleanthous, Aristodemos
    CHINESE JOURNAL OF PHYSIOLOGY, 2010, 53 (06): : 447 - 453
  • [27] Processing of sensory signals by a non-spiking neuron in the leech
    Marín-Burgin, A
    Szczupak, L
    JOURNAL OF COMPARATIVE PHYSIOLOGY A-SENSORY NEURAL AND BEHAVIORAL PHYSIOLOGY, 2000, 186 (10): : 989 - 997
  • [28] ACTION-POTENTIALS IN NON-SPIKING VISUAL INTERNEURONES
    ECKERT, HEA
    HAMDORF, K
    ZEITSCHRIFT FUR NATURFORSCHUNG C-A JOURNAL OF BIOSCIENCES, 1981, 36 (5-6): : 470 - 474
  • [29] Biological emergent properties in non-spiking neural networks
    Naudin, Lois
    AIMS MATHEMATICS, 2022, 7 (10): : 19415 - 19439
  • [30] Processing of sensory signals by a non-spiking neuron in the leech
    A. Marín-Burgin
    L. Szczupak
    Journal of Comparative Physiology A, 2000, 186 : 989 - 997