Augmenting Reinforcement Learning to Enhance Cooperation in the Iterated Prisoner's Dilemma

被引:2
|
作者
Feehan, Grace [1 ]
Fatima, Shaheen [1 ]
机构
[1] Loughborough Univ, Epinal Way, Loughborough, Leics, England
基金
英国科研创新办公室;
关键词
Iterated Prisoner's Dilemma; Interaction Structure; Multiagent Reinforcement Learning; Mood; Cooperation Index;
D O I
10.5220/0010787500003116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning algorithms applied to social dilemmas sometimes struggle with converging to mutual cooperation against like-minded partners, particularly when utilising greedy behavioural selection methods. Recent research has demonstrated how affective cognitive mechanisms, such as mood and emotion, might facilitate increased rates of mutual cooperation when integrated with these algorithms. This research has, thus far, primarily utilised mobile multi-agent frameworks to demonstrate this relationship - where they have also identified interaction structure as a key determinant of the emergence of cooperation. Here, we use a deterministic, static interaction structure to provide deeper insight into how a particular moody reinforcement learner might encourage the evolution of cooperation in the Iterated Prisoner's Dilemma. In a novel grid environment, we both replicated original test parameters and then varied the distribution of agents and the payoff matrix. We found that behavioural trends from past research were present (with suppressed magnitude), and that the proportion of mutual cooperations was heightened when both the influence of mood and the cooperation index of the payoff matrix chosen increased. Changing the proportion of moody agents in the environment only increased mutual cooperations by virtue of introducing cooperative agents to each other.
引用
收藏
页码:146 / 157
页数:12
相关论文
共 50 条
  • [1] Causal Reinforcement Learning in Iterated Prisoner's Dilemma
    Kazemi, Yosra
    Chanel, Caroline P. C.
    Givigi, Sidney
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2523 - 2534
  • [2] Multiagent reinforcement learning in the Iterated Prisoner's Dilemma
    Sandholm, TW
    Crites, RH
    [J]. BIOSYSTEMS, 1996, 37 (1-2) : 147 - 166
  • [3] Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma: Fast Cooperation through Evolved Payoffs
    Vassiliades, Vassilis
    Christodoulou, Chris
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [4] A rationalization of cooperation in the iterated prisoner's dilemma
    Spohn, W
    [J]. RATIONALITY, RULES, AND STRUCTURE, 2000, 28 : 67 - 84
  • [5] Evolving Cooperation for the Iterated Prisoner's Dilemma
    Finocchiaro, Jessica
    Mathias, H. David
    [J]. PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 199 - 200
  • [6] Clans and Cooperation in the Iterated Prisoner's Dilemma
    Julstrom, Bryant A.
    [J]. PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION COMPANION (GECCO'12), 2012, : 1463 - 1464
  • [7] On Learning Stable Cooperation in the Iterated Prisoner's Dilemma with Paid Incentives
    Sun, Xiyue
    Pieroth, Fabian R.
    Schmid, Kyrill
    Wirsing, Martin
    Belzner, Lenz
    [J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2022, : 113 - 118
  • [8] Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma
    Harper, Marc
    Knight, Vincent
    Jones, Martin
    Koutsovoulos, Georgios
    Glynatsi, Nikoleta E.
    Campbell, Owen
    [J]. PLOS ONE, 2017, 12 (12):
  • [9] The emergence of cooperation in asynchronous iterated prisoner's dilemma
    Cornforth, David
    Newth, David
    [J]. SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2006, 4247 : 742 - 749
  • [10] Risk consideration and cooperation in the iterated prisoner's dilemma
    Zeng, Weijun
    Li, Minqiang
    Chen, Fuzan
    Nan, Guofang
    [J]. SOFT COMPUTING, 2016, 20 (02) : 567 - 587