Augmenting Reinforcement Learning to Enhance Cooperation in the Iterated Prisoner's Dilemma

被引：2

作者：

Feehan, Grace ^{[1
]}

Fatima, Shaheen ^{[1
]}

机构：

[1] Loughborough Univ, Epinal Way, Loughborough, Leics, England

来源：

ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3 | 2022年

基金：

英国科研创新办公室;

关键词：

Iterated Prisoner's Dilemma; Interaction Structure; Multiagent Reinforcement Learning; Mood; Cooperation Index;

D O I：

10.5220/0010787500003116

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning algorithms applied to social dilemmas sometimes struggle with converging to mutual cooperation against like-minded partners, particularly when utilising greedy behavioural selection methods. Recent research has demonstrated how affective cognitive mechanisms, such as mood and emotion, might facilitate increased rates of mutual cooperation when integrated with these algorithms. This research has, thus far, primarily utilised mobile multi-agent frameworks to demonstrate this relationship - where they have also identified interaction structure as a key determinant of the emergence of cooperation. Here, we use a deterministic, static interaction structure to provide deeper insight into how a particular moody reinforcement learner might encourage the evolution of cooperation in the Iterated Prisoner's Dilemma. In a novel grid environment, we both replicated original test parameters and then varied the distribution of agents and the payoff matrix. We found that behavioural trends from past research were present (with suppressed magnitude), and that the proportion of mutual cooperations was heightened when both the influence of mood and the cooperation index of the payoff matrix chosen increased. Changing the proportion of moody agents in the environment only increased mutual cooperations by virtue of introducing cooperative agents to each other.

引用

页码：146 / 157

页数：12

共 50 条

[1] Causal Reinforcement Learning in Iterated Prisoner's Dilemma
Kazemi, Yosra
Chanel, Caroline P. C.
Givigi, Sidney
[J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2523 - 2534
[2] Multiagent reinforcement learning in the Iterated Prisoner's Dilemma
Sandholm, TW
Crites, RH
[J]. BIOSYSTEMS, 1996, 37 (1-2) : 147 - 166
[3] Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma: Fast Cooperation through Evolved Payoffs
Vassiliades, Vassilis
Christodoulou, Chris
[J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[4] A rationalization of cooperation in the iterated prisoner's dilemma
Spohn, W
[J]. RATIONALITY, RULES, AND STRUCTURE, 2000, 28 : 67 - 84
[5] Evolving Cooperation for the Iterated Prisoner's Dilemma
Finocchiaro, Jessica
Mathias, H. David
[J]. PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 199 - 200
[6] Clans and Cooperation in the Iterated Prisoner's Dilemma
Julstrom, Bryant A.
[J]. PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION COMPANION (GECCO'12), 2012, : 1463 - 1464
[7] On Learning Stable Cooperation in the Iterated Prisoner's Dilemma with Paid Incentives
Sun, Xiyue
Pieroth, Fabian R.
Schmid, Kyrill
Wirsing, Martin
Belzner, Lenz
[J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2022, : 113 - 118
[8] Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma
Harper, Marc
Knight, Vincent
Jones, Martin
Koutsovoulos, Georgios
Glynatsi, Nikoleta E.
Campbell, Owen
[J]. PLOS ONE, 2017, 12 (12):
[9] The emergence of cooperation in asynchronous iterated prisoner's dilemma
Cornforth, David
Newth, David
[J]. SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2006, 4247 : 742 - 749
[10] Risk consideration and cooperation in the iterated prisoner's dilemma
Zeng, Weijun
Li, Minqiang
Chen, Fuzan
Nan, Guofang
[J]. SOFT COMPUTING, 2016, 20 (02) : 567 - 587

← 1 2 3 4 5 →