The emergence of cooperation via Q-learning in spatial donation game

被引:0
|
作者
Zhang, Jing [1 ,2 ]
Rong, Zhihai [1 ]
Zheng, Guozhong [2 ]
Zhang, Jiqiang [3 ]
Chen, Li [2 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Shaanxi Normal Univ, Sch Phys & Informat Technol, Xian 710062, Peoples R China
[3] Ningxia Univ, Sch Phys, Yinchuan, Peoples R China
来源
JOURNAL OF PHYSICS-COMPLEXITY | 2024年 / 5卷 / 02期
关键词
evolutionary game; spatial game; reinforcement learning; INDIRECT RECIPROCITY; EVOLUTIONARY DYNAMICS; PUNISHMENT;
D O I
10.1088/2632-072X/ad3f65
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Decision-making often overlooks the feedback between agents and the environment. Reinforcement learning is widely employed through exploratory experimentation to address problems related to states, actions, rewards, decision-making in various contexts. This work considers a new perspective, where individuals continually update their policies based on interactions with the spatial environment, aiming to maximize cumulative rewards and learn the optimal strategy. Specifically, we utilize the Q-learning algorithm to study the emergence of cooperation in a spatial population playing the donation game. Each individual has a Q-table that guides their decision-making in the game. Interestingly, we find that cooperation emerges within this introspective learning framework, and a smaller learning rate and higher discount factor make cooperation more likely to occur. Through the analysis of Q-table evolution, we disclose the underlying mechanism for cooperation, which may provide some insights to the emergence of cooperation in the real-world systems.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Evolution of cooperation in the public goods game with Q-learning
    Zheng, Guozhong
    Zhang, Jiqiang
    Deng, Shengfeng
    Cai, Weiran
    Chen, Li
    [J]. Chaos, Solitons and Fractals, 2024, 188
  • [2] Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game
    Yang, Zhengzhi
    Zheng, Lei
    Perc, Matjaz
    Li, Yumeng
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2024, 463
  • [3] Multi-Agent Cooperation Q-Learning Algorithm Based on Constrained Markov Game
    Ge, Yangyang
    Zhu, Fei
    Huang, Wei
    Zhao, Peiyao
    Liu, Quan
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2020, 17 (02) : 647 - 664
  • [4] Deep Q-Learning with Phased Experience Cooperation
    Wang, Hongbo
    Zeng, Fanbing
    Tu, Xuyan
    [J]. COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2019, 2019, 1042 : 752 - 765
  • [5] Q-learning in Multi-Agent Cooperation
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Lin, Tzung-Feng
    [J]. 2008 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2008, : 239 - 244
  • [6] Energy Cooperation in CoMP System Based on Q-learning
    Lv, Yabo
    Li, Baogang
    Yao, Yuanbin
    Guo, Dandan
    [J]. PROCEEDINGS OF 2017 11TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2017, : 90 - 94
  • [7] Hybrid Q-learning Algorithm About Cooperation in MAS
    Chen, Wei
    Guo, Jing
    Li, Xiong
    Wang, Jie
    [J]. CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 3943 - 3947
  • [8] Continuous Q-Learning for Multi-Agent Cooperation
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Lin, Yu-Hong
    Lai, Li-Hsin
    [J]. CYBERNETICS AND SYSTEMS, 2012, 43 (03) : 227 - 256
  • [9] Q-learning with FCMAC in multi-agent cooperation
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Lin, Tzung-Feng
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 599 - 606
  • [10] The Improvement of Q-learning Applied to Imperfect Information Game
    Lin, Jing
    Wang, Xuan
    Han, Lijiao
    Zhang, Jiajia
    Xi, Xinxin
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1562 - +