The emergence of cooperation via Q-learning in spatial donation game

被引:0
|
作者
Zhang, Jing [1 ,2 ]
Rong, Zhihai [1 ]
Zheng, Guozhong [2 ]
Zhang, Jiqiang [3 ]
Chen, Li [2 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Shaanxi Normal Univ, Sch Phys & Informat Technol, Xian 710062, Peoples R China
[3] Ningxia Univ, Sch Phys, Yinchuan, Peoples R China
来源
JOURNAL OF PHYSICS-COMPLEXITY | 2024年 / 5卷 / 02期
关键词
evolutionary game; spatial game; reinforcement learning; INDIRECT RECIPROCITY; EVOLUTIONARY DYNAMICS; PUNISHMENT;
D O I
10.1088/2632-072X/ad3f65
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Decision-making often overlooks the feedback between agents and the environment. Reinforcement learning is widely employed through exploratory experimentation to address problems related to states, actions, rewards, decision-making in various contexts. This work considers a new perspective, where individuals continually update their policies based on interactions with the spatial environment, aiming to maximize cumulative rewards and learn the optimal strategy. Specifically, we utilize the Q-learning algorithm to study the emergence of cooperation in a spatial population playing the donation game. Each individual has a Q-table that guides their decision-making in the game. Interestingly, we find that cooperation emerges within this introspective learning framework, and a smaller learning rate and higher discount factor make cooperation more likely to occur. Through the analysis of Q-table evolution, we disclose the underlying mechanism for cooperation, which may provide some insights to the emergence of cooperation in the real-world systems.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Q-LEARNING
    WATKINS, CJCH
    DAYAN, P
    [J]. MACHINE LEARNING, 1992, 8 (3-4) : 279 - 292
  • [22] Deep Spatial Q-Learning for Infectious Disease Control
    Liu, Zhishuai
    Clifton, Jesse
    Laber, Eric B.
    Drake, John
    Fang, Ethan X.
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2023, 28 (04) : 749 - 773
  • [23] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
    Tan, Fuxiao
    Yan, Pengfei
    Guan, Xinping
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
  • [24] Backward Q-learning: The combination of Sarsa algorithm and Q-learning
    Wang, Yin-Hao
    Li, Tzuu-Hseng S.
    Lin, Chih-Jui
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (09) : 2184 - 2193
  • [25] Study of Cooperation Strategy of Robot Based on Parallel Q-Learning Algorithm
    Wang, Shuda
    Si, Feng
    Yang, Jing
    Wang, Shuoning
    Yang, Jun
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, PT I, PROCEEDINGS, 2008, 5314 : 633 - 642
  • [26] Q-learning boosts the evolution of cooperation in structured population by involving extortion
    Ding, Hong
    Zhang, Geng-shun
    Wang, Shi-hao
    Li, Juan
    Wang, Zhen
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 536
  • [27] CONTINUOUS ACTION GENERATION OF Q-LEARNING IN MULTI-AGENT COOPERATION
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Jiang, Wei-Cheng
    Lin, Tzung-Feng
    [J]. ASIAN JOURNAL OF CONTROL, 2013, 15 (04) : 1011 - 1020
  • [28] Real-Valued Q-learning in Multi-agent Cooperation
    Hwang, Kao-Shing
    Lo, Chia-Yue
    Chen, Kim-Joan
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 395 - 400
  • [29] Utility based Q-learning to maintain cooperation in Prisoners Dilemma games
    Moriyama, Koichi
    [J]. PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY (IAT 2007), 2007, : 146 - +
  • [30] Caching in Base Station with Recommendation via Q-Learning
    Guo, Kaiyang
    Yang, Chenyang
    Liu, Tingting
    [J]. 2017 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2017,