Modeling behavioral experiments on uncertainty and cooperation with population-based reinforcement learning

被引:7
|
作者
Domingos, Elias Fernandez [1 ,2 ,3 ]
Grujic, Jelena [1 ,2 ]
Burguillo, Juan C. [3 ]
Santos, Francisco C. [2 ,4 ,5 ]
Lenaerts, Tom [1 ,2 ]
机构
[1] Vrije Univ Brussel, Comp Sci Dept, Artificial Intelligence Lab, B-1050 Brussels, Belgium
[2] Univ Libre Bruxelles, Dept Informat, Machine Learning Grp, B-1050 Brussels, Belgium
[3] Univ Vigo, atlanTTic Res Ctr, Vigo 36310, Spain
[4] Univ Lisbon, INESC ID, P-2744016 Porto Salvo, Portugal
[5] Univ Lisbon, Inst Super Tecn, P-2744016 Porto Salvo, Portugal
关键词
Public goods game; Population dynamics; Individual learning; Collective risk; Uncertainty;
D O I
10.1016/j.simpat.2021.102299
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
From climate action to public health measures, human collective endeavors are often shaped by different uncertainties. Here we introduce a novel population-based learning model wherein a group of individuals facing a collective risk dilemma acquire their strategies over time through reinforcement learning, while handling different sources of uncertainty. In such an N-person collective risk dilemma players make step-wise contributions to avoid a catastrophe that would result in a loss of wealth for all players. Success is attained if they collectively reach a certain contribution level over time, or, when the threshold is not reached, they were lucky enough to avoid the cataclysm. The dilemma lies in the trade-off between the proportion of personal contributions that players wish to give to collectively reach the goal and the remainder of the wealth they can keep at the end of the game. We show that the strategies learned with the model correspond to those experimentally observed, even when there is uncertainty about either the risk of failing when the goal is not reached, the magnitude of the threshold to attain and the time available to reach the target. We furthermore confirm that being unsure about the time-window favors more extreme reactions and polarization, diminishing the number of agents that contribute fairly. The population-based on-line learning framework we propose is general enough to be applicable in a wide range of collective action problems and arbitrarily large sets of available policies.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] A Survey on Population-Based Deep Reinforcement Learning
    Long, Weifan
    Hou, Taixian
    Wei, Xiaoyi
    Yan, Shichao
    Zhai, Peng
    Zhang, Lihua
    [J]. MATHEMATICS, 2023, 11 (10)
  • [2] Fast Population-Based Reinforcement Learning on a Single Machine
    Flajolet, Arthur
    Monroc, Claire Bizon
    Beguir, Karim
    Pierrot, Thomas
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [3] Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
    Bai, Hui
    Cheng, Ran
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [4] A population-based approach for multi-agent interpretable reinforcement learning
    Crespi, Marco
    Ferigo, Andrea
    Custode, Leonardo Lucio
    Iacca, Giovanni
    [J]. APPLIED SOFT COMPUTING, 2023, 147
  • [5] Population-Based Survery Experiments
    Glantz, Alexander
    [J]. POLITISCHE VIERTELJAHRESSCHRIFT, 2012, 53 (03) : 553 - 555
  • [6] Population-Based Survey Experiments
    Janssen, Marco A.
    [J]. INTERNATIONAL JOURNAL OF THE COMMONS, 2012, 6 (01): : 70 - 71
  • [7] Population-Based Survey Experiments
    Land, Kenneth C.
    [J]. CONTEMPORARY SOCIOLOGY-A JOURNAL OF REVIEWS, 2012, 41 (06) : 822 - 823
  • [8] Implementing a Learning Collaborative for Population-Based Physical and Behavioral Health Integration
    Brar, Jaspreet S.
    Maise, Amanda A.
    Schake, Patricia
    Bills, Lyndra J.
    Washington, Leslie
    Nikolajski, Cara
    Herschell, Amy D.
    [J]. COMMUNITY MENTAL HEALTH JOURNAL, 2021, 57 (07) : 1361 - 1373
  • [9] Population based Reinforcement Learning
    Pretorius, Kyle W.
    Pillay, Nelishia
    [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [10] UAV Resource Cooperation Based on Reinforcement Learning
    Shan, Mingang
    Xiong, Jian
    Liu, Bo
    Shi, Zhiping
    Li, Xia
    Miao, Ningjie
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2021,