Modeling behavioral experiments on uncertainty and cooperation with population-based reinforcement learning

Cited by: 7
Authors
Domingos, Elias Fernandez [1, 2, 3]
Grujic, Jelena [1, 2]
Burguillo, Juan C. [3]
Santos, Francisco C. [2, 4, 5]
Lenaerts, Tom [1, 2]
Affiliations
[1] Vrije Univ Brussel, Comp Sci Dept, Artificial Intelligence Lab, B-1050 Brussels, Belgium
[2] Univ Libre Bruxelles, Dept Informat, Machine Learning Grp, B-1050 Brussels, Belgium
[3] Univ Vigo, atlanTTic Res Ctr, Vigo 36310, Spain
[4] Univ Lisbon, INESC ID, P-2744016 Porto Salvo, Portugal
[5] Univ Lisbon, Inst Super Tecn, P-2744016 Porto Salvo, Portugal
Keywords
Public goods game; Population dynamics; Individual learning; Collective risk; Uncertainty;
DOI
10.1016/j.simpat.2021.102299
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
From climate action to public health measures, human collective endeavors are often shaped by different uncertainties. Here we introduce a novel population-based learning model wherein a group of individuals facing a collective risk dilemma acquire their strategies over time through reinforcement learning, while handling different sources of uncertainty. In such an N-person collective risk dilemma, players make step-wise contributions to avoid a catastrophe that would result in a loss of wealth for all players. Success is attained if they collectively reach a certain contribution level over time or if, when the threshold is not reached, they are lucky enough to avoid the catastrophe. The dilemma lies in the trade-off between the proportion of personal wealth that players are willing to contribute to collectively reach the goal and the remainder of the wealth they can keep at the end of the game. We show that the strategies learned with the model correspond to those experimentally observed, even when there is uncertainty about the risk of failing when the goal is not reached, the magnitude of the threshold to attain, or the time available to reach the target. We furthermore confirm that being unsure about the time window favors more extreme reactions and polarization, diminishing the number of agents that contribute fairly. The population-based online learning framework we propose is general enough to be applicable to a wide range of collective action problems and to arbitrarily large sets of available policies.
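The payoff structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: the function name `expected_payoffs`, its parameters, and the assumption that players who fail to reach the threshold lose all of their remaining wealth with probability `risk` are hypothetical choices made for illustration only.

```python
from typing import List, Sequence


def expected_payoffs(
    contributions: Sequence[float],
    endowment: float,
    threshold: float,
    risk: float,
) -> List[float]:
    """Expected end-of-game wealth of each player in a one-shot summary
    of a collective risk dilemma.

    Assumptions (not taken from the paper): each player starts with the
    same endowment, `contributions` holds each player's total contribution
    over the whole game, and a catastrophe wipes out all remaining wealth.
    """
    if not 0.0 <= risk <= 1.0:
        raise ValueError("risk must be a probability in [0, 1]")
    total = sum(contributions)
    # If the group reaches the threshold, everyone keeps what they did not
    # contribute. Otherwise the remainder survives with probability 1 - risk.
    keep_probability = 1.0 if total >= threshold else 1.0 - risk
    return [keep_probability * (endowment - c) for c in contributions]


# Three players, endowment 40 each, threshold 60, risk 0.5.
print(expected_payoffs([20, 20, 20], 40, 60, 0.5))  # [20.0, 20.0, 20.0]
print(expected_payoffs([20, 20, 0], 40, 60, 0.5))   # [10.0, 10.0, 20.0]
```

In the second call the group misses the threshold, and the player who contributed nothing still ends up with the highest expected wealth, which is the tension between contributing and keeping wealth that the abstract describes.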
Pages: 16
Related papers
50 in total
  • [41] RLMD-PA: A Reinforcement Learning-Based Myocarditis Diagnosis Combined with a Population-Based Algorithm for Pretraining Weights
    Moravvej, Seyed Vahid
    Alizadehsani, Roohallah
    Khanam, Sadia
    Sobhaninia, Zahra
    Shoeibi, Afshin
    Khozeimeh, Fahime
    Sani, Zahra Alizadeh
    Tan, Ru-San
    Khosravi, Abbas
    Nahavandi, Saeid
    Kadri, Nahrizul Adib
    Azizan, Muhammad Mokhzaini
    Arunkumar, N.
    Acharya, U. Rajendra
    [J]. CONTRAST MEDIA & MOLECULAR IMAGING, 2022, 2022
  • [42] Module-based reinforcement learning: Experiments with a real robot
    Kalmar, Z
    Szepesvari, C
    Lorincz, A
    [J]. AUTONOMOUS ROBOTS, 1998, 5 (3-4) : 273 - 295
  • [43] Modeling cortisol rhythms in a population-based study
    Ranjit, N
    Young, EA
    Raghunathan, TE
    Kaplan, GA
    [J]. PSYCHONEUROENDOCRINOLOGY, 2005, 30 (07) : 615 - 624
  • [44] Population-based nutrikinetic modeling of polyphenol exposure
    Ewoud J. J. van Velzen
    Johan A. Westerhuis
    Christian H. Grün
    Doris M. Jacobs
    Paul H. C. Eilers
    Theo P. Mulder
    Martin Foltz
    Ursula Garczarek
    Rober Kemperman
    Elaine E. Vaughan
    John P. M. van Duynhoven
    Age K. Smilde
    [J]. Metabolomics, 2014, 10 : 1059 - 1073
  • [45] Population-based nutrikinetic modeling of polyphenol exposure
    van Velzen, Ewoud J. J.
    Westerhuis, Johan A.
    Grun, Christian H.
    Jacobs, Doris M.
    Eilers, Paul H. C.
    Mulder, Theo P.
    Foltz, Martin
    Garczarek, Ursula
    Kemperman, Rober
    Vaughan, Elaine E.
    van Duynhoven, John P. M.
    Smilde, Age K.
    [J]. METABOLOMICS, 2014, 10 (06) : 1059 - 1073
  • [46] Cooperation of multiple fish-like microrobots based on reinforcement learning
    Shao, Jinyan
    Wang, Long
    [J]. 2007 IEEE SYMPOSIUM ON ARTIFICIAL LIFE, 2006, : 348 - +
  • [47] Module-based reinforcement learning: Experiments with a real robot
    Kalmar, Z
    Szepesvari, C
    Lorincz, A
    [J]. MACHINE LEARNING, 1998, 31 (1-3) : 55 - 85
  • [48] Module-Based Reinforcement Learning: Experiments with a Real Robot
    Zsolt Kalmár
    Csaba Szepesvári
    András Lőrincz
    [J]. Autonomous Robots, 1998, 5 : 273 - 295
  • [49] Multi-agent reinforcement learning with cooperation based on eligibility traces
    杨玉君
    程君实
    陈佳品
    [J]. Journal of Harbin Institute of Technology(New series), 2004, (05) : 564 - 568
  • [50] Gamification Framework for Reinforcement Learning-based Neuropsychology Experiments
    Chetitah, Mounsif
    Mueller, Julian
    Deserno, Lorenz
    Waltmann, Maria
    von Mammen, Sebastian
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2023, 2023,