Malthusian Reinforcement Learning

被引:0
|
作者
Leibo, Joel Z. [1 ]
Perolat, Julien [1 ]
Hughes, Edward [1 ]
Wheelwright, Steven [1 ]
Marblestone, Adam H. [1 ]
Duenez-Guzman, Edgar [1 ]
Sunehag, Peter [1 ]
Dunning, Iain [1 ]
Graepel, Thore [1 ]
机构
[1] DeepMind, London, England
关键词
Intrinsic motivation; Adaptive radiation; Demography; Evolution; Artificial general intelligence; EVOLUTION; DEMOGRAPHY; DISPERSAL; SELECTION; AFRICA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Here we explore a new algorithmic framework for multi-agent reinforcement learning, called Malthusian reinforcement learning, which extends self-play to include fitness-linked population size dynamics that drive ongoing innovation. In Malthusian RL, increases in a subpopulation's average return drive subsequent increases in its size, just as Thomas Malthus argued in 1798 was the relationship between preindustrial income levels and population growth [24]. Malthusian reinforcement learning harnesses the competitive pressures arising from growing and shrinking population size to drive agents to explore regions of state and policy spaces that they could not otherwise reach. Furthermore, in environments where there are potential gains from specialization and division of labor, we show that Malthusian reinforcement learning is better positioned to take advantage of such synergies than algorithms based on self-play.
引用
收藏
页码:1099 / 1107
页数:9
相关论文
共 50 条
  • [1] The Advance of Reinforcement Learning and Deep Reinforcement Learning
    Lyu, Le
    Shen, Yang
    Zhang, Sicheng
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648
  • [2] The Malthusian Empire: A Malthusian Model of the Roman Economy
    Naff, Theodore
    [J]. HIRUNDO-MCGILL JOURNAL OF CLASSICAL STUDIES, 2012, 11 : 31 - 47
  • [3] The Malthusian Controversy
    Duncan, Otis Dudley
    [J]. AMERICAN JOURNAL OF SOCIOLOGY, 1952, 57 (06) : 611 - 611
  • [4] The Malthusian Controversy
    Peacock, Alan T.
    [J]. POPULATION STUDIES-A JOURNAL OF DEMOGRAPHY, 1952, 6 (01): : 109 - 110
  • [5] On Normative Reinforcement Learning via Safe Reinforcement Learning
    Neufeld, Emery A.
    Bartocci, Ezio
    Ciabattoni, Agata
    [J]. PRIMA 2022: PRINCIPLES AND PRACTICE OF MULTI-AGENT SYSTEMS, 2023, 13753 : 72 - 89
  • [6] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
    Agostinelli, Forest
    Hocquet, Guillaume
    Singh, Sameer
    Baldi, Pierre
    [J]. BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
  • [7] Reinforcement and learning
    Servedio, Maria R.
    Saether, Stein A.
    Saetre, Glenn-Peter
    [J]. EVOLUTIONARY ECOLOGY, 2009, 23 (01) : 109 - 123
  • [8] Reinforcement LEARNING
    Knight, Will
    [J]. TECHNOLOGY REVIEW, 2017, 120 (02) : 32 - 35
  • [9] Reinforcement learning
    Gallistel, CR
    [J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 1999, 11 (01) : 126 - 130
  • [10] Reinforcement learning
    Yatawatta, S.
    [J]. ASTRONOMY AND COMPUTING, 2024, 48