Diversifying behaviors for learning in Asymmetric Multiagent Systems

被引:2
|
作者
Dixit, Gaurav [1 ]
Gonzalez, Everardo [1 ]
Tumer, Kagan [1 ]
机构
[1] Oregon State Univ, Corvallis, OR 97331 USA
基金
美国国家科学基金会;
关键词
Adaptive Team Balancing; Quality Diversity; Multiagent learning; Evolution;
D O I
10.1145/3512290.3528860
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To achieve coordination in multiagent systems such as air traffic control or search and rescue, agents must not only evolve their policies, but also adapt to the behaviors of other agents. However, extending coevolutionary algorithms to complex domains is difficult because agents evolve in the dynamic environment created by the changing policies of other agents. This problem is exacerbated when the teams consist of diverse asymmetric agents (agents with different capabilities and objectives), making it difficult for agents to evolve complementary policies. Quality-Diversity methods solve part of the problem by allowing agents to discover not just optimal, but diverse behaviors, but are computationally intractable in multiagent settings. This paper introduces a multiagent learning framework to allow asymmetric agents to specialize and explore diverse behaviors needed for coordination in a shared environment. The key insight of this work is that a hierarchical decomposition of diversity search, fitness optimization, and team composition modeling allows the fitness on the team-wide objective to direct the diversity search in a dynamic environment. Experimental results in multiagent environments with temporal and spatial coupling requirements demonstrate the diversity of acquired agent synergies in response to a changing environment and team compositions.
引用
收藏
页码:350 / 358
页数:9
相关论文
共 50 条
  • [1] Informed Diversity Search for Learning in Asymmetric Multiagent Systems
    Dixit, Gaurav
    Tumer, Kagan
    PROCEEDINGS OF THE 2024 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2024, 2024, : 313 - 321
  • [2] Asymmetric multiagent reinforcement learning
    Könönen, V
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [3] Learning Synergies for Multi-Objective Optimization in Asymmetric Multiagent Systems
    Dixit, Gaurav
    Tumer, Kagan
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 447 - 455
  • [4] Learning Complementary Multiagent Behaviors: A Case Study
    Kalyanakrishnan, Shivaram
    Stone, Peter
    ROBOCUP 2009: ROBOT SOCCER WORLD CUP XIII, 2010, 5949 : 153 - 165
  • [5] Asymmetric multiagent reinforcement learning in pricing applications
    Könönen, V
    Oja, E
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1097 - 1102
  • [6] Collective learning in multiagent systems
    Calderoni, S
    ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 465 - 466
  • [7] Evolution and learning in multiagent systems
    Sen, S
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 1998, 48 (01) : 1 - 7
  • [8] Multiagent Learning of Coordination in Loosely Coupled Multiagent Systems
    Yu, Chao
    Zhang, Minjie
    Ren, Fenghui
    Tan, Guozhen
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (12) : 2853 - 2867
  • [9] Dynamic pricing based on asymmetric multiagent reinforcement learning
    Könönen, V
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (01) : 73 - 98
  • [10] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Ana L. C. Bazzan
    Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375