Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning

被引:0
|
作者
Rahman, Arrasy [1 ]
Hopner, Niklas [2 ]
Christianos, Filippos [1 ]
Albrecht, Stefano, V [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Univ Amsterdam, Amsterdam, Netherlands
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ad hoc teamwork is the challenging problem of designing an autonomous agent which can adapt quickly to collaborate with teammates without prior coordination mechanisms, including joint training. Prior work in this area has focused on closed teams in which the number of agents is fixed. In this work, we consider open teams by allowing agents with different fixed policies to enter and leave the environment without prior notification. Our solution builds on graph neural networks to learn agent models and joint-action value models under varying team compositions. We contribute a novel action-value computation that integrates the agent model and joint-action value model to produce action-value estimates. We empirically demonstrate that our approach successfully models the effects other agents have on the learner, leading to policies that robustly adapt to dynamic team compositions and significantly outperform several alternative methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
    Rahman, Arrasy
    Carlucho, Ignacio
    Hopner, Niklas
    V. Albrecht, Stefano
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [2] A graph-based model for disconnected ad hoc networks
    De Pellegrini, Francesco
    Miorandi, Daniele
    Carreras, Iacopo
    Chlamtac, Imrich
    [J]. INFOCOM 2007, VOLS 1-5, 2007, : 373 - +
  • [3] Graph-based mobility model for mobile ad hoc network simulation
    Tian, J
    Hähner, J
    Becker, C
    Stepanov, E
    Rothermel, K
    [J]. 35TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2002, : 337 - 344
  • [4] A Graph-based Relevance Matching Model for Ad-hoc Retrieval
    Zhang, Yufeng
    Zhang, Jinghao
    Cui, Zeyu
    Wu, Shu
    Wang, Liang
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4688 - 4696
  • [5] Ad hoc teamwork by learning teammates' task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2016, 30 (02) : 175 - 219
  • [6] Ad Hoc Teamwork by Learning Teammates' Task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 577 - 578
  • [7] Ad hoc teamwork by learning teammates’ task
    Francisco S. Melo
    Alberto Sardinha
    [J]. Autonomous Agents and Multi-Agent Systems, 2016, 30 : 175 - 219
  • [8] Towards Large Scale Ad-hoc Teamwork
    Yourdshahi, Elnaz Shafipour
    Pinder, Thomas
    Dhawan, Gauri
    Marcolino, Leandro Soriano
    Angelov, Plamen
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON AGENTS (ICA), 2018, : 44 - 49
  • [9] TEAMSTER: Model-based reinforcement learning for ad hoc teamwork
    Ribeiro, Joao G.
    Rodrigues, Goncalo
    Sardinha, Alberto
    Melo, Francisco S.
    [J]. ARTIFICIAL INTELLIGENCE, 2023, 324
  • [10] Autonomous Learning Agents: Layered Learning and Ad Hoc Teamwork
    Stone, Peter
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 2 - 2