Ad Hoc Teamwork in the Presence of Non-stationary Teammates

被引:5
|
作者
Santos, Pedro M. [1 ,2 ]
Ribeiro, Joao G. [1 ,2 ]
Sardinha, Alberto [1 ,2 ]
Melo, Francisco S. [1 ,2 ]
机构
[1] INESC ID, Lisbon, Portugal
[2] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal
关键词
Multi-agent systems; Ad hoc teamwork problem; Reinforcement learning;
D O I
10.1007/978-3-030-86230-5_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we address the problem of ad hoc teamwork and contribute a novel approach, PPAS, that is able to handle non-stationary teammates. Current approaches to ad hoc teamwork assume that the (potentially unknown) teammates behave in a stationary way, which is a significant limitation in real world conditions, since humans and other intelligent systems do not necessarily follow strict policies. In our work we highlight the current limitations of state-of-the-art approaches to ad hoc teamwork problem in the presence of non-stationary teammate, and propose a novel solution that alleviates the stationarity assumption by combining ad hoc teamwork with adversarial online prediction. The proposed architecture is called PLASTIC Policy with Adversarial Selection, or PPAS. We showcase the effectiveness of our approach through an empirical evaluation in the half-field offense environment. Our results show that it is possible to cooperate in an ad hoc manner with non-stationary teammates in complex environments.
引用
收藏
页码:648 / 660
页数:13
相关论文
共 50 条
  • [1] Ad hoc teamwork by learning teammates' task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2016, 30 (02) : 175 - 219
  • [2] Ad Hoc Teamwork by Learning Teammates' Task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 577 - 578
  • [3] Ad hoc teamwork by learning teammates’ task
    Francisco S. Melo
    Alberto Sardinha
    [J]. Autonomous Agents and Multi-Agent Systems, 2016, 30 : 175 - 219
  • [4] Learning with Generated Teammates to Achieve Type-Free Ad-Hoc Teamwork
    Xing, Dong
    Liu, Qianhui
    Zheng, Qian
    Pan, Gang
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 472 - 478
  • [5] Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork
    Barrett, Samuel
    Stone, Peter
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2010 - 2016
  • [6] On-line estimators for ad-hoc task execution: learning types and parameters of teammates for effective teamwork
    Yourdshahi, Elnaz Shafipour
    Alves, Matheus Aparecido do Carmo
    Varma, Amokh
    Marcolino, Leandro Soriano
    Ueyama, Jo
    Angelov, Plamen
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
  • [7] On-line estimators for ad-hoc task execution: learning types and parameters of teammates for effective teamwork
    Elnaz Shafipour Yourdshahi
    Matheus Aparecido do Carmo Alves
    Amokh Varma
    Leandro Soriano Marcolino
    Jó Ueyama
    Plamen Angelov
    [J]. Autonomous Agents and Multi-Agent Systems, 2022, 36
  • [8] Online Ad Procurement in Non-stationary Autobidding Worlds
    Liang, Jason Cheuk Nam
    Lu, Haihao
    Zhou, Baoyu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] A Survey of Ad Hoc Teamwork Research
    Mirsky, Reuth
    Carlucho, Ignacio
    Rahman, Arrasy
    Fosong, Elliot
    Macke, William
    Sridharan, Mohan
    Stone, Peter
    Albrecht, Stefano, V
    [J]. MULTI-AGENT SYSTEMS, EUMAS 2022, 2022, 13442 : 275 - 293
  • [10] On-line Estimators for Ad-hoc Task Execution: Learning Types and Parameters of Teammates for Effective Teamwork JAAMAS Track
    do Carmo Alves, Matheus Ap.
    Yourdshahi, Elnaz Shafipour
    Varma, Amokh
    Marcolino, Leandro Soriano
    Ueyama, Jó
    Angelov, Plamen
    [J]. Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, 2023, 2023-May : 140 - 142