Ad hoc teamwork by learning teammates’ task

被引:0
|
作者
Francisco S. Melo
Alberto Sardinha
机构
[1] Universidade de Lisboa,INESC
[2] Universidade de Lisboa,ID and Instituto Superior Técnico
关键词
Ad hoc teamwork; Online learning; POMDP;
D O I
暂无
中图分类号
学科分类号
摘要
This paper addresses the problem of ad hoc teamwork, where a learning agent engages in a cooperative task with other (unknown) agents. The agent must effectively coordinate with the other agents towards completion of the intended task, not relying on any pre-defined coordination strategy. We contribute a new perspective on the ad hoc teamwork problem and propose that, in general, the learning agent should not only identify (and coordinate with) the teammates’ strategy but also identify the task to be completed. In our approach to the ad hoc teamwork problem, we represent tasks as fully cooperative matrix games. Relying exclusively on observations of the behavior of the teammates, the learning agent must identify the task at hand (namely, the corresponding payoff function) from a set of possible tasks and adapt to the teammates’ behavior. Teammates are assumed to follow a bounded-rationality best-response model and thus also adapt their behavior to that of the learning agent. We formalize the ad hoc teamwork problem as a sequential decision problem and propose two novel approaches to address it. In particular, we propose (i) the use of an online learning approach that considers the different tasks depending on their ability to predict the behavior of the teammate; and (ii) a decision-theoretic approach that models the ad hoc teamwork problem as a partially observable Markov decision problem. We provide theoretical bounds of the performance of both approaches and evaluate their performance in several domains of different complexity.
引用
收藏
页码:175 / 219
页数:44
相关论文
共 50 条
  • [1] Ad Hoc Teamwork by Learning Teammates' Task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 577 - 578
  • [2] Ad hoc teamwork by learning teammates' task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2016, 30 (02) : 175 - 219
  • [3] On-line estimators for ad-hoc task execution: learning types and parameters of teammates for effective teamwork
    Elnaz Shafipour Yourdshahi
    Matheus Aparecido do Carmo Alves
    Amokh Varma
    Leandro Soriano Marcolino
    Jó Ueyama
    Plamen Angelov
    [J]. Autonomous Agents and Multi-Agent Systems, 2022, 36
  • [4] On-line estimators for ad-hoc task execution: learning types and parameters of teammates for effective teamwork
    Yourdshahi, Elnaz Shafipour
    Alves, Matheus Aparecido do Carmo
    Varma, Amokh
    Marcolino, Leandro Soriano
    Ueyama, Jo
    Angelov, Plamen
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
  • [5] Learning with Generated Teammates to Achieve Type-Free Ad-Hoc Teamwork
    Xing, Dong
    Liu, Qianhui
    Zheng, Qian
    Pan, Gang
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 472 - 478
  • [6] Ad Hoc Teamwork in the Presence of Non-stationary Teammates
    Santos, Pedro M.
    Ribeiro, Joao G.
    Sardinha, Alberto
    Melo, Francisco S.
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2021), 2021, 12981 : 648 - 660
  • [7] Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork
    Barrett, Samuel
    Stone, Peter
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2010 - 2016
  • [8] Autonomous Learning Agents: Layered Learning and Ad Hoc Teamwork
    Stone, Peter
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 2 - 2
  • [9] ATSIS: Achieving the Ad hoc Teamwork by Sub-task Inference and Selection
    Chen, Shuo
    Andrejczuk, Ewa
    Irissappane, Athirai A.
    Zhang, Jie
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 172 - 179
  • [10] TEAMSTER: Model-based reinforcement learning for ad hoc teamwork
    Ribeiro, Joao G.
    Rodrigues, Goncalo
    Sardinha, Alberto
    Melo, Francisco S.
    [J]. ARTIFICIAL INTELLIGENCE, 2023, 324