TEAMSTER: Model-based reinforcement learning for ad hoc teamwork

被引:2
|
作者
Ribeiro, Joao G. [1 ]
Rodrigues, Goncalo [2 ]
Sardinha, Alberto [1 ,3 ]
Melo, Francisco S. [1 ]
机构
[1] INESC ID, IST Taguspk, Av Prof Dr Cavaco Silva, P-2744016 Porto Salvo, Portugal
[2] Google, Google Bldg 110,Brandschenke Str 110, CH-8002 Zurich, Switzerland
[3] Pontif Catholic Univ Rio de Janeiro, Dept Informat, Rio De Janeiro, Brazil
基金
欧盟地平线“2020”;
关键词
Ad hoc teamwork; Reinforcement learning; Multi-agent systems;
D O I
10.1016/j.artint.2023.104013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the use of model-based reinforcement learning in the context of ad hoc teamwork. We introduce a novel approach, named TEAMSTER, where we propose learning both the environment's model and the model of the teammates' behavior separately. Compared to the state-of-the-art PLASTIC algorithms, our results in four different domains from the multi-agent systems literature show that TEAMSTER is more flexible than the PLASTIC-Model, by learning the environment's model instead of assuming a perfect hand-coded model, and more robust/efficient than PLASTIC-Policy, by being able to continuously adapt to newly encountered teams, without implicitly learning a new environment model from scratch. (c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] TEAMSTER: Model-Based Reinforcement Learning for Ad Hoc Teamwork (Abstract Reprint)
    Ribeiro, Joao G.
    Rodrigues, Goncalo
    Sardinha, Alberto
    Melo, Francisco S.
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22708 - 22708
  • [2] A Model-Based Reinforcement Learning Protocol for Routing in Vehicular Ad hoc Network
    Jafarzadeh, Omid
    Dehghan, Mehdi
    Sargolzaey, Hadi
    Esnaashari, Mohammad Mehdi
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2022, 123 (01) : 975 - 1001
  • [3] A Model-Based Reinforcement Learning Protocol for Routing in Vehicular Ad hoc Network
    Omid Jafarzadeh
    Mehdi Dehghan
    Hadi Sargolzaey
    Mohammad Mehdi Esnaashari
    [J]. Wireless Personal Communications, 2022, 123 : 975 - 1001
  • [4] A Model-Based Reinforcement Learning Algorithm for Routing in Energy Harvesting Mobile Ad-Hoc Networks
    Meisam Maleki
    Vesal Hakami
    Mehdi Dehghan
    [J]. Wireless Personal Communications, 2017, 95 : 3119 - 3139
  • [5] A Model-Based Reinforcement Learning Algorithm for Routing in Energy Harvesting Mobile Ad-Hoc Networks
    Maleki, Meisam
    Hakami, Vesal
    Dehghan, Mehdi
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2017, 95 (03) : 3119 - 3139
  • [6] Ad hoc teamwork by learning teammates' task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2016, 30 (02) : 175 - 219
  • [7] Ad Hoc Teamwork by Learning Teammates' Task
    Melo, Francisco S.
    Sardinha, Alberto
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 577 - 578
  • [8] Ad hoc teamwork by learning teammates’ task
    Francisco S. Melo
    Alberto Sardinha
    [J]. Autonomous Agents and Multi-Agent Systems, 2016, 30 : 175 - 219
  • [9] Autonomous Learning Agents: Layered Learning and Ad Hoc Teamwork
    Stone, Peter
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 2 - 2
  • [10] Adaptive Routing for an Ad Hoc Network Based on Reinforcement Learning
    Desai, Rahul
    Patil, B. P.
    [J]. INTERNATIONAL JOURNAL OF BUSINESS DATA COMMUNICATIONS AND NETWORKING, 2015, 11 (02) : 40 - 52