A Hierarchical Approach to Population Training for Human-AI Collaboration

被引:0
|
作者
Loo, Yi [1 ]
Gong, Chen [1 ]
Meghjani, Malika [1 ]
机构
[1] Singapore Univ Technol & Design SUTD, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A major challenge for deep reinforcement learning (DRL) agents is to collaborate with novel partners that were not encountered by them during the training phase. This is specifically worsened by an increased variance in action responses when the DRL agents collaborate with human partners due to the lack of consistency in human behaviors. Recent work have shown that training a single agent as the best response to a diverse population of training partners significantly increases an agent's robustness to novel partners. We further enhance the population-based training approach by introducing a Hierarchical Reinforcement Learning (HRL) based method for Human-AI Collaboration. Our agent is able to learn multiple best-response policies as its low-level policy while at the same time, it learns a high-level policy that acts as a manager which allows the agent to dynamically switch between the low-level best-response policies based on its current partner. We demonstrate that our method is able to dynamically adapt to novel partners of different play styles and skill levels in the 2-player collaborative Overcooked game environment. We also conducted a human study in the same environment to test the effectiveness of our method when partnering with real human subjects. Code is available at https://gitlab.com/marvl-hipt/hipt.
引用
收藏
页码:3011 / 3019
页数:9
相关论文
共 50 条
  • [41] AI and XAI second opinion: the danger of false confirmation in human-AI collaboration
    Rosenbacke, Rikard
    Melhus, Asa
    McKee, Martin
    Stuckler, David
    JOURNAL OF MEDICAL ETHICS, 2024,
  • [42] Working With and Around Artificial Intelligence: AI Crafting and Human-AI Collaboration in Recruitment
    Laukkarinen, Matti
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2025,
  • [43] Human-AI Collaboration in Quality Control with Augmented Manufacturing Analytics
    Bousdekis, Alexandros
    Wellsandt, Stefan
    Bosani, Enrica
    Lepenioti, Katerina
    Apostolou, Dimitris
    Hribernik, Karl
    Mentzas, Gregoris
    ADVANCES IN PRODUCTION MANAGEMENT SYSTEMS: ARTIFICIAL INTELLIGENCE FOR SUSTAINABLE AND RESILIENT PRODUCTION SYSTEMS, APMS 2021, PT IV, 2021, 633 : 303 - 310
  • [44] Colorbo: Envisioned Mandala Coloring through Human-AI Collaboration
    Kim, Eunseo
    Hong, Jeongmin
    Lee, Hyuna
    Ko, Minsam
    IUI'22: 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2022, : 15 - 26
  • [45] Preparing Future Designers for Human-AI Collaboration in Persona Creation
    Goel, Toshali
    Shaer, Orit
    Gu, Quan
    Delcourt, Catherine
    Cooper, Angel
    PROCEEDINGS OF THE 2ND ANNUAL MEETING OF THE SYMPOSIUM ON HUMAN-COMPUTER INTERACTION FOR WORK, CHIWORK 2023, 2023,
  • [46] REGROW: Reimagining Global Crowdsourcing for Better Human-AI Collaboration
    Alorwu, Andy
    Savage, Saiph
    van Berkel, Niels
    Ustalov, Dmitry
    Drutsa, Alexey
    Oppenlaender, Jonas
    Bates, Oliver
    Hettiachchi, Danula
    Gadiraju, Ujwal
    Goncalves, Jorge
    Hosio, Simo
    EXTENDED ABSTRACTS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2022, 2022,
  • [47] Applications of Human-AI Collaboration: Insights from Theory and Practice
    Oeste-Reiß, Sarah
    Bittner, Eva
    Ebel, Philipp Alexander
    Söllner, Matthias
    Proceedings of the Annual Hawaii International Conference on System Sciences, 2022, 2022-January : 194 - 195
  • [48] Understanding Choice Independence and Error Types in Human-AI Collaboration
    Erlei, Alexander
    Sharma, Abhinav
    Gadiraju, Ujwal
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [49] Towards Stronger Adversarial Baselines Through Human-AI Collaboration
    You, Wencong
    Lowd, Daniel
    PROCEEDINGS OF THE FIRST WORKSHOP ON EFFICIENT BENCHMARKING IN NLP (NLP POWER 2022), 2022, : 11 - 21
  • [50] Benchmarking Human-AI collaboration for common evidence appraisal tools
    Woelfle, Tim
    Hirt, Julian
    Janiaud, Perrine
    Kappos, Ludwig
    Ioannidis, John P. A.
    Hemkens, Lars G.
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2024, 175