Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

被引：1

作者：

Zhang, Bowen ^{[1
]}

Soh, Harold ^{[2
]}

机构：

[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore

[2] NUS, Smart Syst Inst SSI, Singapore, Singapore

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/IROS55552.2023.10341488

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs) - which have consumed vast amounts of human-generated text data - to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment ( n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.

引用

页码：7961 / 7968

页数：8

共 50 条

[11] Multi-modal Language Models for Human-Robot Interaction
Janssens, Ruben
COMPANION OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 COMPANION, 2024, : 109 - 111
[12] Large language models for human–robot interaction: A review
Zhang C.
Chen J.
Li J.
Peng Y.
Mao Z.
Biomimetic Intelligence and Robotics, 2023, 3 (04):
[13] Comparison of various models of robot and human in human-robot interaction
Luh, JYS
Hu, SY
1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 1139 - 1144
[14] Language Models as Zero-Shot Trajectory Generators
Kwon, Teyun
Di Palo, Norman
Johns, Edward
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6728 - 6735
[15] Zero-shot Bilingual App Reviews Mining with Large Language Models
Wei, Jialiang
Courbis, Anne-Lise
Lambolais, Thomas
Xu, Binbin
Bernard, Pierre Louis
Dray, Gerard
2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 898 - 904
[16] Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models
Deng, Yinlin
Xia, Chunqiu Steven
Peng, Haoran
Yang, Chenyuan
Zhan, Lingming
PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 423 - 435
[17] Zero-Shot Generative Large Language Models for Systematic Review Screening Automation
Wang, Shuai
Scells, Harrisen
Zhuang, Shengyao
Potthast, Martin
Koopman, Bevan
Zuccon, Guido
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 403 - 420
[18] Zero-shot interpretable phenotyping of postpartum hemorrhage using large language models
Alsentzer, Emily
Rasmussen, Matthew J.
Fontoura, Romy
Cull, Alexis L.
Beaulieu-Jones, Brett
Gray, Kathryn J.
Bates, David W.
Kovacheva, Vesela P.
NPJ DIGITAL MEDICINE, 2023, 6 (01)
[19] Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models
Hillebrand, Lars
Berger, Armin
Deusser, Tobias
Dilmaghani, Tim
Khaled, Mohamed
Kliem, Bernd
Loitz, Ruediger
Pielka, Maren
Leonhard, David
Bauckhage, Christian
Sifa, Rafet
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,
[20] Zero-shot interpretable phenotyping of postpartum hemorrhage using large language models
Emily Alsentzer
Matthew J. Rasmussen
Romy Fontoura
Alexis L. Cull
Brett Beaulieu-Jones
Kathryn J. Gray
David W. Bates
Vesela P. Kovacheva
npj Digital Medicine, 6

← 1 2 3 4 5 →