Generation of Robot Manipulation Plans Using Generative Large Language Models

Cited by: 0

Authors:

Toberg, Jan-Philipp [1,2]

Cimiano, Philipp [1,2]

Affiliations:

[1] Univ Bielefeld, Ctr Cognit Interact Technol CITEC, Bielefeld, Germany

[2] Univ Bielefeld, Joint Res Ctr Cooperat & Cognit Enabled CoAI JRC, Bielefeld, Germany

Source:

2023 SEVENTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2023 | 2023

Keywords:

Robot Plan Generation; Large Language Models; Action Similarity; CRAM; GPT

DOI:

10.1109/IRC59093.2023.00039

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Designing plans that allow robots to carry out actions such as grasping an object or cutting a fruit is a time-consuming activity requiring specific skills and knowledge. The recent success of Generative Large Language Models (LLMs) has opened new avenues for code generation. In order to evaluate the ability of LLMs to generate code representing manipulation plans, we carry out experiments with different LLMs in the CRAM framework. In our experimental framework, we ask an LLM such as ChatGPT or GPT-4 to generate a plan for a specific target action given the plan (called designator within CRAM) for a given reference action as an example. We evaluate the generated designators against a ground truth designator using machine translation and code generation metrics, as well as assessing whether they compile. We find that GPT-4 slightly outperforms ChatGPT, but both models achieve a solid performance across all evaluated metrics. However, only about 36% of the generated designators compile successfully. In addition, we assess how the chosen reference action influences the code generation quality as well as the compilation success. Unexpectedly, the action similarity negatively correlates with compilation success. With respect to the metrics, we obtain either a positive or negative correlation depending on the model used. Finally, we describe our attempt to use ChatGPT in an interactive fashion to incrementally refine the initially generated designator. On the basis of our observations we conclude that the behaviour of ChatGPT is not reliable and robust enough to support the incremental refinement of a designator.

Pages: 190-197

Page count: 8
