Generation of Robot Manipulation Plans Using Generative Large Language Models

被引:0
|
作者
Toberg, Jan-Philipp [1 ,2 ]
Cimiano, Philipp [1 ,2 ]
机构
[1] Univ Bielefeld, Ctr Cognit Interact Technol CITEC, Bielefeld, Germany
[2] Univ Bielefeld, Joint Res Ctr Cooperat & Cognit Enabled CoAI JRC, Bielefeld, Germany
关键词
Robot Plan Generation; Large Language Models; Action Similarity; CRAM; GPT;
D O I
10.1109/IRC59093.2023.00039
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Designing plans that allow robots to carry out actions such as grasping an object or cutting a fruit is a time-consuming activity requiring specific skills and knowledge. The recent success of Generative Large Language Models (LLMs) has opened new avenues for code generation. In order to evaluate the ability of LLMs to generate code representing manipulation plans, we carry out experiments with different LLMs in the CRAM framework. In our experimental framework, we ask an LLM such as ChatGPT or GPT-4 to generate a plan for a specific target action given the plan (called designator within CRAM) for a given reference action as an example. We evaluate the generated designators against a ground truth designator using machine translation and code generation metrics, as well as assessing whether they compile. We find that GPT-4 slightly outperforms ChatGPT, but both models achieve a solid performance above all evaluated metrics. However, only similar to 36% of the generated designators compile successfully. In addition, we assess how the chosen reference action influences the code generation quality as well as the compilation success. Unexpectedly, the action similarity negatively correlates with compilation success. With respect to the metrics, we obtain either a positive or negative correlation depending on the used model. Finally, we describe our attempt to use ChatGPT in an interactive fashion to incrementally refine the initially generated designator. On the basis of our observations we conclude that the behaviour of ChatGPT is not reliable and robust enough to support the incremental refinement of a designator.
引用
收藏
页码:190 / 197
页数:8
相关论文
共 50 条
  • [1] Generative Expressive Robot Behaviors using Large Language Models
    Mahadevan, Karthik
    Chien, Jonathan
    Brown, Noah
    Xu, Zhuo
    Parada, Carolina
    Xia, Fei
    Zeng, Andy
    Takayama, Leila
    Sadigh, Dorsa
    PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024, 2024, : 482 - 491
  • [2] PROGPROMPT: Generating Situated Robot Task Plans using Large Language Models
    Singh, Ishika
    Blukis, Valts
    Mousavian, Arsalan
    Goyal, Ankit
    Xu, Danfei
    Tremblay, Jonathan
    Fox, Dieter
    Thomason, Jesse
    Garg, Animesh
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 11523 - 11530
  • [3] PROGPROMPT: program generation for situated robot task planning using large language models
    Singh, Ishika
    Blukis, Valts
    Mousavian, Arsalan
    Goyal, Ankit
    Xu, Danfei
    Tremblay, Jonathan
    Fox, Dieter
    Thomason, Jesse
    Garg, Animesh
    AUTONOMOUS ROBOTS, 2023, 47 (08) : 999 - 1012
  • [4] ProgPrompt: program generation for situated robot task planning using large language models
    Ishika Singh
    Valts Blukis
    Arsalan Mousavian
    Ankit Goyal
    Danfei Xu
    Jonathan Tremblay
    Dieter Fox
    Jesse Thomason
    Animesh Garg
    Autonomous Robots, 2023, 47 : 999 - 1012
  • [5] Generative Large Language Models Explained
    Yan, Xueming
    Xiao, Yan
    Jin, Yaochu
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2024, 19 (04) : 45 - 46
  • [6] Foundation Models, Generative AI, and Large Language Models
    Ross, Angela
    McGrow, Kathleen
    Zhi, Degui
    Rasmy, Laila
    CIN-COMPUTERS INFORMATICS NURSING, 2024, 42 (05) : 377 - 387
  • [7] FinGPT: Large Generative Models for a Small Language
    Luukkonen, Risto
    Komulainen, Ville
    Luoma, Jouni
    Eskelinen, Anni
    Kanerva, Jenna
    Kupari, Hanna-Mari
    Ginter, Filip
    Laippala, Veronika
    Muennighoff, Niklas
    Piktus, Aleksandra
    Wang, Thomas
    Tazi, Nouamane
    Le Scao, Teven
    Wolf, Thomas
    Suominen, Osma
    Sairanen, Samuli
    Merioksa, Mikko
    Heinonen, Jyrki
    Vahtola, Aija
    Ffi, Samuel Antao
    Pyysalo, Sampo
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 2710 - 2726
  • [8] Journal policy on large language generative models
    Sessler, Daniel I.
    Turan, Alparslan
    JOURNAL OF CLINICAL ANESTHESIA, 2024, 96
  • [9] Open Generative Large Language Models for Galician
    Gamallo, Pablo
    Rodriguez, Pablo
    de-Dios-Flores, Iria
    Sotelo, Susana
    Paniagua, Silvia
    Bardanca, Daniel
    Ramom Pichel, Jose
    Garcia, Marcos
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2024, (73): : 259 - 270
  • [10] Generative Relevance Feedback with Large Language Models
    Mackie, Iain
    Chatterjee, Shubham
    Dalton, Jeffrey
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2026 - 2031