LLM-BT: Performing Robotic Adaptive Tasks based on Large Language Models and Behavior Trees

被引:0
|
作者
Zhou, Haotian [1 ]
Lin, Yunhan [1 ]
Yan, Longwu [1 ]
Zhu, Jihong [2 ]
Min, Huasong [1 ]
机构
[1] Wuhan Univ Sci & Technol, Inst Robot & Intelligent Syst, Wuhan, Peoples R China
[2] Univ York, Sch Phys Engn & Technol, York, N Yorkshire, England
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
10.1109/ICRA57147.2024.10610183
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs) have been widely utilized to perform complex robotic tasks. However, handling external disturbances during tasks is still an open challenge. This paper proposes a novel method to achieve robotic adaptive tasks based on LLMs and Behavior Trees (BTs). It utilizes ChatGPT to reason the descriptive steps of tasks. In order to enable ChatGPT to understand the environment, semantic maps are constructed by an object recognition algorithm. Then, we design a Parser module based on Bidirectional Encoder Representations from Transformers (BERT) to parse these steps into initial BTs. Subsequently, a BTs Update algorithm is proposed to expand the initial BTs dynamically to control robots to perform adaptive tasks. Different from other LLM-based methods for complex robotic tasks, our method outputs variable BTs that can add and execute new actions according to environmental changes, which is robust to external disturbances. Our method is validated with simulation in different practical scenarios.
引用
收藏
页码:16655 / 16661
页数:7
相关论文
共 20 条
  • [11] UA-LLM: ADVANCING CONTEXT-BASED QUESTION ANSWERING IN UKRAINIAN THROUGH LARGE LANGUAGE MODELS
    Syromiatnikov, M., V
    Ruvinskaya, V. M.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2024, (01) : 147 - 160
  • [12] Generation of Scientific Literature Surveys Based on Large Language Models (LLM) and Multi-Agent Systems (MAS)
    Qi, Ruihua
    Li, Weilong
    Lyu, Haobo
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT V, NLPCC 2024, 2025, 15363 : 169 - 180
  • [13] Can Large Language Models Be Good Companions? An LLM-Based Eyewear System with Conversational Common Ground
    Xu, Zhenyu
    Xu, Hailin
    Lu, Zhouyang
    Zhao, Yingying
    Zhu, Rui
    Wang, Yujiang
    Dong, Mingzhi
    Chang, Yuhu
    Lv, Qin
    Dick, Robert P.
    Yang, Fan
    Lu, Tun
    Gu, Ning
    Shang, Li
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2024, 8 (02):
  • [14] Framework for Integrating Large Language Models with a Robotic Health Attendant for Adaptive Task Execution in Patient Care
    Kim, Kyungki
    Windle, John
    Christian, Melissa
    Windle, Tom
    Ryherd, Erica
    Huang, Pei-Chi
    Robinson, Anthony
    Chapman, Reid
    APPLIED SCIENCES-BASEL, 2024, 14 (21):
  • [15] Development of an Adaptive User Support System Based on Multimodal Large Language Models
    Wang, Wei
    Li, Lin
    Wickramathilaka, Shavindra
    Grundy, John
    Khalajzadeh, Hourieh
    Obie, Humphrey O.
    Madugalla, Anuradha
    2024 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING, VL/HCC 2024, 2024, : 344 - 347
  • [16] Model based self adaptive behavior language for large scale real time embedded systems
    Shetty, S
    Neema, S
    Bapty, T
    11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOP ON THE ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 2004, : 478 - 483
  • [17] Optimizing Agent Behavior in the MiniGrid Environment Using Reinforcement Learning Based on Large Language Models
    Park, Byeong-Ju
    Yong, Sung-Jung
    Hwang, Hyun-Seo
    Moon, Il-Young
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [18] Efficient subspace methods-based algorithms for performing sensitivity, uncertainty, and adaptive simulation of large-scale computational models
    Abdel-Khalik, Hany S.
    Turinsky, Paul J.
    Jessee, Matthew A.
    NUCLEAR SCIENCE AND ENGINEERING, 2008, 159 (03) : 256 - 272
  • [19] Performance Evaluation of Multimodal Large Language Models (LLaVA and GPT-4-based ChatGPT) in Medical Image Classification Tasks
    Guo, Yuhang
    Wan, Zhiyu
    2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 541 - 543
  • [20] AdaShield: Safeguarding Multimodal Large Language Models from Structure-Based Attack via Adaptive Shield Prompting
    Wang, Yu
    Liu, Xiaogeng
    Li, Yu
    Chen, Muhao
    Xiao, Chaowei
    COMPUTER VISION - ECCV 2024, PT XX, 2025, 15078 : 77 - 94