A Survey of Robot Intelligence with Large Language Models

Authors
Jeong, Hyeongyo [1]
Lee, Haechan [1]
Kim, Changwon [2]
Shin, Sungtae [1]
Affiliations
[1] Department of Mechanical Engineering, Dong-A University, Busan 49315, Republic of Korea
[2] School of Mechanical Engineering, Pukyong National University, Busan 48513, Republic of Korea
Source
Applied Sciences (Switzerland) | 2024 / Vol. 14 / No. 19
Funding
National Research Foundation, Singapore
Keywords
Adversarial machine learning - Microrobots - Natural language processing systems - Problem oriented languages - Reinforcement learning - Robot learning - Robot vision - Supervised learning - Visual languages
DOI
10.3390/app14198868
Abstract
Since the emergence of ChatGPT, research on large language models (LLMs) has progressed actively across many fields. LLMs, pre-trained on vast text datasets, have shown exceptional abilities in understanding natural language and planning tasks, which makes them promising for robotics. Traditional robot intelligence systems based on supervised learning generally adapt poorly to dynamically changing environments. LLMs, by contrast, help a robot intelligence system generalize better in dynamic and complex real-world environments. Indeed, findings from ongoing robotics studies indicate that LLMs can significantly improve robots’ behavior planning and execution capabilities. Additionally, vision-language models (VLMs), trained on extensive visual and linguistic data for the visual question answering (VQA) problem, excel at integrating computer vision with natural language processing. VLMs can comprehend visual contexts, execute actions through natural language, and describe scenes in natural language. Several studies have explored the enhancement of robot intelligence using multimodal data, including object recognition and description by VLMs, along with the execution of language-driven commands integrated with visual information. This review paper investigates how foundation models such as LLMs and VLMs have been employed to boost robot intelligence. For clarity, the research areas are categorized into five topics: reward design in reinforcement learning, low-level control, high-level planning, manipulation, and scene understanding.
This review also summarizes studies that show how foundation models, such as the Eureka model for automating reward function design in reinforcement learning, RT-2 for integrating visual data, language, and robot actions in vision-language-action models, and AutoRT for generating feasible tasks and executing robot behavior policies via LLMs, have improved robot intelligence. © 2024 by the authors.