Large Language Models Empower Multimodal Integrated Sensing and Communication

被引:0
|
作者
Cheng, Lu [1 ]
Zhang, Hongliang [2 ]
Di, Boya [1 ,2 ]
Niyato, Dusit [3 ]
Song, Lingyang [1 ,2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Shenzhen, Peoples R China
[2] Peking Univ, Sch Elect, Shenzhen, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
基金
新加坡国家研究基金会; 美国国家科学基金会; 北京市自然科学基金;
关键词
Integrated sensing and communication; Multimodal sensors; Robot sensing systems; Training; Feature extraction; Drones; Cameras; Tuning; Laser radar; Large language models;
D O I
10.1109/MCOM.004.2400281
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Integrated sensing and communication (ISAC) is considered as a key candidate technology for the sixth-generation (6G) wireless networks. Notably, an integration of multimodal sensing information within ISAC systems promises an improvement for communication performance. Nevertheless, traditional methods for ISAC systems are typically designed to handle unimodal data, making it challenging to effectively process and integrate semantically complex multimodal information. Moreover, they are usually customized for specific types of data or tasks, leading to poor generalization ability. Multimodal large language models (MLLMs), which are trained on massive multimodal datasets and possess large parameter scales, are expected to be powerful tools to address the above issues. In this article, we first introduce an MLLM-enabled ISAC system to achieve enhanced communication and sensing performance. We begin with the introduction of the fundamental principles of ISAC and MLLMs. Moreover, we present the overall system and the corresponding opportunities to be achieved. Furthermore, this article provides a case study to demonstrate the superior performance of MLLMs in handling the beam prediction task within ISAC systems. Finally, we discuss several research challenges and potential directions for future research.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Large language models empower the reliability of disassembly in remanufacturing
    Xia, Liqiao
    Pang, Jiazhen
    Li, Chengxi
    Wang, Ruoxin
    Zheng, Pai
    Manufacturing Letters, 2024, 41 : 1728 - 1733
  • [2] Large Language Models Empower the Reliability of Disassembly in Remanufacturing
    Xia, Liqiao
    Pang, Jiazhen
    Li, Chengxi
    Wang, Ruoxin
    Zheng, Pai
    MANUFACTURING LETTERS, 2024, 41 : 1728 - 1733
  • [3] Multimodal Learning for Integrated Sensing and Communication Networks
    Liu, Xiaonan
    Ratnarajah, Tharmalingam
    Sellathurai, Mathini
    Eldar, Yonina C.
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 1177 - 1181
  • [4] A survey on multimodal large language models
    Yin, Shukang
    Fu, Chaoyou
    Zhao, Sirui
    Li, Ke
    Sun, Xing
    Xu, Tong
    Chen, Enhong
    NATIONAL SCIENCE REVIEW, 2024, 11 (12)
  • [5] A survey on multimodal large language models
    Shukang Yin
    Chaoyou Fu
    Sirui Zhao
    Ke Li
    Xing Sun
    Tong Xu
    Enhong Chen
    National Science Review, 2024, 11 (12) : 277 - 296
  • [6] From Large Language Models to Large Multimodal Models: A Literature Review
    Huang, Dawei
    Yan, Chuan
    Li, Qing
    Peng, Xiaojiang
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [7] A comprehensive survey of large language models and multimodal large models in medicine
    Xiao, Hanguang
    Zhou, Feizhong
    Liu, Xingyue
    Liu, Tianqi
    Li, Zhipeng
    Liu, Xin
    Huang, Xiaoxuan
    INFORMATION FUSION, 2025, 117
  • [8] Multimodal Large Language Models in Vision and Ophthalmology
    Lu, Zhiyong
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [9] The application of multimodal large language models in medicine
    Qiu, Jianing
    Yuan, Wu
    Lam, Kyle
    LANCET REGIONAL HEALTH-WESTERN PACIFIC, 2024, 45
  • [10] Visual cognition in multimodal large language models
    Buschoff, Luca M. Schulze
    Akata, Elif
    Bethge, Matthias
    Schulz, Eric
    NATURE MACHINE INTELLIGENCE, 2025, 7 (01) : 96 - 106