Representation and Fusion Based on Knowledge Graph in Multi-Modal Semantic Communication

被引:0
|
作者
Xing, Chenlin [1 ]
Lv, Jie [1 ]
Luo, Tao [1 ]
Zhang, Zhilong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Correlation; Feature extraction; Knowledge graphs; Cognition; Head; Data mining; Semantic communication; multi-modal fusion; knowledge graph;
D O I
10.1109/LWC.2024.3369864
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing research on multi-modal semantic communication ignores the exploration of reasoning correlation among multi-modal data. Motivated by this, a multi-modal semantic representation and fusion model based on knowledge graph (KG-MSF) is proposed in this letter. In KG-MSF, the direct and reasoning correlation semantic information is extracted and mapped into a two-layer semantic architecture to represent the semantics of each modal fully. After that, the knowledge graph with structural advantage is utilized to fuse multi-modal semantic information, which is transmitted under different channel conditions. To assess the efficacy of semantic representation and fusion of the proposed KG-MSF in the multi-modal semantic communication system, we conduct comprehensive experiments on the task of visual question answer (VQA) with a metric of answer accuracy. The results demonstrate the superiority compared with existing models for multi-modal semantic representation, fusion, transmission efficiency and channel robustness.
引用
收藏
页码:1344 / 1348
页数:5
相关论文
共 50 条
  • [31] Multi-hop neighbor fusion enhanced hierarchical transformer for multi-modal knowledge graph completion
    Wang, Yunpeng
    Ning, Bo
    Wang, Xin
    Li, Guanyu
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (05):
  • [32] Multi-modal Graph Convolutional Network for Knowledge Graph Entity Alignment
    You, Yinghui
    Wei, Yuyang
    Zhang, Yanlong
    Chen, Wei
    Zhao, Lei
    WEB AND BIG DATA, PT I, APWEB-WAIM 2023, 2024, 14331 : 142 - 157
  • [33] Propagation Graph Fusion for Multi-Modal Medical Content-Based Retrieval
    Liu, Sidong
    Liu, Siqi
    Pujol, Sonia
    Kikinis, Ron
    Feng, Dagan
    Cai, Weidong
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 849 - 854
  • [34] Audio-Visual Scene Classification Based on Multi-modal Graph Fusion
    Lei, Han
    Chen, Ning
    INTERSPEECH 2022, 2022, : 4157 - 4161
  • [35] Multi-Modal Medical Image Fusion With Geometric Algebra Based Sparse Representation
    Li, Yanping
    Fang, Nian
    Wang, Haiquan
    Wang, Rui
    FRONTIERS IN GENETICS, 2022, 13
  • [36] Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering
    Xia, Wei
    Wang, Tianxiu
    Gao, Quanxue
    Yang, Ming
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1170 - 1183
  • [37] A multi-modal image fusion framework based on guided filter and sparse representation
    Zhang, Shuai
    Huang, Fuyu
    Liu, Bingqi
    Li, Gang
    Chen, Yichao
    Chen, Yudan
    Zhou, Bing
    Wu, Dongsheng
    OPTICS AND LASERS IN ENGINEERING, 2021, 137
  • [38] Combining Knowledge and Multi-modal Fusion for Meme Classification
    Zhong, Qi
    Wang, Qian
    Liu, Ji
    MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 599 - 611
  • [39] Application of Multi-modal Fusion Attention Mechanism in Semantic Segmentation
    Liu, Yunlong
    Yoshie, Osamu
    Watanabe, Hiroshi
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 378 - 397
  • [40] Rice Fertilization Period Discrimination Method Based on Multi-modal Knowledge Graph
    Yuan, Licun
    Zhou, Jun
    Ge, Weixi
    Zheng, Pengyuan
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2024, 55 (09): : 163 - 173