Representation and Fusion Based on Knowledge Graph in Multi-Modal Semantic Communication

Cited by: 0
Authors
Xing, Chenlin [1]
Lv, Jie [1]
Luo, Tao [1]
Zhang, Zhilong [1]
Affiliations
[1] Beijing University of Posts and Telecommunications, School of Information and Communication Engineering, Beijing 100876, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Semantics; Correlation; Feature extraction; Knowledge graphs; Cognition; Head; Data mining; Semantic communication; multi-modal fusion; knowledge graph;
DOI
10.1109/LWC.2024.3369864
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Existing research on multi-modal semantic communication overlooks the reasoning correlation among multi-modal data. Motivated by this, this letter proposes a multi-modal semantic representation and fusion model based on a knowledge graph (KG-MSF). In KG-MSF, direct and reasoning-correlation semantic information is extracted and mapped into a two-layer semantic architecture so that the semantics of each modality are fully represented. A knowledge graph, chosen for its structural advantages, is then used to fuse the multi-modal semantic information, which is transmitted under different channel conditions. To assess the semantic representation and fusion of KG-MSF in a multi-modal semantic communication system, we conduct comprehensive experiments on the visual question answering (VQA) task, using answer accuracy as the metric. The results demonstrate its superiority over existing models in multi-modal semantic representation, fusion, transmission efficiency, and channel robustness.
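
As a rough illustration of what fusing multi-modal semantics in a knowledge graph can look like, the toy sketch below merges (head, relation, tail) triples extracted from two modalities into one graph keyed by entity. It is not the KG-MSF model from the letter: the triples, the relation names, and the helpers `fuse` and `neighbors` are hypothetical, and the sketch covers neither the two-layer architecture nor the channel.

```python
from collections import defaultdict

# Hypothetical toy triples (head, relation, tail); not taken from the letter.
image_triples = [
    ("man", "holds", "umbrella"),
    ("umbrella", "has_color", "red"),
]
text_triples = [
    ("umbrella", "implies", "rain"),   # a reasoning-style relation
    ("man", "holds", "umbrella"),      # duplicate of an image triple
]


def fuse(*modalities):
    """Merge per-modality triple lists into one graph, dropping duplicates."""
    graph = defaultdict(set)
    for triples in modalities:
        for head, relation, tail in triples:
            graph[head].add((relation, tail))
    return graph


def neighbors(graph, entity):
    """Return the sorted outgoing (relation, tail) pairs of an entity."""
    return sorted(graph.get(entity, ()))


fused = fuse(image_triples, text_triples)
print(neighbors(fused, "umbrella"))
```

Because edges are stored in sets, a fact observed in both modalities appears only once in the fused graph, while facts unique to one modality are kept alongside it.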
Pages: 1344-1348
Page count: 5