Representation and Fusion Based on Knowledge Graph in Multi-Modal Semantic Communication

Cited by: 0
Authors
Xing, Chenlin [1]
Lv, Jie [1]
Luo, Tao [1]
Zhang, Zhilong [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semantics; Correlation; Feature extraction; Knowledge graphs; Cognition; Head; Data mining; Semantic communication; multi-modal fusion; knowledge graph;
DOI
10.1109/LWC.2024.3369864
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Existing research on multi-modal semantic communication overlooks the reasoning correlation among multi-modal data. Motivated by this, this letter proposes a multi-modal semantic representation and fusion model based on a knowledge graph (KG-MSF). In KG-MSF, direct and reasoning correlation semantic information is extracted and mapped into a two-layer semantic architecture to fully represent the semantics of each modality. The knowledge graph, with its structural advantage, is then used to fuse the multi-modal semantic information, which is transmitted under different channel conditions. To assess the efficacy of the semantic representation and fusion of KG-MSF in a multi-modal semantic communication system, we conduct comprehensive experiments on the visual question answering (VQA) task, with answer accuracy as the metric. The results demonstrate its superiority over existing models in multi-modal semantic representation, fusion, transmission efficiency, and channel robustness.
Pages: 1344 - 1348
Page count: 5
Related papers
50 in total
  • [21] Image-Text Association Enhanced Multi-modal Swine Disease Knowledge Graph Fusion
    Jiang, Tingting
    Xu, Ao
    Wu, Feifei
    Yang, Shuai
    He, Jin
    Gu, Lichuan
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 56 (01) : 56 - 64
  • [22] Dynamic Graph Neural Representation Based Multi-modal Fusion Model for Cognitive Outcome Prediction in Stroke Cases
    Liu, Shuting
    Zhang, Baochang
    Fang, Rong
    Rueckert, Daniel
    Zimmer, Veronika A.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 338 - 347
  • [23] Multi-modal entity alignment based on joint knowledge representation learning
    Wang, Hui-Yong
    Lun, Bing
    Zhang, Xiao-Ming
    Sun, Xiao-Ling
    Kongzhi yu Juece/Control and Decision, 2021, 35 (12) : 2855 - 2864
  • [24] DCRL-KG: Distributed Multi-Modal Knowledge Graph Retrieval Platform Based on Collaborative Representation Learning
    Li, Leilei
    Fu, Yansheng
    Zhu, Dongjie
    Li, Xiaofang
    Sun, Yundong
    Ding, Jianrui
    Wu, Mingrui
    Cao, Ning
    Higgs, Russell
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03) : 3295 - 3307
  • [25] MMEA: Entity Alignment for Multi-modal Knowledge Graph
    Chen, Liyi
    Li, Zhi
    Wang, Yijun
    Xu, Tong
    Wang, Zhefeng
    Chen, Enhong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT I, 2020, 12274 : 134 - 147
  • [26] NativE: Multi-modal Knowledge Graph Completion in the Wild
    Zhang, Yichi
    Chen, Zhuo
    Guo, Lingbing
    Xu, Yajing
    Hu, Binbin
    Liu, Ziqi
    Zhang, Wen
    Chen, Huajun
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024 : 91 - 101
  • [27] Multi-Modal Knowledge Graph Construction and Application: A Survey
    Zhu, Xiangru
    Li, Zhixu
    Wang, Xiaodan
    Jiang, Xueyao
    Sun, Penglei
    Wang, Xuwu
    Xiao, Yanghua
    Yuan, Nicholas Jing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 715 - 735
  • [28] Enhancing Recommender System with Multi-modal Knowledge Graph
    Sun, Chengjie
    Chen, Weiwei
    Lin, Lei
    Shan, Lili
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 395 - 407
  • [29] MERGE: A Modal Equilibrium Relational Graph Framework for Multi-Modal Knowledge Graph Completion
    Shang, Yuying
    Fu, Kun
    Zhang, Zequn
    Jin, Li
    Liu, Zinan
    Wang, Shensi
    Li, Shuchao
    Sensors, 2024, 24 (23)
  • [30] Attention-Based Multi-Modal Fusion Network for Semantic Scene Completion
    Li, Siqi
    Zou, Changqing
    Li, Yipeng
    Zhao, Xibin
    Gao, Yue
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11402 - 11409