Representation and Fusion Based on Knowledge Graph in Multi-Modal Semantic Communication

被引：0

作者：

Xing, Chenlin ^{[1
]}

Lv, Jie ^{[1
]}

Luo, Tao ^{[1
]}

Zhang, Zhilong ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China

来源：

IEEE WIRELESS COMMUNICATIONS LETTERS | 2024年 / 13卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Semantics; Correlation; Feature extraction; Knowledge graphs; Cognition; Head; Data mining; Semantic communication; multi-modal fusion; knowledge graph;

D O I：

10.1109/LWC.2024.3369864

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The existing research on multi-modal semantic communication ignores the exploration of reasoning correlation among multi-modal data. Motivated by this, a multi-modal semantic representation and fusion model based on knowledge graph (KG-MSF) is proposed in this letter. In KG-MSF, the direct and reasoning correlation semantic information is extracted and mapped into a two-layer semantic architecture to represent the semantics of each modal fully. After that, the knowledge graph with structural advantage is utilized to fuse multi-modal semantic information, which is transmitted under different channel conditions. To assess the efficacy of semantic representation and fusion of the proposed KG-MSF in the multi-modal semantic communication system, we conduct comprehensive experiments on the task of visual question answer (VQA) with a metric of answer accuracy. The results demonstrate the superiority compared with existing models for multi-modal semantic representation, fusion, transmission efficiency and channel robustness.

引用

页码：1344 / 1348

页数：5

共 50 条

[31] Multi-hop neighbor fusion enhanced hierarchical transformer for multi-modal knowledge graph completion
Wang, Yunpeng
Ning, Bo
Wang, Xin
Li, Guanyu
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (05):
[32] Multi-modal Graph Convolutional Network for Knowledge Graph Entity Alignment
You, Yinghui
Wei, Yuyang
Zhang, Yanlong
Chen, Wei
Zhao, Lei
WEB AND BIG DATA, PT I, APWEB-WAIM 2023, 2024, 14331 : 142 - 157
[33] Propagation Graph Fusion for Multi-Modal Medical Content-Based Retrieval
Liu, Sidong
Liu, Siqi
Pujol, Sonia
Kikinis, Ron
Feng, Dagan
Cai, Weidong
2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 849 - 854
[34] Audio-Visual Scene Classification Based on Multi-modal Graph Fusion
Lei, Han
Chen, Ning
INTERSPEECH 2022, 2022, : 4157 - 4161
[35] Multi-Modal Medical Image Fusion With Geometric Algebra Based Sparse Representation
Li, Yanping
Fang, Nian
Wang, Haiquan
Wang, Rui
FRONTIERS IN GENETICS, 2022, 13
[36] Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering
Xia, Wei
Wang, Tianxiu
Gao, Quanxue
Yang, Ming
Gao, Xinbo
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1170 - 1183
[37] A multi-modal image fusion framework based on guided filter and sparse representation
Zhang, Shuai
Huang, Fuyu
Liu, Bingqi
Li, Gang
Chen, Yichao
Chen, Yudan
Zhou, Bing
Wu, Dongsheng
OPTICS AND LASERS IN ENGINEERING, 2021, 137
[38] Combining Knowledge and Multi-modal Fusion for Meme Classification
Zhong, Qi
Wang, Qian
Liu, Ji
MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 599 - 611
[39] Application of Multi-modal Fusion Attention Mechanism in Semantic Segmentation
Liu, Yunlong
Yoshie, Osamu
Watanabe, Hiroshi
COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 378 - 397
[40] Rice Fertilization Period Discrimination Method Based on Multi-modal Knowledge Graph
Yuan, Licun
Zhou, Jun
Ge, Weixi
Zheng, Pengyuan
Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2024, 55 (09): : 163 - 173

← 1 2 3 4 5 →