MLSFF: Multi-level structural features fusion for multi-modal knowledge graph completion

被引:1
|
作者
Zhai, Hanming [1 ]
Lv, Xiaojun [2 ]
Hou, Zhiwen [1 ]
Tong, Xin [1 ]
Bu, Fanliang [1 ]
机构
[1] Peoples Publ Secur Univ China, Sch Informat Network Secur, Beijing 100038, Peoples R China
[2] China Acad Railway Sci Corp Ltd, Inst Comp Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
knowledge graph completion; multi-modal knowledge graph; link prediction; multi-modal feature fusion; graph neural network; transformer;
D O I
10.3934/mbe.2023630
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the rise of multi-modal methods, multi-modal knowledge graphs have become a better choice for storing human knowledge. However, knowledge graphs often suffer from the problem of incompleteness due to the infinite and constantly updating nature of knowledge, and thus the task of knowledge graph completion has been proposed. Existing multi-modal knowledge graph completion methods mostly rely on either embedding-based representations or graph neural networks, and there is still room for improvement in terms of interpretability and the ability to handle multi-hop tasks. Therefore, we propose a new method for multi-modal knowledge graph completion. Our method aims to learn multi-level graph structural features to fully explore hidden relationships within the knowledge graph and to improve reasoning accuracy. Specifically, we first use a Transformer architecture to separately learn about data representations for both the image and text modalities. Then, with the help of multimodal gating units, we filter out irrelevant information and perform feature fusion to obtain a unified encoding of knowledge representations. Furthermore, we extract multi-level path features using a width-adjustable sliding window and learn about structural feature information in the knowledge graph using graph convolutional operations. Finally, we use a scoring function to evaluate the probability of the truthfulness of encoded triplets and to complete the prediction task. To demonstrate the effectiveness of the model, we conduct experiments on two publicly available datasets, FB15K-237-IMG and WN18-IMG, and achieve improvements of 1.8 and 0.7%, respectively, in the Hits@1 metric.
引用
收藏
页码:14096 / 14116
页数:21
相关论文
共 50 条
  • [31] Multi-level Deep Correlative Networks for Multi-modal Sentiment Analysis
    CAI Guoyong
    LYU Guangrui
    LIN Yuming
    WEN Yimin
    [J]. Chinese Journal of Electronics, 2020, 29 (06) : 1025 - 1038
  • [32] Enhancing Recommender System with Multi-modal Knowledge Graph
    Sun, Chengjie
    Chen, Weiwei
    Lin, Lei
    Shan, Lili
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 395 - 407
  • [33] Multi-Modal Knowledge Graph Construction and Application: A Survey
    Zhu, Xiangru
    Li, Zhixu
    Wang, Xiaodan
    Jiang, Xueyao
    Sun, Penglei
    Wang, Xuwu
    Xiao, Yanghua
    Yuan, Nicholas Jing
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 715 - 735
  • [34] Multi-dimension and multi-modal rolling mill vibration prediction model based on multi-level network fusion
    Chen, Shu-zong
    Liu, Yun-xiao
    Wang, Yun-long
    Qian, Cheng
    Hua, Chang-chun
    Sun, Jie
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2024,
  • [35] Unsupervised domain adaptation multi-level adversarial network for semantic segmentation based on multi-modal features
    Wang, Zeyu
    Bu, Shuhui
    Huang, Wei
    Zheng, Yuanpan
    Wu, Qinggang
    Chang, Huawen
    Zhang, Xu
    [J]. Tongxin Xuebao/Journal on Communications, 2022, 43 (12): : 157 - 171
  • [36] DFMKE: A dual fusion multi-modal knowledge graph embedding framework for entity alignment
    Zhu, Jia
    Huang, Changqin
    De Meo, Pasquale
    [J]. INFORMATION FUSION, 2023, 90 : 111 - 119
  • [37] Image - Text Association Enhanced Multi-modal Swine Disease Knowledge Graph Fusion
    Jiang, Tingting
    Xu, Ao
    Wu, Feifei
    Yang, Shuai
    He, Jin
    Gu, Lichuan
    [J]. Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 56 (01): : 56 - 64
  • [38] Combining Knowledge and Multi-modal Fusion for Meme Classification
    Zhong, Qi
    Wang, Qian
    Liu, Ji
    [J]. MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 599 - 611
  • [39] Class Consistent Multi-Modal Fusion with Binary Features
    Shrivastava, Ashish
    Rastegari, Mohammad
    Shekhar, Sumit
    Chellappa, Rama
    Davis, Larry S.
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2282 - 2291
  • [40] Multi-modal Graph Convolutional Network for Knowledge Graph Entity Alignment
    You, Yinghui
    Wei, Yuyang
    Zhang, Yanlong
    Chen, Wei
    Zhao, Lei
    [J]. WEB AND BIG DATA, PT I, APWEB-WAIM 2023, 2024, 14331 : 142 - 157