Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation

被引:2
|
作者
Zhou, Yan [1 ]
Guo, Jie [1 ]
Sun, Hao [1 ]
Song, Bin [1 ]
Yu, Fei Richard [2 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian, Shaanxi, Peoples R China
[2] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature graph; Attention; Multi-step fusion; Multimodal recommendation;
D O I
10.1145/3539618.3591950
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main idea of multimodal recommendation is the rational utilization of the item's multimodal information to improve the recommendation performance. Previous works directly integrate item multimodal features with item ID embeddings, ignoring the inherent semantic relations contained in the multimodal features. In this paper, we propose a novel and effective aTtention-guided Multi-step FUsion Network for multimodal recommendation, named TMFUN. Specifically, our model first constructs modality feature graph and item feature graph to model the latent item-item semantic structures. Then, we use the attention module to identify inherent connections between user-item interaction data and multimodal data, evaluate the impact of multimodal data on different interactions, and achieve early-step fusion of item features. Furthermore, our model optimizes item representation through the attention-guided multi-step fusion strategy and contrastive learning to improve recommendation performance. The extensive experiments on three real-world datasets show that our model has superior performance compared to the state-of-the-art models.
引用
收藏
页码:1816 / 1820
页数:5
相关论文
共 50 条
  • [1] Object Detection by Attention-Guided Feature Fusion Network
    Shi, Yuxuan
    Fan, Yue
    Xu, Siqi
    Gao, Yue
    Gao, Ran
    [J]. SYMMETRY-BASEL, 2022, 14 (05):
  • [2] An attention-guided multi-scale fusion network for surgical instrument segmentation
    Song, Mengqiu
    Zhai, Chenxu
    Yang, Lei
    Liu, Yanhong
    Bian, Guibin
    [J]. Biomedical Signal Processing and Control, 2025, 102
  • [3] Multi-Step Regression Network With Attention Fusion for Airport Delay Prediction
    Wei, Zhenchun
    Zhu, Siwei
    Lyu, Zengwei
    Qiao, Yan
    Yuan, Xiaohui
    Zhao, Yang
    Zhang, Hao
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 7093 - 7105
  • [4] Siamese Progressive Attention-Guided Fusion Network for Object Tracking
    Fan Y.
    Song X.
    [J]. Song, Xiaoning (x.song@jiangnan.edu.cn), 1600, Institute of Computing Technology (33): : 199 - 206
  • [5] Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation
    Yao, Fengqin
    Wang, Shengke
    Ding, Laihui
    Zhong, Guoqiang
    Li, Shu
    Xu, Zhiwei
    [J]. COGNITIVE COMPUTATION, 2024, 16 (01) : 366 - 376
  • [6] AMFNet: An attention-guided generative adversarial network for multi-model image fusion
    Wang, Jing
    Yu, Long
    Tian, Shengwei
    Wu, Weidong
    Zhang, Dezhi
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [7] Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation
    Fengqin Yao
    Shengke Wang
    Laihui Ding
    Guoqiang Zhong
    Shu Li
    Zhiwei Xu
    [J]. Cognitive Computation, 2024, 16 : 366 - 376
  • [8] Attention-guided graph convolutional network for multi-behavior recommendation
    Peng, Xingchen
    Sun, Jing
    Yan, Mingshi
    Sun, Fuming
    Wang, Fasheng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [9] Attention-guided multi-granularity fusion model for video summarization
    Zhang, Yunzuo
    Liu, Yameng
    Wu, Cunyu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [10] Multi-Level Fusion and Attention-Guided CNN for Image Dehazing
    Zhang, Xiaoqin
    Wang, Tao
    Luo, Wenhan
    Huang, Pengcheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (11) : 4162 - 4173