MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video

Cited by: 290
Authors
Wei, Yinwei [1]
Wang, Xiang [2]
Nie, Liqiang [1]
He, Xiangnan [3]
Hong, Richang [4]
Chua, Tat-Seng [2]
Affiliations
[1] Shandong Univ, Jinan, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Univ Sci & Technol China, Hefei, Peoples R China
[4] Hefei Univ Technol, Hefei, Peoples R China
Funding
National Natural Science Foundation of China; National Research Foundation, Singapore;
Keywords
Graph Convolution Network; Multi-modal Recommendation; Micro-video Understanding;
DOI
10.1145/3343031.3351034
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Personalized recommendation plays a central role in many online content sharing platforms. To provide a quality micro-video recommendation service, it is crucial to consider the interactions between users and items (i.e., micro-videos) as well as the item contents from various modalities (e.g., visual, acoustic, and textual). Existing work on multimedia recommendation largely exploits multi-modal contents to enrich item representations, while less effort has been made to leverage the information exchange between users and items to enhance user representations and capture users' fine-grained preferences on different modalities. In this paper, we propose to exploit user-item interactions to guide representation learning in each modality and thereby improve personalized micro-video recommendation. We design a Multi-modal Graph Convolution Network (MMGCN) framework built upon the message-passing idea of graph neural networks, which can yield modal-specific representations of users and micro-videos to better capture user preferences. Specifically, we construct a user-item bipartite graph in each modality, and enrich the representation of each node with the topological structure and features of its neighbors. Through extensive experiments on three publicly available datasets, Tiktok, Kwai, and MovieLens, we demonstrate that our proposed model significantly outperforms state-of-the-art multi-modal recommendation methods.
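The per-modality message passing described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes plain mean aggregation with no learned weights, a single propagation round, zero-initialized user vectors, and a summed dot-product score across modalities. All names (`propagate`, `mean_vec`, `score`) and the toy data are hypothetical.

```python
from collections import defaultdict

def mean_vec(vectors, dim):
    """Element-wise mean of a list of vectors; zero vector if the list is empty."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[k] for v in vectors) / len(vectors) for k in range(dim)]

def propagate(interactions, user_feats, item_feats, dim):
    """One round of mean-neighbor aggregation on a user-item bipartite graph.

    Assumption: new vector = own vector + mean of neighbor vectors.
    A real model would apply learned transformations here.
    """
    user_nbrs, item_nbrs = defaultdict(list), defaultdict(list)
    for u, i in interactions:
        user_nbrs[u].append(i)
        item_nbrs[i].append(u)
    new_users = {}
    for u, vec in user_feats.items():
        agg = mean_vec([item_feats[i] for i in user_nbrs[u]], dim)
        new_users[u] = [a + b for a, b in zip(vec, agg)]
    new_items = {}
    for i, vec in item_feats.items():
        agg = mean_vec([user_feats[u] for u in item_nbrs[i]], dim)
        new_items[i] = [a + b for a, b in zip(vec, agg)]
    return new_users, new_items

# Toy data (hypothetical): two users, two micro-videos, two modalities.
DIM = 2
interactions = [("u1", "v1"), ("u1", "v2"), ("u2", "v2")]
modal_item_feats = {
    "visual": {"v1": [1.0, 0.0], "v2": [0.0, 1.0]},
    "textual": {"v1": [2.0, 2.0], "v2": [0.0, 4.0]},
}

# The same interaction graph is reused in every modality; only features differ.
users, items = {}, {}
for modality, feats in modal_item_feats.items():
    zero_users = {"u1": [0.0] * DIM, "u2": [0.0] * DIM}
    users[modality], items[modality] = propagate(interactions, zero_users, feats, DIM)

def score(user, item):
    """Sum of per-modality dot products between user and item vectors."""
    return sum(
        sum(a * b for a, b in zip(users[m][user], items[m][item]))
        for m in modal_item_feats
    )

print(users["visual"]["u1"], users["textual"]["u1"], score("u1", "v1"))
```

In this sketch each user ends up with one vector per modality, which is the sense in which the representations are modal-specific: `u1` leans differently in the visual and textual spaces because the neighboring items carry different features there.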
Pages: 1437-1445
Page count: 9
Related papers
50 in total
  • [1] Adaptive Anti-Bottleneck Multi-Modal Graph Learning Network for Personalized Micro-video Recommendation
    Cai, Desheng
    Qian, Shengsheng
    Fang, Quan
    Hu, Jun
    Xu, Changsheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [2] Multi-modal Graph Contrastive Learning for Micro-video Recommendation
    Yi, Zixuan
    Wang, Xi
    Ounis, Iadh
    Macdonald, Craig
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1807 - 1811
  • [3] Personalized Micro-video Recommendation Based on Multi-modal Features and User Interest Evolution
    Jin, Yingying
    Xu, Juan
    He, Xin
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 607 - 618
  • [4] Multi-modal information augmented model for micro-video recommendation
    Huo Y.
    Jin B.
    Liao Z.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (06): : 1142 - 1152
  • [5] Heterogeneous Graph Contrastive Learning Network for Personalized Micro-Video Recommendation
    Cai, Desheng
    Qian, Shengsheng
    Fang, Quan
    Hu, Jun
    Ding, Wenkui
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2761 - 2773
  • [6] Multi-Aggregator Time-Warping Heterogeneous Graph Neural Network for Personalized Micro-Video Recommendation
    Han, Jinkun
    Li, Wei
    Cai, Zhipeng
    Li, Yingshu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 676 - 685
  • [7] Heterogeneous Hierarchical Feature Aggregation Network for Personalized Micro-Video Recommendation
    Cai, Desheng
    Qian, Shengsheng
    Fang, Quan
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 805 - 818
  • [8] Predicting Micro-video Popularity via Multi-modal Retrieval Augmentation
    Zhong, Ting
    Lang, Jian
    Zhang, Yifan
    Cheng, Zhangtao
    Zhang, Kunpeng
    Zhou, Fan
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2579 - 2583
  • [9] User-Video Co-Attention Network for Personalized Micro-video Recommendation
    Liu, Shang
    Chen, Zhenzhong
    Liu, Hongyi
    Hu, Xinghai
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3020 - 3026
  • [10] Cross-View Sample-Enriched Graph Contrastive Learning Network for Personalized Micro-video Recommendation
    He, Ying
    Wu, Gongqing
    Cai, Desheng
    Hu, Xuegang
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 48 - 56