Invariant Representation Learning for Multimedia Recommendation

被引:23
|
作者
Du, Xiaoyu [1 ]
Wu, Zike [2 ]
Feng, Fuli [3 ]
He, Xiangnan [3 ]
Tang, Jinhui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
[2] South China Univ Technol, Guangzhou, Peoples R China
[3] Univ Sci & Technol China, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimedia Recommendation; Multimedia Representation Learning; Invariant Learning; Spurious Correlation;
D O I
10.1145/3503161.3548405
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multimedia recommendation forms a personalized ranking task with multimedia content representations which are mostly extracted via generic encoders. However, the generic representations introduce spurious correlations - the meaningless correlation from the recommendation perspective. For example, suppose a user bought two dresses on the same model, this co-occurrence would produce a correlation between the model and purchases, but the correlation is spurious from the view of fashion recommendation. Existing work alleviates this issue by customizing preference-aware representations, requiring high-cost analysis and design. In this paper, we propose an Invariant Representation Learning Framework (InvRL) to alleviate the impact of the spurious correlations. We utilize environments to reflect the spurious correlations and determine each environment with a set of interactions. We then learn invariant representations - the inherent factors attracting user attention - to make a consistent prediction of user-item interaction across various environments. In this light, InvRL proposes two iteratively executed modules to cluster user-item interactions and learn invariant representations. With them, InvRL trains a final recommender model thus mitigating the spurious correlations. We demonstrate InvRL on a cutting-edge recommender model UltraGCN and conduct extensive experiments on three public multimedia recommendation datasets, Movielens, Tiktok, and Kwai. The experimental results validate the rationality and effectiveness of InvRL. Codes are released at https://github.com/nickwzk/InvRL.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Learning discriminative and invariant representation for fingerprint retrieval
    Dehua SONG
    Ruilin LI
    Fandong ZHANG
    Jufu FENG
    Science China(Information Sciences), 2019, 62 (01) : 220 - 222
  • [42] Fundamental Limits and Tradeoffs in Invariant Representation Learning
    Zhao, Han
    Dan, Chen
    Aragam, Bryon
    Jaakkola, Tommi S.
    Gordon, Geoffrey J.
    Ravikumar, Pradeep
    Journal of Machine Learning Research, 2022, 23
  • [43] Learning discriminative and invariant representation for fingerprint retrieval
    Song, Dehua
    Li, Ruilin
    Zhang, Fandong
    Feng, Jufu
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (01)
  • [44] LEARNING A TEMPORALLY INVARIANT REPRESENTATION FOR VISUAL TRACKING
    Ma, Chao
    Yang, Xiaokang
    Zhang, Chongyang
    Yang, Ming-Hsuan
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 857 - 861
  • [45] Learning discriminative and invariant representation for fingerprint retrieval
    Dehua Song
    Ruilin Li
    Fandong Zhang
    Jufu Feng
    Science China Information Sciences, 2019, 62
  • [46] Fundamental Limits and Tradeoffs in Invariant Representation Learning
    Zhao, Han
    Dan, Chen
    Aragam, Bryon
    Jaakkola, Tommi S.
    Gordon, Geoffrey J.
    Ravikumar, Pradeep
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [47] Deep learning based hashtag recommendation system for multimedia data
    Djenouri, Youcef
    Belhadi, Asma
    Srivastava, Gautam
    Lin, Jerry Chun -Wei
    INFORMATION SCIENCES, 2022, 609 : 1506 - 1517
  • [48] Multimodal Graph Contrastive Learning for Multimedia-Based Recommendation
    Liu, Kang
    Xue, Feng
    Guo, Dan
    Sun, Peijie
    Qian, Shengsheng
    Hong, Richang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9343 - 9355
  • [49] Multimodal Counterfactual Learning Network for Multimedia-based Recommendation
    Li, Shuaiyang
    Guo, Dan
    Liu, Kang
    Hong, Richang
    Xue, Feng
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1539 - 1548
  • [50] Interpretable video tag recommendation with multimedia deep learning framework
    Yang, Zekun
    Lin, Zhijie
    INTERNET RESEARCH, 2022, 32 (02) : 518 - 535