Multimodal Counterfactual Learning Network for Multimedia-based Recommendation

被引:8
|
作者
Li, Shuaiyang [1 ]
Guo, Dan [1 ]
Liu, Kang [1 ]
Hong, Richang [1 ]
Xue, Feng [1 ,2 ]
机构
[1] Hefei Univ Technol, Hefei, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Recommender Systems; Multimodal User Preference; Counterfactual Learning; Spurious Correlation;
D O I
10.1145/3539618.3591739
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multimedia-based recommendation (MMRec) utilizes multimodal content (images, textual descriptions, etc.) as auxiliary information on historical interactions to determine user preferences. Most MM-Rec approaches predict user interests by exploiting a large amount of multimodal contents of user-interacted items, ignoring the potential effect of multimodal content of user-uninteracted items. As a matter of fact, there is a small portion of user preference-irrelevant features in the multimodal content of user-interacted items, which may be a kind of spurious correlation with user preferences, thereby degrading the recommendation performance. In this work, we argue that the multimodal content of user-uninteracted items can be further exploited to identify and eliminate the user preferenceirrelevant portion inside user-interacted multimodal content, for example by counterfactual inference of causal theory. Going beyond multimodal user preference modeling only using interacted items, we propose a novel model called Multimodal Counterfactual Learning Network (MCLN), in which user-uninteracted items' multimodal content is additionally exploited to further purify the representation of user preference-relevant multimodal content that better matches the user's interests, yielding state-of-the-art performance. Extensive experiments are conducted to validate the effectiveness and rationality of MCLN. We release the complete codes of MCLN at https://github.com/hfutmars/MCLN.
引用
收藏
页码:1539 / 1548
页数:10
相关论文
共 50 条
  • [1] Multimodal Graph Contrastive Learning for Multimedia-Based Recommendation
    Liu, Kang
    Xue, Feng
    Guo, Dan
    Sun, Peijie
    Qian, Shengsheng
    Hong, Richang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9343 - 9355
  • [2] Multimodal Graph Causal Embedding for Multimedia-Based Recommendation
    Li, Shuaiyang
    Xue, Feng
    Liu, Kang
    Guo, Dan
    Hong, Richang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 8842 - 8858
  • [3] Multimodal Hierarchical Graph Collaborative Filtering for Multimedia-Based Recommendation
    Liu, Kang
    Xue, Feng
    Li, Shuaiyang
    Sang, Sheng
    Hong, Richang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 216 - 227
  • [4] Multimodality Invariant Learning for Multimedia-Based New Item Recommendation
    Bai, Haoyue
    Wu, Le
    Hou, Min
    Cai, Miaomiao
    He, Zhuangzhuang
    Zhou, Yuyang
    Hong, Richang
    Wang, Meng
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 677 - 686
  • [5] Network Characterization for Delivering Multimedia-based Learning in Rural Areas
    Bandung, Y.
    Erwin
    Hutabarat, Mervin T.
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN EDUCATION TECHNOLOGY, 2015, 11 : 54 - 57
  • [6] Scalable Multimodal Learning and Multimedia Recommendation
    Shen, Jialie
    Morrison, Marie
    Li, Zhu
    2023 IEEE 9TH INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING, CIC, 2023, : 121 - 124
  • [7] Multimedia-Based Learning Model for Gymnastics Skills
    Kurniawan, Ari Wibowo
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON SPORTS SCIENCES AND HEALTH 2018 (2ND ICSSH 2018), 2018, 7 : 33 - 36
  • [8] Learner Acceptance of a Multimedia-Based Learning System
    Lee, Doo Young
    Ryu, Hokyoung
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2013, 29 (06) : 419 - 437
  • [9] Impact of multimedia-based instruction on learning and retention
    Issa, RRA
    Cox, RF
    Killingsworth, CF
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 1999, 13 (04) : 281 - 290
  • [10] Special Issue on New Frontiers in Multimedia-Based and Multimodal HCI
    Melonio, Alessandra
    De Marsico, Maria
    Gena, Cristina
    Gennari, Rosella
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 12747 - 12750