Heterogeneous Fusion of Semantic and Collaborative Information for Visually-Aware Food Recommendation

被引:22
|
作者
Meng, Lei [1 ,2 ]
Feng, Fuli [2 ]
He, Xiangnan [3 ]
Gao, Xiaoyan [4 ]
Chua, Tat-Seng [2 ]
机构
[1] Shandong Univ, Jinan, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[4] Beijing Inst Technol, Beijing, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Visually-aware recommendation; Personalized visual preference; Heterogeneous multi-task learning; Dual-gating module;
D O I
10.1145/3394171.3413598
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visually-aware food recommendation recommends food items based on their visual features. Existing methods typically use the pre-extracted visual features from food classification models, which mainly encode the visual content with limited semantic information, such as the classes and ingredients. Therefore, such features may not cover the personalized visual preferences of users, termed collaborative information, e.g. users may attend to different colors and textures of food based on their preferred ingredients and cooking methods. To address this problem, this paper presents a heterogeneous multi-task learning framework, termed privileged-channel infused network (PiNet). It learns the visual features that contain both the semantic and collaborative information by training the image encoder to simultaneously fulfill the ingredient prediction and food recommendation tasks. However, the heterogeneity between the two tasks may lead to different visual information in need and different directions in model parameter optimization. To handle these challenges, PiNet first employs a dual-gating module (DGM) to enable the encoding and passing of different visual information from the image encoder to individual tasks. Secondly, PiNet adopts a two-phase training strategy and two prior knowledge incorporation methods to ensure an effective model training. Experimental results from two real-world datasets show that the visual features generated by PiNet better attend to the informative image regions, yielding superior performance.
引用
收藏
页码:3460 / 3468
页数:9
相关论文
共 47 条
  • [1] Hierarchical Attention Network for Visually-Aware Food Recommendation
    Gao, Xiaoyan
    Feng, Fuli
    He, Xiangnan
    Huang, Heyan
    Guan, Xinyu
    Feng, Chong
    Ming, Zhaoyan
    Chua, Tat-Seng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (06) : 1647 - 1659
  • [2] Visually-Aware Fashion Recommendation and Design with Generative Image Models
    Kang, Wang-Cheng
    Fang, Chen
    Wang, Zhaowen
    McAuley, Julian
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 207 - 216
  • [3] CausalRec: Causal Inference for Visual Debiasing in Visually-Aware Recommendation
    Qiu, Ruihong
    Wang, Sen
    Chen, Zhi
    Yin, Hongzhi
    Huang, Zi
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3844 - 3852
  • [4] A VISUALLY-AWARE FOOD ANALYSIS SYSTEM FOR DIET MANAGEMENT
    Hang Wu
    Xi Chen
    Li, Xuelong
    Ma, Haokai
    Zheng, Yuze
    Li, Xiangxian
    Meng, Xiangxu
    Lei Meng
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,
  • [5] Visually-aware Acoustic Event Detection using Heterogeneous Graphs
    Shirian, Amir
    Somandepalli, Krishna
    Sanchez, Victor
    Guha, Tanaya
    [J]. INTERSPEECH 2022, 2022, : 2428 - 2432
  • [6] Heterogeneous information fusion based graph collaborative filtering recommendation
    Mu, Ruihui
    Zeng, Xiaoqin
    Zhang, Jiying
    [J]. INTELLIGENT DATA ANALYSIS, 2023, 27 (06) : 1595 - 1613
  • [7] VAFA: A Visually-Aware Food Analysis System for Socially-Engaged Diet Management
    Wu, Hang
    Chen, Xi
    Li, Xuelong
    Ma, Haokai
    Zheng, Yuze
    Li, Xiangxian
    Meng, Xiangxu
    Meng, Lei
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 554 - 558
  • [8] Community answer recommendation based on heterogeneous semantic fusion
    Wu, Yongliang
    Yin, Hu
    Zhou, Qianqian
    Dong, Jiahao
    Wei, Dan
    Liu, Dongbo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [9] Collaborative Filtering Recommendation Based on Multi-domain Semantic Fusion
    Li, Xiang
    He, Jingsha
    Zhu, Nafei
    Hou, Ziqiang
    [J]. 2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 255 - 261
  • [10] Collaborative APIs recommendation for Artificial Intelligence of Things with information fusion
    Xu, Yueshen
    Wu, Yinchen
    Gao, Honghao
    Song, Shengli
    Yin, Yuyu
    Xiao, Xichu
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 471 - 479