Heterogeneous Fusion of Semantic and Collaborative Information for Visually-Aware Food Recommendation

被引:22
|
作者
Meng, Lei [1 ,2 ]
Feng, Fuli [2 ]
He, Xiangnan [3 ]
Gao, Xiaoyan [4 ]
Chua, Tat-Seng [2 ]
机构
[1] Shandong Univ, Jinan, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[4] Beijing Inst Technol, Beijing, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Visually-aware recommendation; Personalized visual preference; Heterogeneous multi-task learning; Dual-gating module;
D O I
10.1145/3394171.3413598
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visually-aware food recommendation recommends food items based on their visual features. Existing methods typically use the pre-extracted visual features from food classification models, which mainly encode the visual content with limited semantic information, such as the classes and ingredients. Therefore, such features may not cover the personalized visual preferences of users, termed collaborative information, e.g. users may attend to different colors and textures of food based on their preferred ingredients and cooking methods. To address this problem, this paper presents a heterogeneous multi-task learning framework, termed privileged-channel infused network (PiNet). It learns the visual features that contain both the semantic and collaborative information by training the image encoder to simultaneously fulfill the ingredient prediction and food recommendation tasks. However, the heterogeneity between the two tasks may lead to different visual information in need and different directions in model parameter optimization. To handle these challenges, PiNet first employs a dual-gating module (DGM) to enable the encoding and passing of different visual information from the image encoder to individual tasks. Secondly, PiNet adopts a two-phase training strategy and two prior knowledge incorporation methods to ensure an effective model training. Experimental results from two real-world datasets show that the visual features generated by PiNet better attend to the informative image regions, yielding superior performance.
引用
收藏
页码:3460 / 3468
页数:9
相关论文
共 47 条
  • [41] Semantic-Enhanced and Context-Aware Hybrid Collaborative Filtering for Event Recommendation in Even-Based Social Networks
    Xu, Minghua
    Liu, Shenghao
    [J]. IEEE ACCESS, 2019, 7 : 17493 - 17502
  • [42] Hierarchical User Intention-Preference for Sequential Recommendation with Relation-Aware Heterogeneous Information Network Embedding
    Yang, Fan
    Li, Gangmin
    Yue, Yong
    [J]. BIG DATA, 2022, 10 (05) : 466 - 478
  • [43] Collaborative multi-feature extraction and scale-aware semantic information mining for medical image segmentation
    Zhang, Ruijun
    He, Zixuan
    Zhu, Jian
    Yuan, Xiaochen
    Huang, Guoheng
    Pun, Chi-Man
    Peng, Jianhong
    Lin, Junzhong
    Zhou, Jian
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2022, 67 (20):
  • [44] First-order and High-order Information Fusion over Heterogeneous Information Network for Top-N Recommendation System
    Mu, Nan
    Zha, Daren
    [J]. PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 1105 - 1110
  • [45] Deep attentive multimodal learning for food information enhancement via early-stage heterogeneous fusion
    Saklani, Avantika
    Tiwari, Shailendra
    Pannu, H. S.
    [J]. VISUAL COMPUTER, 2024,
  • [46] Semantic-Structure-Aware Multi-Level Information Fusion for Robust Global Orientation Optimization of Autonomous Mobile Robots
    Xiang, Guofei
    Dian, Songyi
    Zhao, Ning
    Wang, Guodong
    [J]. SENSORS, 2023, 23 (03)
  • [47] Demand-aware mobile bike-sharing service using collaborative computing and information fusion in 5G IoT environment
    Yang, Xiaoxian
    Xu, Yueshen
    Zhou, Yishan
    Song, Shengli
    Wu, Yinchen
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (06) : 984 - 994