A hybrid similarity model for mitigating the cold-start problem of collaborative filtering in sparse data

Authors
Guan, Jiewen [1, 2]
Chen, Bilian [1, 2]
Yu, Shenbao [3 ]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[2] Xiamen Key Lab Big Data Intelligent Anal & Decis M, Xiamen 361005, Peoples R China
[3] Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou 350108, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Collaborative filtering; Similarity; Wasserstein distance; Cold-start problem; Sparse data; RECOMMENDER SYSTEMS; WASSERSTEIN DISTANCE;
DOI
10.1016/j.eswa.2024.123700
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Similarity is a vital component of neighborhood-based collaborative filtering (CF). To improve recommendation quality, many similarity methods have been proposed and analyzed in recent decades. However, nearly all traditional similarity methods and many advanced ones use only the corated items among users to compute their similarity, which provides limited information in cold-start/sparse scenarios and yields misleading results. Although a few advanced hybrid similarity models consider items beyond corated items, which partly mitigates this limitation, they still have drawbacks, such as failing to penalize noncorated items, which have many disadvantages. In this paper, we explore a new robust hybrid similarity model, the Wasserstein distance-based CF (WCF) model, for mitigating the cold-start problem of CF in sparse data. Specifically, we measure item similarity via the Wasserstein distance, which helps circumvent the drawbacks of the Bhattacharyya coefficient and KL divergence used in the literature and is thus more robust in cold-start/sparse scenarios. We further design a new multiplicative user similarity formula that treats all noncorated items as a whole, prioritizing the importance of corated items and weakening the negative effects of noncorated items, which also plays an important role in cold-start/sparse scenarios. In addition, we propose two novel heuristic similarity factors that supplement the model by weakening the negative effects of popular users and items. We conduct extensive experiments on five real-world benchmark recommendation datasets to test WCF. The experimental results show the superiority of WCF over existing similarity methods in cold-start/sparse scenarios.
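The abstract's point about measuring item similarity with the Wasserstein distance rather than the Bhattacharyya coefficient can be illustrated with a small sketch. This is not the WCF formula from the paper. It assumes a 1 to 5 integer rating scale, represents each item by its normalized rating histogram, and uses a hypothetical linear mapping from distance to similarity (`item_similarity`). All function names here are illustrative.

```python
import numpy as np

# Assumption: ratings are integers on a 1..5 scale.
RATING_SCALE = np.arange(1, 6)

def rating_histogram(ratings):
    """Normalized histogram of a non-empty list of ratings on the assumed scale."""
    ratings = np.asarray(ratings)
    counts = np.array([np.sum(ratings == r) for r in RATING_SCALE], dtype=float)
    return counts / counts.sum()

def wasserstein_1d(p, q):
    """1-Wasserstein distance between two histograms on the same unit-spaced support.

    In one dimension this equals the summed absolute difference of the CDFs.
    """
    return float(np.sum(np.abs(np.cumsum(p) - np.cumsum(q))))

def bhattacharyya(p, q):
    """Bhattacharyya coefficient: overlap between two histograms."""
    return float(np.sum(np.sqrt(p * q)))

def item_similarity(p, q):
    """Hypothetical distance-to-similarity mapping into [0, 1] (not from the paper)."""
    max_dist = float(RATING_SCALE[-1] - RATING_SCALE[0])
    return 1.0 - wasserstein_1d(p, q) / max_dist

# Three sparsely rated items whose rating histograms do not overlap.
item_a = rating_histogram([1, 1, 1])
item_b = rating_histogram([2, 2, 2])
item_c = rating_histogram([5, 5, 5])

print(bhattacharyya(item_a, item_b), bhattacharyya(item_a, item_c))
print(item_similarity(item_a, item_b), item_similarity(item_a, item_c))
```

With non-overlapping histograms, which are common when items have few ratings, the Bhattacharyya coefficient is zero for both pairs, while the Wasserstein-based score still separates the near pair (a, b) from the far pair (a, c).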
Pages: 13
Related papers
50 in total
  • [41] Mitigating Data Sparsity Using Similarity Reinforcement-Enhanced Collaborative Filtering
    Hu, Yan
    Shi, Weisong
    Li, Hong
    Hu, Xiaohui
    [J]. ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2017, 17 (03)
  • [42] Improved genome-scale multi-target virtual screening via a novel collaborative filtering approach to cold-start problem
    Lim, Hansaim
    Gray, Paul
    Xie, Lei
    Poleksic, Aleksandar
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [44] AR-CF: Augmenting Virtual Users and Items in Collaborative Filtering for Addressing Cold-Start Problems
    Chae, Dong-Kyu
    Kim, Jihoo
    Chau, Duen Horng
    Kim, Sang-Wook
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1251 - 1260
  • [45] Merging trust in collaborative filtering to alleviate data sparsity and cold start
    Guo, Guibing
    Zhang, Jie
    Thalmann, Daniel
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 57 : 57 - 68
  • [46] Budget-Constrained Item Cold-Start Handling in Collaborative Filtering Recommenders via Optimal Design
    Anava, Oren
    Golan, Shahar
    Golbandi, Nadav
    Karnin, Zohar
    Lempel, Ronny
    Rokhlenko, Oleg
    Somekh, Oren
    [J]. PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 45 - 54
  • [47] Mitigating Cold-Start Delay using Warm-Start Containers in Serverless Platform
    Kumari, Anisha
    Sahoo, Bibhudatta
    Behera, Ranjan Kumar
    [J]. 2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [48] Cold-Start User-Based Weighted Collaborative Filtering for an Implicit Recommender System for Research Facilities
    Kale, Yogesh
    Petrie, Samantha E.
    Bikdash, Marwan
    Topal, Michael D.
    [J]. 2018 4TH IEEE INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC 2018), 2018, : 466 - 471
  • [49] Alleviating New User Cold-Start in User-Based Collaborative Filtering via Bipartite Network
    Zhang, Zhipeng
    Dong, Mianxiong
    Ota, Kaoru
    Kudo, Yasuo
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (03): 672 - 685
  • [50] A Movie Cold-Start Recommendation Method Optimized Similarity Measure
    Yi, Peng
    Yang, Chen
    Zhou, Xiaoming
    Li, Chen
    [J]. 2016 16TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2016, : 231 - 234