A hybrid similarity model for mitigating the cold-start problem of collaborative filtering in sparse data

Authors
Guan, Jiewen [1, 2]
Chen, Bilian [1, 2]
Yu, Shenbao [3]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[2] Xiamen Key Lab Big Data Intelligent Anal & Decis M, Xiamen 361005, Peoples R China
[3] Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou 350108, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Collaborative filtering; Similarity; Wasserstein distance; Cold-start problem; Sparse data; Recommender systems
DOI
10.1016/j.eswa.2024.123700
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Similarity is a vital component of neighborhood-based collaborative filtering (CF). To improve recommendation quality, many similarity methods have been proposed and analyzed in recent decades. However, nearly all traditional similarity methods and many advanced ones use only the corated items among users to compute their similarity, which provides limited information in cold-start/sparse scenarios and yields misleading results. In addition, although a few advanced hybrid similarity models consider items beyond the corated ones, which can partly mitigate this limitation, they still have drawbacks, such as neglecting to penalize noncorated items, which have many disadvantages. In this paper, we explore a new robust hybrid similarity model, namely the Wasserstein distance-based CF (WCF) model, for mitigating the cold-start problem of CF in sparse data. Specifically, we measure item similarity via the Wasserstein distance, which helps circumvent the drawbacks of the Bhattacharyya coefficient and the KL divergence used in the literature, and is thus more robust in a cold-start/sparse scenario. We further design a new multiplicative user similarity formula that treats all noncorated items as a whole, so as to prioritize corated items and weaken the negative effects of noncorated items, which also plays an important role in a cold-start/sparse scenario. As supplements, we propose two novel heuristic similarity factors to weaken the negative effects of popular users and items. We conduct extensive experiments on five real-world benchmark recommendation datasets to test WCF. The experimental results show the superiority of WCF over other existing similarity methods in cold-start/sparse scenarios.
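To illustrate why a Wasserstein-based item similarity may behave better than the Bhattacharyya coefficient when ratings are sparse, the following is a minimal sketch, not the WCF formula from the paper. It assumes integer ratings on a 1 to 5 scale, represents each item by its normalized rating histogram, and maps distance to similarity with a simple linear rescaling. The function names and that rescaling are hypothetical choices made only for this example.

```python
from itertools import accumulate
from math import sqrt


def rating_histogram(ratings, levels=5):
    """Turn a list of integer ratings in 1..levels into a probability vector."""
    counts = [0] * levels
    for r in ratings:
        counts[r - 1] += 1
    total = len(ratings)
    return [c / total for c in counts]


def wasserstein_1d(p, q):
    """1-Wasserstein distance between two distributions on the same
    ordered, unit-spaced support: the sum of absolute CDF differences."""
    cdf_p = accumulate(p)
    cdf_q = accumulate(q)
    return sum(abs(a - b) for a, b in zip(cdf_p, cdf_q))


def bhattacharyya(p, q):
    """Bhattacharyya coefficient: zero whenever the supports do not overlap."""
    return sum(sqrt(a * b) for a, b in zip(p, q))


def wasserstein_similarity(p, q):
    """Hypothetical rescaling (an assumption of this sketch, not the paper's
    formula): divide by the largest possible distance and subtract from 1."""
    max_dist = len(p) - 1
    return 1.0 - wasserstein_1d(p, q) / max_dist


# Three sparsely rated items whose rating histograms do not overlap.
item_a = rating_histogram([5, 5, 5])
item_b = rating_histogram([4, 4, 4])
item_c = rating_histogram([1, 1, 1])

# The Bhattacharyya coefficient cannot tell "close" from "far": both are 0.0.
print(bhattacharyya(item_a, item_b), bhattacharyya(item_a, item_c))

# The Wasserstein-based score still ranks item_b as closer to item_a.
print(wasserstein_similarity(item_a, item_b))  # 0.75
print(wasserstein_similarity(item_a, item_c))  # 0.0
```

With non-overlapping histograms the Bhattacharyya coefficient is zero for both pairs, and the KL divergence is infinite or undefined, whereas the Wasserstein distance still reflects how far apart the rating levels are.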
Pages: 13