A model to address the cold-start in peer recommendation by using k-means clustering and sentence embedding

被引:0
|
作者
Shukla, Deepika [1 ]
Chowdary, C. Ravindranath [1 ]
机构
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, India
关键词
Cold-start problem; Peer recommendation; k-means clustering; Recommender system;
D O I
10.1016/j.jocs.2024.102465
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In academia, research collaboration plays a vital role in enhancing the research quality and enriching academic profile of the authors. Recommending appropriate collaborators from a vast scholarly database, particularly for newcomers, poses a challenging cold-start problem. This study addresses a cold-start problem in peer recommendation, considering a dynamic coauthorship graph as a network structure of academic collaborators. As the coauthorship graph is quite large and complex, an efficient indexing method is essential for speeding up the initial search of similar coauthors. The study introduces an efficient Global Inverted List ( GIL ) for indexing research areas and active authors in the coauthorship network. An attribute-based search and filtering mechanism is proposed to identify relevant collaborators, followed by the application k-means clustering and doc2vec metrics to rank and select top recommendations. A cold user is associated with attributes that identify coauthors with similar research interests. For each attribute of the cold user, model searches the associated authors from the GIL. Further, two filtering approaches are applied to refine retrieved author list. The first ensures that the authors have a significant presence in the specified research areas, whereas the second one helps avoid recommending authors with only superficial connections to cold user. The model creates a feature matrix of filtered authors using the publication features of authors. k-means clustering applied to the feature matrix generates k clusters, among which the model chooses those with seed nodes i.e. the clusters which are having seed nodes are selected for further process. Selected clusters are ranked using doc2vec metrics, with the top-ranked cluster providing the final recommendation. model recommends the top L members of the selected cluster, where L is the length of the recommendations provided to the new user. Our extensive experiments show the efficacy of the proposed model.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Spectral Comparison Using k-Means Clustering
    Ramachandran, Vignesh R.
    Mitchell, Herbert J.
    Jacobs, Samantha K.
    Tzeng, Nigel H.
    Firpi, Alexer H.
    Rodriguez, Benjamin M.
    2014 IEEE AEROSPACE CONFERENCE, 2014,
  • [42] Customer Segmentation using K-means Clustering
    Kansal, Tushar
    Bahuguna, Suraj
    Singh, Vishal
    Choudhury, Tanupriya
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 135 - 139
  • [43] Motif discovery using K-means clustering
    Sayed, Mohammed
    Park, Juw Won
    BMC BIOINFORMATICS, 2016, 17
  • [44] K-means clustering algorithm using the entropy
    Palubinskas, G
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING IV, 1998, 3500 : 63 - 71
  • [45] Crime Analysis using k-means Clustering
    Joshi, Anant
    Sabitha, A. Sai
    Choudhury, Tanupriya
    2017 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NETWORKS (CINE), 2017, : 33 - 39
  • [46] Offenders Clustering Using FCM & K-Means
    Farzai, Sara
    Ghasemi, Davood
    Marzuni, Seyed Saeed Mirpour
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2015, 15 (04): : 294 - 301
  • [47] K-means clustering using entropy minimization
    Okafor, A
    Pardalos, PM
    THEORY AND ALGORITHMS FOR COOPERATIVE SYSTEMS, 2004, 4 : 339 - 351
  • [48] Clustering Quality Improvement of k-means using a Hybrid Evolutionary Model
    Karimov, Jeyhun
    Ozbayoglu, Murat
    COMPLEX ADAPTIVE SYSTEMS, 2015, 2015, 61 : 38 - 45
  • [49] NFC: a deep and hybrid item-based model for item cold-start recommendation
    Bernardis, Cesare
    Cremonesi, Paolo
    USER MODELING AND USER-ADAPTED INTERACTION, 2022, 32 (04) : 747 - 780
  • [50] NFC: a deep and hybrid item-based model for item cold-start recommendation
    Cesare Bernardis
    Paolo Cremonesi
    User Modeling and User-Adapted Interaction, 2022, 32 : 747 - 780