K-PRSCAN: A clustering method based on PageRank

被引:13
|
作者
Liu, Li [1 ,2 ]
Sun, Letian [3 ]
Chen, Shiping [4 ]
Liu, Ming [5 ]
Zhong, Jun [3 ]
机构
[1] Chongqing Univ, Sch Software Engn, Chongqing 400044, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[3] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Gansu, Peoples R China
[4] CSIRO ICT Ctr, Dickson, ACT, Australia
[5] Southwest Univ, Fac Comp & Informat Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Page Rank; Binary search; Data mining;
D O I
10.1016/j.neucom.2015.10.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many existing clustering approaches are difficult to cluster non-convex or non-isotropic shapes whose centroids are not highly distinguishable. In addition, most of these approaches are often sensitive to outliers and background noise. To this end, we propose a novel clustering approach called K-PRSCAN, where PageRank algorithm is adopted to estimate the importance of data points in K clusters. The importance exhibits both intra-cluster and inter-cluster relations of a data point, enabling our method to distinguish both globular and non-globular clusters. It can also reduce the negative effect of noisy points whose importance tends to be a small value. The experimental results show that our proposed approach outperforms several well-known clustering approach across seven complex and non-isotropic datasets. We also evaluate the effectiveness of our algorithm on two real-world datasets, i.e. a public dataset of digit handwriting recognition and a dataset for race walking recognition collected by ourselves, and find our approach outperforms other existing algorithms in most aspects. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:65 / 80
页数:16
相关论文
共 50 条
  • [1] Hypergraph Clustering Based on PageRank
    Takai, Yuuki
    Miyauchi, Atsushi
    Ikeda, Masahiro
    Yoshida, Yuichi
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1970 - 1978
  • [2] Personalized PageRank Clustering: A graph clustering algorithm based on random walks
    Tabrizi, Shayan A.
    Shakery, Azadeh
    Asadpour, Masoud
    Abbasi, Maziar
    Tavallaie, Mohammad Ali
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2013, 392 (22) : 5772 - 5785
  • [3] A novel clustering algorithm based on PageRank and minimax similarity
    Qidong Liu
    Ruisheng Zhang
    Xin Liu
    Yunyun Liu
    Zhili Zhao
    Rongjing Hu
    Neural Computing and Applications, 2019, 31 : 7769 - 7780
  • [4] A novel clustering algorithm based on PageRank and minimax similarity
    Liu, Qidong
    Zhang, Ruisheng
    Liu, Xin
    Liu, Yunyun
    Zhao, Zhili
    Hu, Rongjing
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (11): : 7769 - 7780
  • [5] Landmark selection for spectral clustering based on Weighted PageRank
    Rafailidis, D.
    Constantinou, E.
    Manolopoulos, Y.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 68 : 465 - 472
  • [6] A Method of Computing PageRank Based on Extrapolation
    Sun, Tieli
    Deng, Kaiying
    Deng, Jingwei
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (03): : 731 - 740
  • [7] PCMeans: community detection using local PageRank, clustering, and K-means
    Louafi, Wafa
    Titouna, Faiza
    SOCIAL NETWORK ANALYSIS AND MINING, 2023, 13 (01)
  • [8] PCMeans: community detection using local PageRank, clustering, and K-means
    Wafa Louafi
    Faiza Titouna
    Social Network Analysis and Mining, 13
  • [9] A Clustering Method Based on K-Means Algorithm
    Li, Youguo
    Wu, Haiyan
    INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 1104 - 1109
  • [10] DRank+: A directory based PageRank prediction method for fast PageRank convergence
    Kao, Hung-Yu
    Liu, Chia-Sheng
    Tsai, Yu-Chuan
    Shih, Chia-Chun
    Tsai, Tse-Ming
    WEBIST 2008: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2008, : 175 - 180