K-PRSCAN: A clustering method based on PageRank

被引:13
|
作者
Liu, Li [1 ,2 ]
Sun, Letian [3 ]
Chen, Shiping [4 ]
Liu, Ming [5 ]
Zhong, Jun [3 ]
机构
[1] Chongqing Univ, Sch Software Engn, Chongqing 400044, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[3] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Gansu, Peoples R China
[4] CSIRO ICT Ctr, Dickson, ACT, Australia
[5] Southwest Univ, Fac Comp & Informat Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Page Rank; Binary search; Data mining;
D O I
10.1016/j.neucom.2015.10.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many existing clustering approaches are difficult to cluster non-convex or non-isotropic shapes whose centroids are not highly distinguishable. In addition, most of these approaches are often sensitive to outliers and background noise. To this end, we propose a novel clustering approach called K-PRSCAN, where PageRank algorithm is adopted to estimate the importance of data points in K clusters. The importance exhibits both intra-cluster and inter-cluster relations of a data point, enabling our method to distinguish both globular and non-globular clusters. It can also reduce the negative effect of noisy points whose importance tends to be a small value. The experimental results show that our proposed approach outperforms several well-known clustering approach across seven complex and non-isotropic datasets. We also evaluate the effectiveness of our algorithm on two real-world datasets, i.e. a public dataset of digit handwriting recognition and a dataset for race walking recognition collected by ourselves, and find our approach outperforms other existing algorithms in most aspects. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:65 / 80
页数:16
相关论文
共 50 条
  • [21] Viewpoint Ranking Method Based on PageRank in Group Discussion
    Zhou, Jian
    Du, Ning
    Sun, Li-juan
    Xiao, Fu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGY (CNCT 2016), 2016, 54 : 501 - 509
  • [22] An influence power-based clustering approach with PageRank-like model
    Liu, Li
    Chen, Xiwei
    Liu, Ming
    Jia, Yongpo
    Zhong, Jun
    Gao, Rongmin
    Zhao, Yu
    APPLIED SOFT COMPUTING, 2016, 40 : 17 - 32
  • [23] A Zombie Account Detection Method in Microblog Based on the PageRank
    Li, Shaobo
    Li, Xuhang
    Yang, He
    Sun, Guanglu
    Lang, Fei
    2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2017, : 267 - 270
  • [24] An Improved Fractal Coding Method based on K-means Clustering
    Guo, Hui
    He, Jie
    Proceedings of the 2016 4th International Conference on Mechanical Materials and Manufacturing Engineering (MMME 2016), 2016, 79 : 294 - 300
  • [25] Scalable graph clustering with parallel approximate PageRank
    Perozzi, Bryan
    McCubbin, Christopher
    Halbert, J. T.
    SOCIAL NETWORK ANALYSIS AND MINING, 2014, 4 (01) : 1 - 11
  • [26] An Improved K-Means Clustering Algorithm Based on Spectral Method
    Tian, Shengwen
    Yang, Hongyong
    Wang, Yilei
    Li, Ali
    ADVANCES IN COMPUTATION AND INTELLIGENCE, PROCEEDINGS, 2008, 5370 : 530 - 536
  • [27] An Improved K-means Clustering Method based on Data Field
    Xu, Cui
    Liu, Yuhua
    Xu, Ke
    INTERNATIONAL CONFERENCE ON CONTROL SYSTEM AND AUTOMATION (CSA 2013), 2013, : 454 - 459
  • [28] Method of Redundant Features Eliminating based on K-means Clustering
    Li, Limin
    Wang, Zhongsheng
    MATERIALS SCIENCE, CIVIL ENGINEERING AND ARCHITECTURE SCIENCE, MECHANICAL ENGINEERING AND MANUFACTURING TECHNOLOGY, PTS 1 AND 2, 2014, 488-489 : 1023 - 1026
  • [29] AN APPROXIMATE CLUSTERING TECHNIQUE BASED ON THE K-NEAREST NEIGHBORS METHOD
    KOVALENKO, AP
    AUTOMATION AND REMOTE CONTROL, 1992, 53 (10) : 1592 - 1598
  • [30] An ordered clustering algorithm based on K-means and the PROMETHEE method
    Chen, Liuhao
    Xu, Zeshui
    Wang, Hai
    Liu, Shousheng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (06) : 917 - 926