K-PRSCAN: A clustering method based on PageRank

被引:13
|
作者
Liu, Li [1 ,2 ]
Sun, Letian [3 ]
Chen, Shiping [4 ]
Liu, Ming [5 ]
Zhong, Jun [3 ]
机构
[1] Chongqing Univ, Sch Software Engn, Chongqing 400044, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[3] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Gansu, Peoples R China
[4] CSIRO ICT Ctr, Dickson, ACT, Australia
[5] Southwest Univ, Fac Comp & Informat Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Page Rank; Binary search; Data mining;
D O I
10.1016/j.neucom.2015.10.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many existing clustering approaches are difficult to cluster non-convex or non-isotropic shapes whose centroids are not highly distinguishable. In addition, most of these approaches are often sensitive to outliers and background noise. To this end, we propose a novel clustering approach called K-PRSCAN, where PageRank algorithm is adopted to estimate the importance of data points in K clusters. The importance exhibits both intra-cluster and inter-cluster relations of a data point, enabling our method to distinguish both globular and non-globular clusters. It can also reduce the negative effect of noisy points whose importance tends to be a small value. The experimental results show that our proposed approach outperforms several well-known clustering approach across seven complex and non-isotropic datasets. We also evaluate the effectiveness of our algorithm on two real-world datasets, i.e. a public dataset of digit handwriting recognition and a dataset for race walking recognition collected by ourselves, and find our approach outperforms other existing algorithms in most aspects. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:65 / 80
页数:16
相关论文
共 50 条
  • [41] PageRank-based Word Sense Induction within Web Search Results Clustering
    Moreno, Jose G.
    Dias, Gael
    2014 IEEE/ACM JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2014, : 465 - 466
  • [42] Comprehensive Evaluation Method of Ethnic Costume Color Based on K-Means Clustering Method
    Zhao, Linqi
    Wang, Zhenya
    Zuo, Yaxue
    Hu, Danyang
    SYMMETRY-BASEL, 2021, 13 (10):
  • [43] An Improved PageRank Algorithm Based on Fuzzy C-Means Clustering and Information Entropy
    Zheng, Wenbo
    Mo, Shaocong
    Duan, Pengfei
    Jin, Xiaotian
    CONFERENCE PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON CONTROL SCIENCE AND SYSTEMS ENGINEERING (ICCSSE), 2017, : 615 - 618
  • [44] An Improved PageRank Method based on Genetic Algorithm for Web Search
    Yan, Lili
    Gui, Zhanji
    Du, Wencai
    Guo, Qingju
    CEIS 2011, 2011, 15
  • [45] Text Clustering Method Based on K-medoids Social Evolutionary Programming
    Hao, ZhanGang
    ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 1, 2012, 148 : 473 - 477
  • [46] A density-grid-based method for clustering k-dimensional data
    Elham S. Kashani
    Saeed Bagheri Shouraki
    Yaser Norouzi
    Bernard De Baets
    Applied Intelligence, 2023, 53 : 10559 - 10573
  • [47] Clustering method of time series based on EMD and K-means algorithm
    School of Computer Science and Technology, Anhui University, Hefei 230039, China
    不详
    Moshi Shibie yu Rengong Zhineng, 2009, 5 (803-808): : 803 - 808
  • [48] Cleaning RFID data streams based on K-means clustering method
    Lin Qiaomin
    Fa Anqi
    Pan Min
    Xie Qiang
    Du Kun
    Sheng Michael
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2020, 27 (02) : 72 - 81
  • [49] A Missing Data Complement Method Based on K-means Clustering Analysis
    Shi, Pengjia
    Zhang, Linyao
    2017 IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2017,
  • [50] K-Means-Based Method for Clustering and Validating Wireless Sensor Network
    Almajidi, Abdo Mahyoub
    Pawar, V. P.
    Alammari, Abdulsalam
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 1, 2019, 55 : 251 - 258