Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks

被引:19
|
作者
Mall, Raghvendra [1 ]
Langone, Rocco [1 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, ESAT STADIUS, Leuven, Belgium
来源
PLOS ONE | 2014年 / 9卷 / 06期
关键词
COMMUNITY; CUTS;
D O I
10.1371/journal.pone.0099966
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a representative subgraph of the large scale network in the training phase and the model parameters are estimated in the validation stage. The KSC model has a powerful out-of-sample extension property which allows cluster affiliation for the unseen nodes of the big data network. In this paper we exploit the structure of the projections in the eigenspace during the validation stage to automatically determine a set of increasing distance thresholds. We use these distance thresholds in the test phase to obtain multiple levels of hierarchy for the large scale network. The hierarchical structure in the network is determined in a bottom-up fashion. We empirically showcase that real-world networks have multilevel hierarchical organization which cannot be detected efficiently by several state-of-the-art large scale hierarchical community detection techniques like the Louvain, OSLOM and Infomap methods. We show that a major advantage of our proposed approach is the ability to locate good quality clusters at both the finer and coarser levels of hierarchy using internal cluster quality metrics on 7 real-life networks.
引用
收藏
页数:18
相关论文
共 50 条
  • [11] Solving a real-life, large-scale energy management problem
    Steffen Godskesen
    Thomas Sejr Jensen
    Niels Kjeldsen
    Rune Larsen
    Journal of Scheduling, 2013, 16 : 567 - 583
  • [12] An improved spectral clustering method for large-scale sparse networks
    Ding, Yi
    Deng, Jiayi
    Zhang, Bo
    STATISTICS AND ITS INTERFACE, 2025, 18 (02) : 257 - 266
  • [13] Solving a real-life, large-scale energy management problem
    Godskesen, Steffen
    Jensen, Thomas Sejr
    Kjeldsen, Niels
    Larsen, Rune
    JOURNAL OF SCHEDULING, 2013, 16 (06) : 567 - 583
  • [14] Multilevel Hierarchical Clustering protocol for wireless sensor networks
    Ateya, Abdelhamied A.
    Sayed, Mohammed S.
    Abdalla, Mahmoud I.
    2014 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2014,
  • [15] Optimal and hierarchical clustering of large-scale hybrid networks for scientific mapping
    Liu, Xinhai
    Glanzel, Wolfgang
    De Moor, Bart
    SCIENTOMETRICS, 2012, 91 (02) : 473 - 493
  • [16] Optimal and hierarchical clustering of large-scale hybrid networks for scientific mapping
    Xinhai Liu
    Wolfgang Glänzel
    Bart De Moor
    Scientometrics, 2012, 91 : 473 - 493
  • [17] Fast spectral clustering learning with hierarchical bipartite graph for large-scale data
    Yang, Xiaojun
    Yu, Weizhong
    Wang, Rong
    Zhang, Guohao
    Nie, Feiping
    PATTERN RECOGNITION LETTERS, 2020, 130 : 345 - 352
  • [18] FRED -: An application for a real-life large scale multihop ad hoc network
    Glombitza, Nils
    Lipphardt, Martin
    Hellbrueck, Horst
    Fischer, Stefan
    2008 FIFTH ANNUAL CONFERENCE ON WIRELESS ON DEMAND NETWORK SYSTEMS AND SERVICES, 2008, : 73 - 76
  • [19] Scalable Spectral Clustering for Overlapping Community Detection in Large-Scale Networks
    Van Lierde, Hadrien
    Chow, Tommy W. S.
    Chen, Guanrong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (04) : 754 - 767
  • [20] Subsampling spectral clustering for stochastic block models in large-scale networks
    Deng, Jiayi
    Huang, Danyang
    Ding, Yi
    Zhu, Yingqiu
    Jing, Bingyi
    Zhang, Bo
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 189