Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks

被引:19
|
作者
Mall, Raghvendra [1 ]
Langone, Rocco [1 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, ESAT STADIUS, Leuven, Belgium
来源
PLOS ONE | 2014年 / 9卷 / 06期
关键词
COMMUNITY; CUTS;
D O I
10.1371/journal.pone.0099966
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a representative subgraph of the large scale network in the training phase and the model parameters are estimated in the validation stage. The KSC model has a powerful out-of-sample extension property which allows cluster affiliation for the unseen nodes of the big data network. In this paper we exploit the structure of the projections in the eigenspace during the validation stage to automatically determine a set of increasing distance thresholds. We use these distance thresholds in the test phase to obtain multiple levels of hierarchy for the large scale network. The hierarchical structure in the network is determined in a bottom-up fashion. We empirically showcase that real-world networks have multilevel hierarchical organization which cannot be detected efficiently by several state-of-the-art large scale hierarchical community detection techniques like the Louvain, OSLOM and Infomap methods. We show that a major advantage of our proposed approach is the ability to locate good quality clusters at both the finer and coarser levels of hierarchy using internal cluster quality metrics on 7 real-life networks.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Large scale hierarchical clustering of protein sequences
    Antje Krause
    Jens Stoye
    Martin Vingron
    BMC Bioinformatics, 6
  • [22] Large scale hierarchical clustering of protein sequences
    Krause, A
    Stoye, J
    Vingron, M
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [23] Supporting Real-Life Applications in Hierarchical Component Systems
    Jezek, Pavel
    Bures, Tomas
    Hnetynka, Petr
    SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS 2009, 2009, 253 : 107 - +
  • [24] Power and Personality Perception in Real-Life Hierarchical Relationships
    Leikas, Sointu
    Lonnqvist, Jan-Erik
    Verkasalo, Markku
    Nissinen, Vesa
    EUROPEAN JOURNAL OF PERSONALITY, 2013, 27 (02) : 155 - 168
  • [25] Clustering Methods for Hierarchical Traffic Grooming in Large-Scale Mesh WDM Networks
    Chen, Bensong
    Rouskas, George N.
    Dutta, Rudra
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2010, 2 (08) : 502 - 514
  • [26] Clustering Algorithm of Hierarchical Structures in Large-Scale Wireless Sensor and Actuator Networks
    Pham Tran Anh Quang
    Kim, Dong-Seong
    JOURNAL OF COMMUNICATIONS AND NETWORKS, 2015, 17 (05) : 473 - 481
  • [27] Large-scale real-life analysis of survival and usage of therapies in multiple myeloma
    Lopez-Munoz, N.
    Hernandez-Ibarburu, G.
    Alonso, R.
    Sanchez-Pina, J. M.
    Ayala, R.
    Calbacho, M.
    Cuellar, C.
    Cedena, M. T.
    Jimenez, A.
    Iniguez, R.
    Pedrera, M.
    Cruz, J.
    Meloni, L.
    Perez-Rey, D.
    Serrano, P.
    de la Cruz, J.
    Martinez-Lopez, J.
    JOURNAL OF HEMATOLOGY & ONCOLOGY, 2023, 16 (01)
  • [28] Netbench - large-scale network device testing with real-life traffic patterns
    Stancu, Stefan Nicolae
    Krajewski, Adam Lukasz
    Cadeddu, Mattia
    Antosik, Marta
    Panzer-Steinde, Bernd
    23RD INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2018), 2019, 214
  • [29] A Framework for Modeling "Real-Life" Airline Networks
    Bilotkach, Volodymyr
    REVIEW OF NETWORK ECONOMICS, 2009, 8 (03) : 255 - 270
  • [30] Multilevel Optimization for Large-Scale Hierarchical FPGA Placement
    Hui Dai
    Qiang Zhou
    Ji-Nian Bian
    Journal of Computer Science and Technology, 2010, 25 : 1083 - 1091