Kernel Spectral Clustering for Big Data Networks

被引:49
|
作者
Mall, Raghvendra [1 ]
Langone, Rocco [1 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD SISTA, B-3001 Louvain, Belgium
关键词
kernel spectral clustering; out-of-sample extensions; sampling graphs; angular similarity; COMMUNITY STRUCTURE;
D O I
10.3390/e15051567
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
This paper shows the feasibility of utilizing the Kernel Spectral Clustering (KSC) method for the purpose of community detection in big data networks. KSC employs a primal-dual framework to construct a model. It results in a powerful property of effectively inferring the community affiliation for out-of-sample extensions. The original large kernel matrix cannot fitinto memory. Therefore, we select a smaller subgraph that preserves the overall community structure to construct the model. It makes use of the out-of-sample extension property for community membership of the unseen nodes. We provide a novel memory-and computationally efficient model selection procedure based on angular similarity in the eigenspace. We demonstrate the effectiveness of KSC on large scale synthetic networks and real world networks like the YouTube network, a road network of California and the Livejournal network. These networks contain millions of nodes and several million edges.
引用
收藏
页码:1567 / 1586
页数:20
相关论文
共 50 条
  • [41] Consensus Clustering on Big Data
    Liu, Hongfu
    Cheng, Gong
    Wu, Junjie
    2015 12TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2015,
  • [42] Big Data clustering validity
    Tlili, Monia
    Hamdani, Tarek M.
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 348 - 352
  • [43] SPECTRAL CLUSTERING IN HETEROGENEOUS NETWORKS
    Sengupta, Srijan
    Chen, Yuguo
    STATISTICA SINICA, 2015, 25 (03) : 1081 - 1106
  • [44] Spectral Clustering in Social Networks
    Kurucz, Miklos
    Benczur, Andras A.
    Csalogany, Karoly
    Lukacs, Laszlo
    ADVANCES IN WEB MINING AND WEB USAGE ANALYSIS, 2009, 5439 : 1 - 20
  • [45] Kernel based clustering for multiclass data
    Satish, DS
    Sekhar, CC
    NEURAL INFORMATION PROCESSING, 2004, 3316 : 1266 - 1272
  • [46] Using Spectral Clustering Association Algorithm upon Teaching Big Data for Precise Education
    Zhou, Yongfu
    Zeng, Zhi
    Wang, Huabin
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [47] Research on spectral clustering algorithm for network communication big data based on wavelet analysis
    Dai, Xinjian
    Zeng, Zhichao
    INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2022, 15 (02) : 93 - 105
  • [48] Combining Compression and Clustering Techniques to Handle Big Data Collected in Sensor Networks
    Harb, Hassan
    Abou Jaoude, Chady
    2018 IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (MENACOMM), 2018, : 122 - 127
  • [49] A Fast Clustering Algorithm for Analyzing Big Data Generated in Ubiquitous Sensor Networks
    Zahwe, Oussama
    Majed, Ola
    Harb, Hassan
    Hamze, Mohamad
    Nasser, Abbass
    2018 19TH INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2018, : 142 - 147
  • [50] Spectral clustering with adaptive similarity measure in Kernel space
    Ye, Xiucai
    Sakurai, Tetsuya
    INTELLIGENT DATA ANALYSIS, 2018, 22 (04) : 751 - 765