Community detection in large-scale networks: a survey and empirical evaluation

Cited by: 143
Authors
Harenberg, Steve
Bello, Gonzalo
Gjeltema, L.
Ranshous, Stephen
Harlalka, Jitendra
Seay, Ramona
Padmanabhan, Kanchana
Samatova, Nagiza [1]
Affiliations
[1] North Carolina State Univ, Dept Comp Sci, Raleigh, NC 27695 USA
Funding
US National Science Foundation;
Keywords
clustering; community detection; empirical evaluation; graphs; ground-truth; networks;
DOI
10.1002/wics.1319
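Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification codes
020208; 070103; 0714;
Abstract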
Community detection is a common problem in graph data analytics that consists of finding groups of densely connected nodes with few connections to nodes outside of the group. In particular, identifying communities in large-scale networks is an important task in many scientific domains. In this review, we evaluated eight state-of-the-art and five traditional algorithms for overlapping and disjoint community detection on large-scale real-world networks with known ground-truth communities. These 13 algorithms were empirically compared using goodness metrics that measure the structural properties of the identified communities, as well as performance metrics that evaluate these communities against the ground-truth. Our results show that these two types of metrics are not equivalent. That is, an algorithm may perform well in terms of goodness metrics, but poorly in terms of performance metrics, or vice versa. (C) 2014 The Authors. WIREs Computational Statistics published by Wiley Periodicals, Inc.
Pages: 426 - 439
Page count: 14
Related papers
50 in total
  • [1] Community Detection in Large-scale Bipartite Networks
    Liu, Xin
    Murata, Tsuyoshi
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 50 - 57
  • [2] Community Detection in Large-Scale Bipartite Biological Networks
    Calderer, Genis
    Kuijjer, Marieke L.
    [J]. FRONTIERS IN GENETICS, 2021, 12
  • [3] A Distributed Algorithm for Overlapped Community Detection in Large-Scale Networks
    Saha, Dibakar
    Mandal, Partha Sarathi
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2021, : 483 - 491
  • [4] A UNIFIED COMMUNITY DETECTION ALGORITHM IN LARGE-SCALE COMPLEX NETWORKS
    Long, Hao
    Liu, Xiao-Wei
    [J]. ADVANCES IN COMPLEX SYSTEMS, 2019, 22 (03):
  • [5] Community Detection Based on DeepWalk Model in Large-Scale Networks
    Chen, Yunfang
    Wang, Li
    Qi, Dehao
    Ma, Tinghuai
    Zhang, Wei
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2020, 2020
  • [6] Towards Online Multiresolution Community Detection in Large-Scale Networks
    Huang, Jianbin
    Sun, Heli
    Liu, Yaguang
    Song, Qinbao
    Weninger, Tim
    [J]. PLOS ONE, 2011, 6 (08):
  • [7] A Survey of Malicious Accounts Detection in Large-Scale Online Social Networks
    Xin, Yang
    Zhao, Chensu
    Zhu, Hongliang
    Gao, Mingcheng
    [J]. 2018 IEEE 4TH INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), 4TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING (HPSC) AND 3RD IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2018, : 155 - 158
  • [8] A Survey on Community Detection Algorithms in Large Scale Real World Networks
    Chintalapudi, S. Rao
    Prasad, M. H. M. Krishna
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1323 - 1327
  • [9] Parallel k-Clique Community Detection on Large-Scale Networks
    Gregori, Enrico
    Lenzini, Luciano
    Mainardi, Simone
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (08) : 1651 - 1660
  • [10] Structural and functional analytics for community detection in large-scale complex networks
    Chopade P.
    Zhan J.
    [J]. Journal of Big Data, 2 (1)