Scalable community detection in massive social networks using MapReduce

Cited by: 13
Authors
Shi, J. [1]
Xue, W. [2]
Wang, W. [3]
Zhang, Y.
Yang, B. [4]
Li, J. [5]
Affiliations
[1] IBM Res China, Beijing 100193, Peoples R China
[2] Tencent Inc, Beijing 100080, Peoples R China
[3] Shanghai Synacast Media Tech PPLive Inc, Shanghai 201203, Peoples R China
[4] IBM Software Grp, China Dev Lab, Beijing 100193, Peoples R China
[5] IBM Res Austin, Austin, TX 78758 USA
Keywords
MODULARITY
DOI
10.1147/JRD.2013.2251982
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present a community-detection solution for massive-scale social networks using MapReduce, a parallel programming framework. We use a similarity metric to model the community probability, and the model is designed to be parallelizable and scalable in the MapReduce framework. More importantly, we propose a set of degree-based preprocessing and postprocessing techniques named DEPOLD (DElayed Processing of Large Degree nodes) that significantly improve both the community-detection accuracy and performance. With DEPOLD, delaying analysis of 1% of high-degree nodes to the postprocessing stage reduces both processing time and storage space by one order of magnitude. DEPOLD can be applied to other graph-clustering problems. Furthermore, we design and implement two similarity calculation algorithms using MapReduce with different computation and communication characteristics in order to adapt to various system configurations. Finally, we conduct experiments with publicly available datasets. Our evaluation demonstrates the effectiveness, efficiency, and scalability of the proposed solution.
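The abstract does not say which similarity metric the authors use or how DEPOLD is implemented, so the following is only a minimal single-process sketch of the general idea, under stated assumptions: similarity is taken to be Jaccard similarity over neighbor sets, the map and reduce phases are simulated with plain Python generators and dictionaries rather than a real MapReduce runtime, and all function names (`build_adjacency`, `split_by_degree`, `map_phase`, `reduce_phase`) are hypothetical. It shows why setting aside a small fraction of high-degree nodes can shrink the intermediate data: a node of degree d emits d(d-1)/2 neighbor pairs in the map phase.

```python
from collections import defaultdict
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set of neighbors}."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return dict(adj)


def split_by_degree(adj, fraction=0.01):
    """Set aside the top `fraction` of nodes by degree (at least one node).

    Returns (delayed, reduced), where `reduced` is the graph with the
    delayed nodes and their edges removed. This preprocessing step is an
    assumption made for illustration, not the paper's exact procedure.
    """
    ranked = sorted(adj, key=lambda n: len(adj[n]), reverse=True)
    k = max(1, int(len(ranked) * fraction))
    delayed = set(ranked[:k])
    reduced = {n: nbrs - delayed for n, nbrs in adj.items() if n not in delayed}
    return delayed, reduced


def map_phase(adj):
    """Map: each node emits one record per pair of its neighbors.

    The pair (a, b) shares this node as a common neighbor, so the value is 1.
    """
    for nbrs in adj.values():
        for a, b in combinations(sorted(nbrs), 2):
            yield (a, b), 1


def reduce_phase(records, adj):
    """Reduce: sum common-neighbor counts per pair, then compute Jaccard."""
    common = defaultdict(int)
    for pair, value in records:
        common[pair] += value
    return {
        (a, b): count / len(adj[a] | adj[b])
        for (a, b), count in common.items()
    }


if __name__ == "__main__":
    # Toy graph: hub "h" linked to five nodes, plus a short path a-b-c.
    edges = [("h", "a"), ("h", "b"), ("h", "c"), ("h", "d"), ("h", "e"),
             ("a", "b"), ("b", "c")]
    adj = build_adjacency(edges)
    delayed, reduced = split_by_degree(adj, fraction=0.2)
    print("delayed nodes:", delayed)
    print("map records, full graph:", sum(1 for _ in map_phase(adj)))
    print("map records, reduced graph:", sum(1 for _ in map_phase(reduced)))
    print("similarities:", reduce_phase(map_phase(reduced), reduced))
```

In this toy graph the single hub accounts for 10 of the 15 map records, which is the effect the delayed-processing idea exploits. How the delayed nodes are reattached to communities in the postprocessing stage is not described in the abstract and is left out of the sketch.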
Pages: 14
Related papers
50 in total
  • [41] Evolutionary Community Detection in Social Networks
    He, Tiantian
    Chan, Keith C. C.
    2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 1496 - 1503
  • [42] Hidden community detection in social networks
    He, Kun
    Li, Yingru
    Soundarajan, Sucheta
    Hopcroft, John E.
    INFORMATION SCIENCES, 2018, 425 : 92 - 106
  • [43] Community detection for emerging social networks
    Zhan, Qianyi
    Zhang, Jiawei
    Yu, Philip
    Xie, Junyuan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2017, 20 (06): : 1409 - 1441
  • [44] Hybrid Community Detection in Social Networks
    Du, Hongwei
    Wu, Weili
    Cui, Lei
    Du, Ding-Zhu
    MODELS, ALGORITHMS AND TECHNOLOGIES FOR NETWORK ANALYSIS, NET 2014, 2016, 156 : 127 - 133
  • [45] Community Detection in Multiplex Social Networks
    Nguyen, Hung T.
    Dinh, Thang N.
    Tam Vu
    2015 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2015, : 654 - 659
  • [46] Community detection for emerging social networks
    Qianyi Zhan
    Jiawei Zhang
    Philip Yu
    Junyuan Xie
    World Wide Web, 2017, 20 : 1409 - 1441
  • [47] Overlapping Community Detection in Social Networks
    Dhouioui, Zeineb
    Akaichi, Jalel
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [48] Overlapping Community Detection in Social Networks Using Cellular Learning Automata
    Khomami, Mohammad Mehdi Daliri
    Rezvanian, Alireza
    Saghiri, Ali Mohammad
    Meybodi, Mohammad Reza
    2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 1602 - 1607
  • [49] Enhanced Overlapping Community detection in Social Networks using Wise Initialization
    Jalili, Sajjad
    Hamzeh, Ali
    2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 463 - 466
  • [50] Community detection in social networks using user frequent pattern mining
    Moosavi, Seyed Ahmad
    Jalali, Mehrdad
    Misaghian, Negin
    Shamshirband, Shahaboddin
    Anisi, Mohammad Hossein
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (01) : 159 - 186