Scalable community detection in massive social networks using MapReduce

被引:13
|
作者
Shi, J. [1 ]
Xue, W. [2 ]
Wang, W. [3 ]
Zhang, Y.
Yang, B. [4 ]
Li, J. [5 ]
机构
[1] IBM Res China, Beijing 100193, Peoples R China
[2] Tencent Inc, Beijing 100080, Peoples R China
[3] Shanghai Synacast Media Tech PPLive Inc, Shanghai 201203, Peoples R China
[4] IBM Software Grp, China Dev Lab, Beijing 100193, Peoples R China
[5] IBM Res Austin, Austin, TX 78758 USA
关键词
MODULARITY;
D O I
10.1147/JRD.2013.2251982
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a community-detection solution for massive-scale social networks using MapReduce, a parallel programming framework. We use a similarity metric to model the community probability, and the model is designed to be parallelizable and scalable in the MapReduce framework. More importantly, we propose a set of degree-based preprocessing and postprocessing techniques named DEPOLD (DElayed Processing of Large Degree nodes) that significantly improve both the community-detection accuracy and performance. With DEPOLD, delaying analysis of 1% of high-degree nodes to the postprocessing stage reduces both processing time and storage space by one order of magnitude. DEPOLD can be applied to other graph-clustering problems. Furthermore, we design and implement two similarity calculation algorithms using MapReduce with different computation and communication characteristics in order to adapt to various system configurations. Finally, we conduct experiments with publicly available datasets. Our evaluation demonstrates the effectiveness, efficiency, and scalability of the proposed solution.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Overlapping community detection in social networks using coalitional games
    Annapurna Jonnalagadda
    Lakshmanan Kuppusamy
    Knowledge and Information Systems, 2018, 56 : 637 - 661
  • [22] Fuzzy Community Detection in Social Networks Using a Genetic Algortihm
    Su, Jianhai
    Havens, Timothy C.
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 2039 - 2046
  • [23] Community detection in attributed social networks using deep learning
    Rashnodi, Omid
    Rastegarpour, Maryam
    Moradi, Parham
    Zamanifar, Azadeh
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (18): : 25933 - 25973
  • [24] Influence maximization in social networks using effective community detection
    Kazemzadeh, Farzaneh
    Safaei, Ali Asghar
    Mirzarezaee, Mitra
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 598
  • [25] COMMUNITY DETECTION IN ONLINE SOCIAL NETWORKS USING ACTIONS OF USERS
    Moosavi, Seyed Ahmad
    Jalali, Mehrdad
    2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [26] Using Cohen's k for Community Detection in Social Networks
    Hoffman, Michaela
    Steinley, Douglas
    Gates, Kathleen M.
    Prinstein, Mitchell J.
    Brusco, Michael J.
    MULTIVARIATE BEHAVIORAL RESEARCH, 2015, 50 (06) : 740 - 741
  • [27] Community Detection in Social Networks Using Content and Link Analysis
    Kakisim, Arzu
    Sogukpinar, Ibrahim
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1521 - 1524
  • [28] Community detection in social networks using structural and content information
    Akachar, Elyazid
    Ouhbi, Brahim
    Frikh, Bouchra
    IIWAS2018: THE 20TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2014, : 282 - 288
  • [29] Link Prediction in Social Networks Using Hierarchical Community Detection
    Deylami, Hasti Akbari
    Asadpour, Masoud
    2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
  • [30] Scalable Distributed Reasoning Using MapReduce
    Urbani, Jacopo
    Kotoulas, Spyros
    Oren, Eyal
    van Harmelen, Frank
    SEMANTIC WEB - ISWC 2009, PROCEEDINGS, 2009, 5823 : 634 - 649