Node Attributes and Edge Structure for Large-Scale Big Data Network Analytics and Community Detection

被引:0
|
作者
Chopade, Pravin [1 ]
Zhan, Justin [1 ]
Bikdash, Marwan [1 ]
机构
[1] North Carolina A&T State Univ, Dept Comp Sci & CSE, Greensboro, NC 27411 USA
基金
美国国家科学基金会;
关键词
Large-scale network; Big data; Community detection; Statistical analysis;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying network communities is one of the most important tasks when analyzing complex networks. Most of these networks possess a certain community structure that has substantial importance in building an understanding regarding the dynamics of the large-scale network. Intriguingly, such communities appear to be connected with unique spectral property of the graph Laplacian of the adjacency matrix and we exploit this connection by using modified relationship between Laplacian and adjacency matrix. We propose modularity optimization based on a greedy agglomerative method, coupled with fast unfolding of communities in large-scale networks using Louvain community finding method. Our proposed modified algorithm is linearly scalable for efficient identification of communities in huge directed/undirected networks. The proposed algorithm shows great performance and scalability on benchmark networks in simulations and successfully recovers communities in real network applications. In this paper, we develop communities from node attributes and edge structure. New modified algorithm statistically models the interaction between the network structure and the node attributes which leads to more accurate community detection as well as helps for identifying robustness of the network structure. We also show that any community must contain a dense Erdos-Renyi (ER) subgraph. We carried out comparisons of the Chung and Lu (CL) and Block Two-Level Erdos-Renyi (BTER) models with four real-world data sets. Results demonstrate that it accurately captures the observable properties of many real-world networks.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [11] Too Big to Mail: On the Way to Publish Large-scale Mobile Analytics Data
    Peltonen, Ella
    Lagerspetz, Eemil
    Nurmi, Petteri
    Tarkoma, Sasu
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2374 - 2377
  • [12] Big Data for Enhanced Learning Analytics: A Case for Large-Scale Comparative Assessments
    Korfiatis, Nikolaos
    METADATA AND SEMANTICS RESEARCH, MTSR 2013, 2013, 390 : 225 - 233
  • [13] Software Abstractions for Large-Scale Deep Learning Models in Big Data Analytics
    Khan, Ayaz H.
    Qamar, Ali Mustafa
    Yusuf, Aneeq
    Khan, Rehanullah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (04) : 557 - 566
  • [14] Big Data Analytics for User Association Characterization in Large-Scale WiFi System
    Lyu, Feng
    Ren, Lu
    Cheng, Nan
    Yang, Peng
    Li, Minglu
    Zhang, Yaoxue
    Shen, Xuemin
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [15] Gravity algorithm for the community detection of large-scale network
    Majid Arasteh
    Somayeh Alizadeh
    Chi-Guhn Lee
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 1217 - 1228
  • [16] Gravity algorithm for the community detection of large-scale network
    Arasteh, Majid
    Alizadeh, Somayeh
    Lee, Chi-Guhn
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (2) : 1217 - 1228
  • [17] Structure and Evolution of a Large-Scale Wireless Community Network
    Elianos, Fotios A.
    Plakia, Georgia
    Frangoudis, Pantelis A.
    Polyzos, George C.
    2009 IEEE INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS & WORKSHOPS, 2009, : 357 - 362
  • [18] Erratum to: Structural and functional analytics for community detection in large-scale complex networks
    Pravin Chopade
    Justin Zhan
    Journal of Big Data, 2 (1)
  • [19] Big Data Analytics on Large-Scale Scientific Datasets in the INDIGO-DataCloud Project
    Fiore, Sandro
    Palazzo, Cosimo
    D'Anca, Alessandro
    Elia, Donatello
    Londero, Elisa
    Knapic, Cristina
    Monna, Stephen
    Marcucci, Nicola M.
    Aguilar, Fernando
    Plociennik, Marcin
    De Lucas, Jesus E. Marco
    Aloisio, Giovanni
    ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2017, 2017, : 343 - 348
  • [20] HiPerData: An Autonomous Large-Scale Model Building and Management Platform for Big Data Analytics
    Duan, Rubing
    Goh, Rick Siow Mong
    Yang, Feng
    Di Shang, Richard
    Liu, Yong
    Li, Zengxiang
    Wang, Long
    Lu, Sifei
    Yang, Xulei
    Qin, Zheng
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 449 - 454