Group Centrality Maximization for Large-scale Graphs

被引:0
|
作者
Angriman, Eugenio [1 ]
van der Grinten, Alexander [1 ]
Bojchevski, Aleksandar [2 ]
Zuegner, Daniel [2 ]
Guennemann, Stephan [2 ]
Meyerhenke, Henning [1 ]
机构
[1] Humboldt Univ, Dept Comp Sci, Berlin, Germany
[2] Tech Univ Munich, Dept Informat, Munich, Germany
关键词
Large-scale graph analysis; group centrality measure; greedy approximation; DATABASE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The study of vertex centrality measures is a key aspect of network analysis. Naturally, such centrality measures have been generalized to groups of vertices; for popular measures it was shown that the problem of finding the most central group is NP-hard. As a result, approximation algorithms to maximize group centralities were introduced recently. Despite a nearly-linear running time, approximation algorithms for group betweenness and (to a lesser extent) group closeness are rather slow on large networks due to high constant overheads. That is why we introduce GED-Walk centrality, a new submodular group centrality measure inspired by Katz centrality. In contrast to closeness and betweenness, it considers walks of any length rather than shortest paths, with shorter walks having a higher contribution. We define algorithms that (i) efficiently approximate the GED-Walk score of a given group and (ii) efficiently approximate the (proved to be NP-hard) problem of finding a group with highest GED-Walk score. Experiments on several real-world datasets show that scores obtained by GED-Walk improve performance on common graph mining tasks such as collective classification and graph-level classification. An evaluation of empirical running times demonstrates that maximizing GED-Walk (in approximation) is two orders of magnitude faster compared to group betweenness approximation and for group sizes <= 100 one to two orders faster than group closeness approximation. For graphs with tens of millions of edges, approximate GED-Walk maximization typically needs less than one minute. Furthermore, our experiments suggest that the maximization algorithms scale linearly with the size of the input graph and the size of the group.
引用
收藏
页码:56 / 69
页数:14
相关论文
共 50 条
  • [1] Group-Centrality and Impact of Large-scale Localized Faults in Spatial Networks
    Feyessa, Teshome
    Bikdash, Marwan
    2012 PROCEEDINGS OF IEEE SOUTHEASTCON, 2012,
  • [2] Effective Large-Scale Online Influence Maximization
    Lagree, Paul
    Cappe, Olivier
    Cautis, Bogdan
    Maniu, Silviu
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 937 - 942
  • [3] LARGE-SCALE AND GLOBAL MAXIMIZATION OF THE DISTANCE TO INSTABILITY
    Mengi, Emre
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2018, 39 (04) : 1776 - 1809
  • [4] Ranking of closeness centrality for large-scale social networks
    Okamoto, Kazuya
    Chen, Wei
    Li, Xiang-Yang
    FRONTIERS IN ALGORITHMICS, 2008, 5059 : 186 - +
  • [5] Link transmission centrality in large-scale social networks
    Zhang, Qian
    Karsai, Marton
    Vespignani, Alessandro
    EPJ DATA SCIENCE, 2018, 7
  • [6] Spanning Edge Centrality: Large-scale Computation and Applications
    Mavroforakis, Charalampos
    Garcia-Lebron, Richard
    Koutis, Ioannis
    Terzi, Evimaria
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 732 - 742
  • [7] Link transmission centrality in large-scale social networks
    Qian Zhang
    Márton Karsai
    Alessandro Vespignani
    EPJ Data Science, 7
  • [8] Finding Structures in Large-scale Graphs
    Chin, Sang Peter
    Reilly, Elizabeth
    Lu, Linyuan
    CYBER SENSING 2012, 2012, 8408
  • [9] Large-scale structures in random graphs
    Bottcher, Julia
    SURVEYS IN COMBINATORICS 2017, 2017, 440 : 87 - 140
  • [10] Protocol for large-scale group
    Taguchi, K
    Takizawa, M
    13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2002, : 12 - 16