Sparsity measure of a network graph: Gini index

被引:31
|
作者
Goswami, Swati [1 ,2 ]
Murthy, C. A. [1 ]
Das, Asit K. [2 ]
机构
[1] Indian Stat Inst, Machine Intelligence Unit, 203 BT Road, Kolkata 700108, India
[2] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur 711103, Howrah, India
关键词
Network graph; Sparsity measure; Gini index; Edge density; Degree distribution; Power law; Network community detection algorithm; DISTRIBUTIONS; COMMUNITY;
D O I
10.1016/j.ins.2018.05.044
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article explores the problem of formulating a general measure of sparsity of network graphs. Based on an available definition sparsity of a dataset, namely Gini index, it provides a way to define sparsity measure of a network graph. We name the sparsity measure so introduced as sparsity index. Sparsity measures are commonly associated with six properties, namely, Robin Hood, Scaling, Rising Tide. Cloning, Bill Gates and Babies. Sparsity index directly satisfies four of these six properties; does not satisfy Cloning and satisfies Scaling for some specific cases. A comparison of the proposed index is drawn with Edge Density (the proportion of the sum of degrees of all nodes in a graph compared to the total possible degrees in the corresponding fully connected graph), by showing mathematically that as the edge density of an undirected graph increases, its sparsity index decreases. The paper highlights how the proposed sparsity measure can reveal important properties of a network graph. Further, a relationship has been drawn analytically between the sparsity index and the exponent term of a power law distribution (a distribution known to approximate the degree distribution of a wide variety of network graphs). To illustrate application of the proposed index, a community detection algorithm for network graphs is presented. The algorithm produces overlapping communities with no input requirement on number or size of the communities; has a computational complexity O(n(2)), where n is the number of nodes of the graph. The results validated on artificial and real networks show its effectiveness. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:16 / 39
页数:24
相关论文
共 50 条
  • [31] Shrinkage estimation of Gini index
    Ghori, R.
    Ahmed, S. E.
    Hussein, A. A.
    CONTRIBUTIONS TO PROBABILITY AND STATISTICS: APPLICATIONS AND CHALLENGES, 2006, : 234 - +
  • [32] Reliable inference for the Gini index
    Davidson, Russell
    JOURNAL OF ECONOMETRICS, 2009, 150 (01) : 30 - 40
  • [33] Practical modified Gini index
    Malul, Miki
    Shapira, Daniel
    Shoham, Amir
    APPLIED ECONOMICS LETTERS, 2013, 20 (04) : 324 - 327
  • [34] THE BRADFORD DISTRIBUTION AND THE GINI INDEX
    BURRELL, QL
    SCIENTOMETRICS, 1991, 21 (02) : 181 - 194
  • [35] A characterization of the Gini segregation index
    Carmen Puerta
    Ana Urrutia
    Social Choice and Welfare, 2016, 47 : 519 - 529
  • [36] The robustness of the generalized Gini index
    S. Settepanella
    A. Terni
    M. Franciosi
    L. Li
    Decisions in Economics and Finance, 2022, 45 : 521 - 539
  • [37] A data science based standardized Gini index as a Lorenz dominance preserving measure of the inequality of distributions
    Ultsch, Alfred
    Loetsch, Joern
    PLOS ONE, 2017, 12 (08):
  • [38] Political inequality in participation index-a Gini-based measure of inequalities in political participation
    Susanszky, Pal
    Somogyi, Robert
    Toth, Gergely
    RATIONALITY AND SOCIETY, 2023, 35 (02) : 231 - 255
  • [39] Concordance and Gini's measure of association
    Nelsen, RB
    JOURNAL OF NONPARAMETRIC STATISTICS, 1998, 9 (03) : 227 - 238
  • [40] The robustness of the generalized Gini index
    Settepanella, S.
    Terni, A.
    Franciosi, M.
    Li, L.
    DECISIONS IN ECONOMICS AND FINANCE, 2022, 45 (02) : 521 - 539