Network Sampling Based on Centrality Measures for Relational Classification

被引:1
|
作者
Berton, Lilian [2 ]
Vega-Oliveros, Didier A. [1 ]
Valverde-Rebaza, Jorge [1 ]
da Silva, Andre Tavares [2 ]
Lopes, Alneu de Andrade [1 ]
机构
[1] Univ Sao Paulo, ICMC, Dept Comp Sci, BR-13560970 Sao Carlos, SP, Brazil
[2] Univ Santa Catarina State, Technol Sci Ctr, BR-89219710 Joinville, SC, Brazil
来源
基金
巴西圣保罗研究基金会;
关键词
Network sampling; Relational classification; Centrality measures; Missing data; Complex networks;
D O I
10.1007/978-3-319-55209-5_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many real-world networks, such as the Internet, social networks, biological networks, and others, are massive in size, which impairs their processing and analysis. To cope with this, the network size could be reduced without losing relevant information. In this paper, we extend a work that proposed a sampling method based on the following centrality measures: degree, k-core, clustering, eccentricity and structural holes. For our experiments, we remove 30% and 50% of the vertices and their edges from the original network. After, we evaluate our proposal on six real-world networks on relational classification task using eight different classifiers. Classification results achieved on sampled graphs generated from our proposal are similar to those obtained on the entire graphs. The execution time for learning step of the classifier is shorter on the sampled graph compared to the entire graph and random sampling. In most cases, the original graph was reduced by up to 50% of its initial number of edges without losing topological properties.
引用
收藏
页码:43 / 56
页数:14
相关论文
共 50 条
  • [21] A relational altmetric? Network centrality on ResearchGate as an indicator of scientific impact
    Hoffmann, Christian Pieter
    Lutz, Christoph
    Meckel, Miriam
    [J]. JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (04) : 765 - 775
  • [22] NetClass: A network-based relational model for document classification
    Mourao, Fernando
    Rocha, Leonardo
    Viegas, Felipe
    Salles, Thiago
    Goncalves, Marcos
    Parthasarathy, Srinivasan
    Meira, Wagner, Jr.
    [J]. INFORMATION SCIENCES, 2018, 469 : 60 - 78
  • [23] Comparison-based centrality measures
    Luca Rendsburg
    Damien Garreau
    [J]. International Journal of Data Science and Analytics, 2021, 11 : 243 - 259
  • [24] SET OF MEASURES OF CENTRALITY BASED ON BETWEENNESS
    FREEMAN, LC
    [J]. SOCIOMETRY, 1977, 40 (01): : 35 - 41
  • [25] Comparison-based centrality measures
    Rendsburg, Luca
    Garreau, Damien
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2021, 11 (03) : 243 - 259
  • [26] Fusion of Classifiers based on Centrality Measures
    Silva, Ronan A.
    Britto, Alceu S., Jr.
    Enembreck, Fabricio
    Sabourin, Robert
    Oliveira, Luis S.
    [J]. 2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 363 - 370
  • [27] Measures of centrality based on the spectrum of the Laplacian
    Pauls, Scott D.
    Remondini, Daniel
    [J]. PHYSICAL REVIEW E, 2012, 85 (06)
  • [28] A note on measures of similarity based on centrality
    Kang, Soong Moon
    [J]. SOCIAL NETWORKS, 2007, 29 (01) : 137 - 142
  • [29] Centrality measures based on current flow
    Brandes, U
    Fleischer, D
    [J]. STACS 2005, PROCEEDINGS, 2005, 3404 : 533 - 544
  • [30] Memetic Differential Evolution Using Network Centrality Measures
    Homolya, Viktor
    Vinko, Tomas
    [J]. 14TH INTERNATIONAL GLOBAL OPTIMIZATION WORKSHOP (LEGO), 2019, 2070