Irregular cellular learning automata-based algorithm for sampling social networks

被引:19
|
作者
Ghavipour, Mina [1 ]
Meybodi, Mohammad Reza [1 ]
机构
[1] Amirkabir Univ Technol, Dept Comp Engn & IT, Tehran, Iran
关键词
Complex networks; Social networks; Network sampling; Graph mining; Cellular learning automata; COMPLEX NETWORKS; STATISTICAL-MECHANICS;
D O I
10.1016/j.engappai.2017.01.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since online social networks usually have quite huge size and limited access, smaller subgraphs of them are often produced and analysed as the representative samples of original graphs. Sampling algorithms proposed so far are categorized into three main classes: node sampling, edge sampling, and topology-based sampling. Classic node sampling algorithm, despite its simplicity, performs surprisingly well in many situations. But the problem with node sampling is that the connectivity in sampled subgraph is less likely to be preserved. This paper proposes a topology based node sampling algorithm using irregular cellular learning automata (ICLA), called ICLA-NS. In this algorithm, at first an initial sample subgraph of the input graph is generated using the node sampling method and then an ICLA isomorphic to the input graph is utilized to improve the sample in such a way that the connectivity of the sample is ensured and at the same time the high degree nodes are also included in the sample. Experimental results on real world social networks indicate that the proposed sampling algorithm ICLA-NS preserves more accurately the underlying properties of the original graph compared to existing sampling methods in terms of Kolmogorov-Smirnov (KS) test.
引用
收藏
页码:244 / 259
页数:16
相关论文
共 50 条