Large Graph Sampling Algorithm for Frequent Subgraph Mining

Cited by: 5
Authors
Zheng, Tianyu [1]
Wang, Li [1]
Affiliation
[1] University of Science and Technology Liaoning, School of Computer and Software Engineering, Anshan 114051, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Data mining; Sampling methods; Task analysis; Social networking (online); Licenses; Wireless communication; Software engineering; Graph sampling algorithm; frequent subgraph mining; random areas selection;
DOI
10.1109/ACCESS.2021.3089699
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Large graph networks appear frequently in recent applications. Their structures are very large, and the interactions among vertices make it difficult to split a graph into multiple separate structures, which increases the difficulty of frequent subgraph mining. Computing subgraph isomorphism is often expensive, so removing unessential structure from the graph is an effective way to improve efficiency. We therefore propose a large graph sampling algorithm (RASI) that is based on random areas selection sampling and incorporates graph induction techniques to reduce the structure of the original graph. In addition, we find that constraining the weight of the number of vertices in the entire graph is essential to reducing the computation of subgraph isomorphisms; this parameter is constrained during sampling to improve the efficiency of frequent subgraph mining. Experimental results show that RASI has more stable performance and performs better than other sampling algorithms on non-connected graphs. Mining frequent subgraphs by graph sampling can significantly improve mining efficiency, and the number of subgraphs found before and after sampling is very similar.
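The abstract names three ingredients: random selection of areas, graph induction, and a constraint on the share of vertices kept. The record does not give the RASI procedure itself, so the following is only a minimal sketch of that general idea and not the authors' algorithm. The function name `sample_random_areas`, the parameters `num_areas`, `radius`, and `max_vertex_fraction`, and the choice of breadth-first growth around each seed are all assumptions made for illustration.

```python
import random
from collections import deque


def sample_random_areas(adj, num_areas, radius, max_vertex_fraction, seed=None):
    """Illustrative sampler (NOT the paper's RASI): grow small areas around
    random seed vertices, then return the subgraph induced on the kept vertices.

    adj maps each vertex to the set of its neighbors (undirected graph).
    """
    rng = random.Random(seed)
    vertices = list(adj)
    # Assumed constraint: keep at most this share of the original vertices.
    budget = max(1, int(max_vertex_fraction * len(vertices)))
    chosen = set()

    for start in rng.sample(vertices, min(num_areas, len(vertices))):
        # Breadth-first growth of one area, limited by radius and budget.
        queue = deque([(start, 0)])
        seen = {start}
        while queue and len(chosen) < budget:
            vertex, depth = queue.popleft()
            chosen.add(vertex)
            if depth < radius:
                for neighbor in adj[vertex]:
                    if neighbor not in seen:
                        seen.add(neighbor)
                        queue.append((neighbor, depth + 1))

    # Graph induction: keep every original edge whose endpoints were both kept.
    return {v: {w for w in adj[v] if w in chosen} for v in chosen}


# A small non-connected graph: a path 0-1-2-3 and a triangle 4-5-6.
graph = {
    0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2},
    4: {5, 6}, 5: {4, 6}, 6: {4, 5},
}
sample = sample_random_areas(
    graph, num_areas=2, radius=1, max_vertex_fraction=0.5, seed=7
)
```

With seven vertices and `max_vertex_fraction=0.5`, the sample holds at most three vertices. A frequent subgraph miner would then run on `sample` instead of `graph`, which is where the reduced isomorphism cost described in the abstract would come from.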
Pages: 88970-88980
Number of pages: 11