High Performance Frequent Subgraph Mining on Transaction Datasets: A Survey and Performance Comparison

被引:1
|
作者
Bismita S.Jena
Cynthia Khan
Rajshekhar Sunderraman
机构
[1] Department of Computer Science
[2] Georgia State University
关键词
frequent subgraphs; isomorphism; Spark;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
Graph data mining has been a crucial as well as inevitable area of research. Large amounts of graph data are produced in many areas, such as Bioinformatics, Cheminformatics, Social Networks, etc. Scalable graph data mining methods are getting increasingly popular and necessary due to increased graph complexities. Frequent subgraph mining is one such area where the task is to find overly recurring patterns/subgraphs. To tackle this problem, many main memory-based methods were proposed, which proved to be inefficient as the data size grew exponentially over time. In the past few years, several research groups have attempted to handle the Frequent Subgraph Mining(FSM) problem in multiple ways. Many authors have tried to achieve better performance using Graphic Processing Units(GPUs) which has multi-fold improvement over in-memory while dealing with large datasets. Later, Google’s MapReduce model with the Hadoop framework proved to be a major breakthrough in high performance large batch processing. Although MapReduce came with many benefits, its disk I/O and noniterative style model could not help much for FSM domain since subgraph mining process is an iterative approach.In recent years, Spark has emerged to be the De Facto industry standard with its distributed in-memory computing capability. This is a right fit solution for iterative style of programming as well. In this survey, we cover how high-performance computing has helped in improving the performance tremendously in the transactional directed and undirected aspect of graphs and performance comparisons of various FSM techniques are done based on experimental results.
引用
收藏
页码:159 / 180
页数:22
相关论文
共 50 条
  • [1] High Performance Frequent Subgraph Mining on Transaction Datasets: A Survey and Performance Comparison
    Jena, Bismita S.
    Khan, Cynthia
    Sunderraman, Rajshekhar
    BIG DATA MINING AND ANALYTICS, 2019, 2 (03) : 159 - 180
  • [2] Frequent Subgraph Mining Algorithms in Static and Temporal Graph-Transaction Settings: A Survey
    Jazayeri, Ali
    Yang, Christopher C.
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (06) : 1443 - 1462
  • [3] A survey of frequent subgraph mining algorithms
    Jiang, Chuntao
    Coenen, Frans
    Zito, Michele
    KNOWLEDGE ENGINEERING REVIEW, 2013, 28 (01): : 75 - 105
  • [4] A qualitative survey on frequent subgraph mining
    Guvenoglu, Busra
    Bostanoglu, Belgin Ergenc
    OPEN COMPUTER SCIENCE, 2018, 8 (01) : 194 - 209
  • [5] Frequent Subgraph Mining Algorithms - A Survey
    Ramraj, T.
    Prabhakar, R.
    GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 : 197 - 204
  • [6] A SURVEY ON MAPREDUCE USING FREQUENT SUBGRAPH MINING
    Gokilavani, M.
    Anitha, B.
    Jayanthi, R.
    IIOAB JOURNAL, 2016, 7 (09) : 584 - 591
  • [7] High-performance parallel frequent subgraph discovery
    Saeed Shahrivari
    Saeed Jalili
    The Journal of Supercomputing, 2015, 71 : 2412 - 2432
  • [8] High-performance parallel frequent subgraph discovery
    Shahrivari, Saeed
    Jalili, Saeed
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (07): : 2412 - 2432
  • [9] High performance subgraph mining in molecular compounds
    Di Fatta, G
    Berthold, MR
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2005, 3726 : 866 - 877
  • [10] A survey of Uncertainty Handling in Frequent Subgraph Mining Algorithms
    Moussaoui, Mohamed
    Zaghdoud, Montaceur
    Akaichi, Jalel
    2015 IEEE/ACS 12TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2015,