BF-BigGraph: An efficient subgraph isomorphism approach using machine learning for big graph databases

Authors
Yazici, Adnan [1, 2]
Taskomaz, Ezgi [2 ]
Affiliations
[1] Nazarbayev Univ, Sch Engn & Digital Sci, Dept Comp Sci, Astana, Kazakhstan
[2] Middle East Tech Univ, Dept Comp Engn, Ankara, Turkiye
Keywords
Graph-based NoSQL databases; Machine learning; Subgraph isomorphism; Query optimization; Algorithm
DOI
10.1016/j.is.2024.102401
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Graph databases are flexible NoSQL databases used to store and query large, complex data efficiently. One of the hardest problems in graph databases is subgraph isomorphism, which involves finding a pattern in a given graph that matches a query. Subgraph isomorphism algorithms generally struggle to process complex queries efficiently, owing to a lack of pruning methods and to the way they use a matching order. In this study, we present a new subgraph isomorphism approach, BF-BigGraph, based on the best-first search design strategy. The approach includes a machine learning technique that efficiently finds the best matching order for a variety of complex queries. As heuristics for improving the performance of complex queries on graph-based NoSQL databases, it uses four parameters: database volatility, database size, query type, and query size. We used the Random Forest machine learning method to narrow the candidate nodes at a higher level of the search, which reduces the search space for querying and retrieval. We compared BF-BigGraph with state-of-the-art approaches (BB-Graph, Neo4j's Cypher, DualIso, GraphQL, TurboIso, and VF3) on publicly available undirected graph databases (WorldCup, Pokec, and Youtube) and on a big directed graph database from a real demographic application (a population database) with approximately 70 million nodes. For the different types of complex queries on all of these databases, our approach performed significantly better than the competing approaches in both computation time and memory use.
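To make the role of a matching order concrete, here is a minimal Python sketch of backtracking subgraph matching on small undirected graphs. It is not the BF-BigGraph algorithm: the abstract does not give the authors' procedure, so the degree-based candidate filter and the "fewest candidates first" ordering below are assumed stand-ins for the learned (Random Forest) ordering, and the names `candidates`, `matching_order`, and `subgraph_matches` are hypothetical. The sketch finds non-induced matches, meaning every query edge must exist in the data graph but extra data edges are allowed.

```python
from typing import Dict, Hashable, Iterator, List, Set

# Undirected graph as adjacency sets: node -> set of neighbouring nodes.
Graph = Dict[Hashable, Set[Hashable]]


def candidates(query: Graph, data: Graph, q: Hashable) -> List[Hashable]:
    """Data nodes whose degree is at least the degree of query node q."""
    return [v for v in data if len(data[v]) >= len(query[q])]


def matching_order(query: Graph, data: Graph) -> List[Hashable]:
    """Assumed heuristic (not the paper's): greedily pick the query node
    with the fewest candidates, preferring nodes adjacent to those
    already ordered so the partial match stays connected."""
    cost = {q: len(candidates(query, data, q)) for q in query}
    order: List[Hashable] = []
    remaining = set(query)
    while remaining:
        placed = set(order)
        # Nodes touching the ordered part; fall back to all remaining nodes
        # at the start or when the query graph is disconnected.
        frontier = {q for q in remaining if query[q] & placed} or remaining
        best = min(frontier, key=lambda q: (cost[q], str(q)))
        order.append(best)
        remaining.remove(best)
    return order


def subgraph_matches(query: Graph, data: Graph) -> Iterator[Dict[Hashable, Hashable]]:
    """Yield every injective mapping of query nodes to data nodes that
    preserves all query edges (non-induced subgraph matching)."""
    order = matching_order(query, data)
    cands = {q: candidates(query, data, q) for q in query}

    def extend(i: int, mapping: Dict[Hashable, Hashable]):
        if i == len(order):
            yield dict(mapping)
            return
        q = order[i]
        used = set(mapping.values())
        for v in cands[q]:
            if v in used:
                continue
            # Prune: each already-mapped neighbour of q must map to a neighbour of v.
            if all(mapping[n] in data[v] for n in query[q] if n in mapping):
                mapping[q] = v
                yield from extend(i + 1, mapping)
                del mapping[q]

    yield from extend(0, {})


if __name__ == "__main__":
    triangle = {"a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b"}}
    # A triangle 1-2-3 with a pendant node 4 attached to node 3.
    data = {1: {2, 3}, 2: {1, 3}, 3: {1, 2, 4}, 4: {3}}
    for match in subgraph_matches(triangle, data):
        print(match)
```

Ordering the most constrained query node first means a dead end is discovered near the root of the search tree rather than deep inside it, which is the general reason the choice of matching order affects running time.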
Pages: 15