Efficient In-Memory Evaluation of Reachability Graph Pattern Queries on Data Graphs

被引:0
|
作者
Wu, Xiaoying [1 ]
Theodoratos, Dimitri [2 ]
Skoutas, Dimitrios [3 ]
Lan, Michael [2 ]
机构
[1] Wuhan Univ, Wuhan, Peoples R China
[2] New Jersey Inst Technol, Newark, NJ 07102 USA
[3] RC Athena, Athens, Greece
基金
中国国家自然科学基金;
关键词
Graph pattern matching; Edge-to-path homomorphism; Multi-way join;
D O I
10.1007/978-3-031-00123-9_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graphs are a widely used data model in modern data-intensive applications. Graph pattern matching is a fundamental operation for the exploration and analysis of large data graphs. In this paper, we present a novel approach for efficiently finding homomorphic matches of graph pattern queries, where pattern edges denote reachability relationships between nodes in the data graph. We first propose the concept of query reachability graph to compactly encode all the possible homomorphisms from a query pattern to the data graph. Then, we design a graph traversal-based filtering method to prune nodes from the data graph which violate reachability conditions induced by the pattern edges. We use the pruned data graph to generate a refined query reachability graph which serves as a compact search space for the pattern query answer. Finally, we design a multiway join algorithm to enumerate answer tuples from the query reachability graph without generating an excessive number of redundant intermediate results (a drawback of previous approaches). We experimentally verify the efficiency of our approach and demonstrate that it outperforms by far existing approaches and a recent graph DBMS on evaluating reachability graph pattern queries.
引用
收藏
页码:55 / 71
页数:17
相关论文
共 50 条
  • [21] SilverChunk: An Efficient In-Memory Parallel Graph Processing System
    Zheng, Tianqi
    Zhang, Zhibin
    Cheng, Xueqi
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, 2019, 11707 : 222 - 236
  • [22] Human-Powered Data Cleaning for Probabilistic Reachability Queries on Uncertain Graphs
    Lin, Xin
    Peng, Yun
    Xu, Jianliang
    Choi, Byron
    [J]. 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1755 - 1756
  • [23] Human-Powered Data Cleaning for Probabilistic Reachability Queries on Uncertain Graphs
    Lin, Xin
    Peng, Yun
    Choi, Byron
    Xu, Jianliang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (07) : 1452 - 1465
  • [24] SparkNN: A distributed in-memory data partitioning for KNN queries on big spatial data
    Al Aghbari, Zaher
    Ismail, Tasneem
    Kamel, Ibrahim
    [J]. Data Science Journal, 2020, 19 (01) : 1 - 14
  • [25] DLCR : Efficient Indexing for Label-Constrained Reachability Queries on Large Dynamic Graphs
    Chen, Xin
    Peng, You
    Wang, Sibo
    Yu, Jeffrey Xu
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1645 - 1657
  • [26] Top-k Durable Graph Pattern Queries on Temporal Graphs
    Semertzidis, Konstantinos
    Pitoura, Evaggelia
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (01) : 181 - 194
  • [27] Efficient reachability queries in multi-relation graph: An index-based approach
    Liu, Xijuan
    Zhang, Mengqi
    Fu, Xianming
    Chen, Chen
    Wang, Xiaoyang
    Wu, Yanping
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 96
  • [28] In-Memory Graph Databases for Web-Scale Data
    Castellana, Vito Giovanni
    Morari, Alessandro
    Weaver, Jesse
    Tumeo, Antonino
    Haglin, David
    Villa, Oreste
    Feo, John
    [J]. COMPUTER, 2015, 48 (03) : 24 - 35
  • [29] Employing In-Memory Data Grids for Distributed Graph Processing
    Tasci, Serafettin
    Demirbas, Murat
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1856 - 1864
  • [30] A Compact In-memory Index for Managing Set Membership Queries on Streaming Data
    Wang, Yong
    Yun, Xiaochun
    Wang, Shupeng
    Wang, Xi
    [J]. BIG DATA COMPUTING AND COMMUNICATIONS, (BIGCOM 2016), 2016, 9784 : 88 - 98