Evaluating regular path queries on compressed adjacency matrices

被引:0
|
作者
Arroyuelo, Diego [1 ,2 ]
Gomez-Brandon, Adrian [3 ,4 ]
Navarro, Gonzalo [5 ,6 ]
机构
[1] Pontificia Univ Catolica Chile, Escuela Ingn, DCC, Santiago, Chile
[2] Pontificia Univ Catolica Chile, Escuela Ingn, DCC, Santiago, Chile
[3] Univ A Coruna, ECOBAS, La Coruna, Spain
[4] Univ A Coruna, CITIC, La Coruna, Spain
[5] Univ Chile, IMFD, Santiago, Chile
[6] Univ Chile, DCC, Santiago, Chile
来源
VLDB JOURNAL | 2025年 / 34卷 / 01期
关键词
Regular path queries on graph databases; Compact data structures for adjacency matrices; Sparse matrices; Sparse Boolean matrices; EFFICIENT TRANSITIVE CLOSURE; ALGORITHM; MULTIPLICATION; ALGEBRA; WEB;
D O I
10.1007/s00778-024-00885-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Regular Path Queries (RPQs), which are essentially regular expressions to be matched against the labels of paths in labeled graphs, are at the core of graph database query languages like SPARQL and GQL. A way to solve RPQs is to translate them into a sequence of operations on the adjacency matrices of each label. We design and implement a Boolean algebra on sparse matrix representations and, as an application, use them to handle RPQs. Our baseline representation uses the same space and time as the previously most compact index for RPQs, outperforming it on the hardest types of queries-those where both RPQ endpoints are unspecified. Our more succinct structure, based on k2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k<^>2$$\end{document}-trees, is 4 times smaller than any existing representation that handles RPQs. While slower, it still solves complex RPQs in a few seconds and slightly outperforms the smallest previous structure on the hardest RPQs. Our new sparse-matrix-based solutions dominate a good portion of the space/time tradeoff map, being outperformed only by representations that use much more space. They also implement an algebra of Boolean matrices that is of independent interest beyond solving RPQs.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Evaluating Regular Path Queries on Compressed Adjacency Matrices
    Arroyuelo, Diego
    Gomez-Brandon, Adrian
    Navarro, Gonzalo
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2023, 2023, 14240 : 35 - 48
  • [2] Compressed Graph Representations for Evaluating Regular Path Queries
    Navarro, Gonzalo
    Robert, Josefa
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2024, 2025, 14899 : 218 - 232
  • [3] Dichotomies for Evaluating Simple Regular Path Queries
    Martens, Wim
    Trautner, Tina
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2019, 44 (04):
  • [4] On the Complexity of Evaluating Regular Path Queries over Linear Existential Rules
    Bienvenu, Meghyn
    Thomazo, Michael
    WEB REASONING AND RULE SYSTEMS, (RR 2016), 2016, 9898 : 1 - 17
  • [5] Parametric regular path queries
    Liu, YHA
    Rothamel, T
    Yu, FX
    Stoller, SD
    Hu, NJ
    ACM SIGPLAN NOTICES, 2004, 39 (06) : 219 - 230
  • [6] Temporal Regular Path Queries
    Arenas, Marcelo
    Bahamondes, Pedro
    Aghasadeghi, Amir
    Stoyanovich, Julia
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2412 - 2425
  • [7] Preferential Regular Path Queries
    Grahne, Goesta
    Thomo, Alex
    Wadge, William W.
    FUNDAMENTA INFORMATICAE, 2008, 89 (2-3) : 259 - 288
  • [8] Regular path queries with constraints
    Abiteboul, S
    Vianu, V
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1999, 58 (03) : 428 - 452
  • [9] Solving regular path queries
    Liu, YA
    Yu, FX
    MATHEMATICS OF PROGRAM CONSTRUCTION, 2002, 2386 : 195 - 208
  • [10] Reasoning on regular path queries
    Calvanese, D
    De Giacomo, G
    Lenzerini, M
    Vardi, MY
    SIGMOD RECORD, 2003, 32 (04) : 83 - 92