Processing SPARQL queries with regular expressions in RDF databases

被引:4
|
作者
Lee, Jinsoo [1 ]
Pham, Minh-Duc [1 ]
Lee, Jihwan [1 ]
Han, Wook-Shin [1 ]
Cho, Hune [2 ]
Yu, Hwanjo [3 ]
Lee, Jeong-Hoon [4 ]
机构
[1] Kyungpook Natl Univ, Dept Comp Engn, Taegu, South Korea
[2] Kyungpook Natl Univ, Dept Med Informat, Taegu, South Korea
[3] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
[4] Korea Adv Inst Sci & Technol, Dept Comp Sci, Taejon 305701, South Korea
来源
BMC BIOINFORMATICS | 2011年 / 12卷
关键词
Query Processing; Resource Description Framework; Regular Expression; Parse Tree; Execution Plan;
D O I
10.1186/1471-2105-12-S2-S6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: As the Resource Description Framework (RDF) data model is widely used for modeling and sharing a lot of online bioinformatics resources such as Uniprot (dev.isb-sib.ch/projects/uniprot-rdf) or Bio2RDF (bio2rdf.org), SPARQL - a W3C recommendation query for RDF databases - has become an important query language for querying the bioinformatics knowledge bases. Moreover, due to the diversity of users' requests for extracting information from the RDF data as well as the lack of users' knowledge about the exact value of each fact in the RDF databases, it is desirable to use the SPARQL query with regular expression patterns for querying the RDF data. To the best of our knowledge, there is currently no work that efficiently supports regular expression processing in SPARQL over RDF databases. Most of the existing techniques for processing regular expressions are designed for querying a text corpus, or only for supporting the matching over the paths in an RDF graph. Results: In this paper, we propose a novel framework for supporting regular expression processing in SPARQL query. Our contributions can be summarized as follows. 1) We propose an efficient framework for processing SPARQL queries with regular expression patterns in RDF databases. 2) We propose a cost model in order to adapt the proposed framework in the existing query optimizers. 3) We build a prototype for the proposed framework in C++ and conduct extensive experiments demonstrating the efficiency and effectiveness of our technique. Conclusions: Experiments with a full-blown RDF engine show that our framework outperforms the existing ones by up to two orders of magnitude in processing SPARQL queries with regular expression patterns.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] The Complexity of Regular Expressions and Property Paths in SPARQL
    Losemann, Katja
    Martens, Wim
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2013, 38 (04):
  • [32] CORNER: A Completeness Reasoner for SPARQL Queries Over RDF Data Sources
    Darari, Fariz
    Prasojo, Radityo Eko
    Nutt, Werner
    [J]. SEMANTIC WEB: ESWC 2014 SATELLITE EVENTS, 2014, 8798 : 310 - 314
  • [33] Intuitive Ontology-Based SPARQL Queries for RDF Data Exploration
    Rodriguez Diaz, Alejandro
    Benito-Santos, Alejandro
    Dorn, Amelie
    Abgaz, Yalemisew
    Wandl-Vogt, Eveline
    Theron, Roberto
    [J]. IEEE ACCESS, 2019, 7 : 156272 - 156286
  • [34] Rewriting of regular expressions and regular path queries
    Calvanese, D
    De Giacomo, G
    Lenzerini, M
    Vardi, MY
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (03) : 443 - 465
  • [35] Query Processing for RDF Databases
    Kaoudi, Zoi
    Kementsietsidis, Anastasios
    [J]. REASONING WEB: REASONING ON THE WEB IN THE BIG DATA ERA, 2014, 8714 : 141 - +
  • [36] Federated SPARQL Queries Processing with Replicated Fragments
    Montoya, Gabriela
    Skaf-Molli, Hala
    Molli, Pascal
    Vidal, Maria-Esther
    [J]. SEMANTIC WEB - ISWC 2015, PT I, 2015, 9366 : 36 - 51
  • [37] Efficient Processing of SPARQL Queries Over GraphFrames
    Bahrami, Ramazan Ali
    Gulati, Jayati
    Abulaish, Muhammad
    [J]. 2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 678 - 685
  • [38] Intermediate results processing for aggregated SPARQL queries
    Rabhi, Ahmed
    Fissoune, Rachida
    Tabaa, Mohamed
    Badir, Hassan
    [J]. 2021 IEEE/ACS 18TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2021,
  • [39] Processing Aggregate Queries in a Federation of SPARQL Endpoints
    Ibragimov, Dilshod
    Hose, Katja
    Pedersen, Torben Bach
    Zimanyi, Esteban
    [J]. SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, ESWC 2015, 2015, 9088 : 269 - 285
  • [40] Processing SPARQL Aggregate Queries with Web Preemption
    Grall, Arnaud
    Minier, Thomas
    Skaf-Molli, Hala
    Molli, Pascal
    [J]. SEMANTIC WEB (ESWC 2020), 2020, 12123 : 235 - 251