Processing SPARQL queries with regular expressions in RDF databases

Cited by: 4
Authors
Lee, Jinsoo [1 ]
Pham, Minh-Duc [1 ]
Lee, Jihwan [1 ]
Han, Wook-Shin [1 ]
Cho, Hune [2 ]
Yu, Hwanjo [3 ]
Lee, Jeong-Hoon [4 ]
Affiliations
[1] Kyungpook Natl Univ, Dept Comp Engn, Taegu, South Korea
[2] Kyungpook Natl Univ, Dept Med Informat, Taegu, South Korea
[3] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
[4] Korea Adv Inst Sci & Technol, Dept Comp Sci, Taejon 305701, South Korea
Source
BMC BIOINFORMATICS | 2011 / Volume 12
Keywords
Query Processing; Resource Description Framework; Regular Expression; Parse Tree; Execution Plan;
DOI
10.1186/1471-2105-12-S2-S6
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Background: The Resource Description Framework (RDF) data model is widely used for modeling and sharing online bioinformatics resources such as Uniprot (dev.isb-sib.ch/projects/uniprot-rdf) and Bio2RDF (bio2rdf.org). As a result, SPARQL, the W3C-recommended query language for RDF databases, has become an important language for querying bioinformatics knowledge bases. Because users' information needs are diverse, and because users often do not know the exact value of each fact stored in an RDF database, it is desirable to support SPARQL queries with regular expression patterns. To the best of our knowledge, no existing work efficiently supports regular expression processing in SPARQL over RDF databases. Most existing techniques for processing regular expressions are designed for querying a text corpus, or support matching only over the paths in an RDF graph.
Results: In this paper, we propose a novel framework for supporting regular expression processing in SPARQL queries. Our contributions can be summarized as follows. 1) We propose an efficient framework for processing SPARQL queries with regular expression patterns in RDF databases. 2) We propose a cost model so that the proposed framework can be integrated into existing query optimizers. 3) We build a prototype of the proposed framework in C++ and conduct extensive experiments demonstrating the efficiency and effectiveness of our technique.
Conclusions: Experiments with a full-blown RDF engine show that our framework outperforms existing ones by up to two orders of magnitude in processing SPARQL queries with regular expression patterns.
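To make the query class concrete, the following is a minimal Python sketch of what a SPARQL `FILTER regex(...)` asks for, evaluated by a naive full scan. It is not the framework proposed in the paper; it only illustrates the kind of baseline evaluation that becomes expensive on large data. The triples, the `ex:` prefix, and the helper name `filter_regex` are hypothetical and chosen purely for illustration.

```python
import re

# Hypothetical toy data: (subject, predicate, object) triples.
TRIPLES = [
    ("ex:P1", "rdfs:label", "Hemoglobin subunit alpha"),
    ("ex:P2", "rdfs:label", "Hemoglobin subunit beta"),
    ("ex:P3", "rdfs:label", "Insulin"),
    ("ex:P1", "ex:organism", "Homo sapiens"),
]


def filter_regex(triples, predicate, pattern, flags=0):
    """Naive scan: test every object of `predicate` against `pattern`.

    re.search is used because SPARQL's regex() matches anywhere in the
    string unless the pattern itself is anchored.
    """
    compiled = re.compile(pattern, flags)
    return [s for s, p, o in triples if p == predicate and compiled.search(o)]


# Roughly what this SPARQL query asks for:
#   SELECT ?s WHERE { ?s rdfs:label ?o . FILTER regex(?o, "^hemoglobin", "i") }
matches = filter_regex(TRIPLES, "rdfs:label", "^hemoglobin", re.IGNORECASE)
print(matches)
```

Every candidate literal is tested against the pattern here, so the cost grows with the number of triples for the predicate; avoiding this kind of exhaustive matching is the motivation stated in the abstract.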
Pages: 11
Related papers
50 in total
  • [1] Processing SPARQL queries with regular expressions in RDF databases
    Jinsoo Lee
    Minh-Duc Pham
    Jihwan Lee
    Wook-Shin Han
    Hune Cho
    Hwanjo Yu
    Jeong-Hoon Lee
    [J]. BMC Bioinformatics, 12
  • [2] Fast Processing SPARQL Queries on Large RDF Data
    Yang, Guang
    Yuan, Pingpeng
    Jin, Hai
    [J]. 2016 IEEE 14TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 14TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 2ND INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/DATACOM/CYBERSC, 2016, : 921 - 926
  • [3] RIQ: Fast processing of SPARQL queries on RDF quadruples
    Katib, Anas
    Slavov, Vasil
    Rao, Praveen
    [J]. JOURNAL OF WEB SEMANTICS, 2016, 37-38 : 90 - 111
  • [4] Processing SPARQL queries over distributed RDF graphs
    Peng Peng
    Lei Zou
    M. Tamer Özsu
    Lei Chen
    Dongyan Zhao
    [J]. The VLDB Journal, 2016, 25 : 243 - 268
  • [5] Processing SPARQL queries over distributed RDF graphs
    Peng, Peng
    Zou, Lei
    Ozsu, M. Tamer
    Chen, Lei
    Zhao, Dongyan
    [J]. VLDB JOURNAL, 2016, 25 (02): : 243 - 268
  • [6] SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases
    Chiba, Hirokazu
    Uchiyama, Ikuo
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [7] SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases
    Hirokazu Chiba
    Ikuo Uchiyama
    [J]. BMC Bioinformatics, 18
  • [8] Distributed processing of regular path queries in RDF graphs
    Guo, Xintong
    Gao, Hong
    Zou, Zhaonian
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (04) : 993 - 1027
  • [9] Distributed processing of regular path queries in RDF graphs
    Xintong Guo
    Hong Gao
    Zhaonian Zou
    [J]. Knowledge and Information Systems, 2021, 63 : 993 - 1027
  • [10] Evaluating SPARQL Queries on Massive RDF Datasets
    Harbi, Razen
    Abdelaziz, Ibrahim
    Kalnis, Panos
    Mamoulis, Nikos
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (12): : 1848 - 1851