Efficient Processing of SPARQL Queries Over GraphFrames

被引:2
|
作者
Bahrami, Ramazan Ali [1 ]
Gulati, Jayati [1 ]
Abulaish, Muhammad [1 ]
机构
[1] South Asian Univ, Dept Comp Sci, Delhi, India
关键词
Graph mining; Linked data mining; SPARQL query processing; GraphFrames; GraphX;
D O I
10.1145/3106426.3106534
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the advent of huge data management systems storing voluminous data, there arises a need to develop efficient data analytics techniques for knowledge discovery at different levels of granularity. Resource Description Framework (RDF), mainly developed for Semantic Web, is presumably a good option when considering graph databases dealing with huge real-world data. RDF models information in the form of triples <subject, predicate, object>, and is considered as a useful tool to store graph data (aka linked data) where each edge can be stored as a triple. Due to existence of huge amount of linked data, mostly in the form of graphs, graph mining has been successful in attracting researchers from different research fields for efficient handling (storage, indexing, retrieval, etc.) of graph data. As a result, various APIs like GraphX and GraphFrames are developed to facilitate relational queries over graph data. Though GraphX is older than GraphFrames and processing SPARQL queries over GraphX has been explored by some researchers, to the best of our knowledge, SPARQL query processing over GraphFrames has not been explored yet. In this paper, we present an initial study on query-specific search space pruning and query optimization approach to process SPARQL queries over GraphFrames in an efficient manner. The experimental results, in terms of low response time for query execution, are encouraging, and give way to invest more research efforts in this direction.
引用
收藏
页码:678 / 685
页数:8
相关论文
共 50 条
  • [21] Sparklify: A Scalable Software Component for Efficient Evaluation of SPARQL Queries over Distributed RDF Datasets
    Stadler, Claus
    Sejdiu, Gezim
    Graux, Damien
    Lehmann, Jens
    [J]. SEMANTIC WEB - ISWC 2019, PT II, 2019, 11779 : 293 - 308
  • [22] Efficient Processing of Queries over Recursive XML Data
    Alghamdi, Norah Saleh
    Rahayu, Wenny
    Pardede, Eric
    [J]. 2015 IEEE 29th International Conference on Advanced Information Networking and Applications (IEEE AINA 2015), 2015, : 134 - 142
  • [23] Processing SPARQL Property Path Queries Online with Web Preemption
    Aimonier-Davat, Julien
    Skaf-Molli, Hala
    Molli, Pascal
    [J]. SEMANTIC WEB, ESWC 2021, 2021, 12731 : 57 - 72
  • [24] A parallel processing architecture to optimize runtime in aggregated SPARQL queries
    Rabhi, Ahmed
    Fissoune, Rachida
    Tabaa, Mohamed
    Badir, Hassan
    [J]. PROCEEDINGS OF 2022 14TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS, MEDES 2022, 2022, : 9 - 15
  • [25] EMBEDDING XPATH QUERIES INTO SPARQL QUERIES
    Droop, Matthias
    Flarer, Markus
    Groppe, Jinghua
    Groppe, Sven
    Linnemann, Volker
    Pinggera, Jakob
    Santner, Florian
    Schier, Michael
    Schoepf, Felix
    Staffler, Hannes
    Zugal, Stefan
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL DISI: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2008, : 5 - +
  • [26] Translating XPath queries into SPARQL queries
    Droop, M.
    Flarer, M.
    Groppe, J.
    Groppe, S.
    Linnemann, V.
    Pinggeral, J.
    Santner, F.
    Schier, M.
    Schoepf, F.
    Staffler, H.
    Zugal, S.
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2007: OTM 2007 WORKSHOPS, PT 1, PROCEEDINGS, 2007, 4805 : 9 - +
  • [27] Lower and Upper Bounds for SPARQL Queries over OWL Ontologies
    Glimm, Birte
    Kazakov, Yevgeny
    Kollia, Ilianna
    Stamou, Giorgos
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 109 - 115
  • [28] ARDBS: Efficient Processing of Provenance Queries Over Annotated Relations
    Mohammadi, Sareh
    Shiri, Nematollaah
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT II, 2022, 13427 : 263 - 269
  • [29] Efficient Processing of Range Queries over Distributed Relational Databases
    Price, Richard
    Ramaswamy, Lakshmish
    Pouriyeh, Seyedamin
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 331 - 337
  • [30] Efficient Processing of Skyline Group Queries over a Data Stream
    Guo, Xi
    Li, Hailing
    Wulamu, Aziguli
    Xie, Yonghong
    Fu, Yajing
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2016, 21 (01) : 29 - 39