H2RDF+ : An Efficient Data Management System for Big RDF Graphs

被引:24
|
作者
Papailiou, Nikolaos [1 ]
Tsoumakos, Dimitrios [1 ]
Konstantinou, Ioannis [1 ]
Karras, Panagiotis [2 ]
Koziris, Nectarios [1 ]
机构
[1] Natl Tech Univ Athens, Sch ECE, Comp Syst Lab, Athens, Greece
[2] Rutgers State Univ, Management Sci & Informat Syst, Piscataway, NJ USA
关键词
RDF; SparQL; Hadoop; MapReduce; HBase; NoSQL; Joins;
D O I
10.1145/2588555.2594535
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The proliferation of data in RDF format has resulted in the emergence of a plethora of specialized management systems. While the ability to adapt to the complexity of a SPARQL query {given their inherent diversity {is crucial, current approaches do not scale well when faced with substantially complex, non-selective joins, resulting in exponential growth of execution times. In this demonstration we present H2RDF+, an RDF store that efficiently performs distributed Merge and Sort-Merge joins using a multiple-index scheme over HBase indexes. Through a greedy planner that incorporates our cost-model, it adaptively commands for either single or multi-machine query execution based on join complexity. In this paper, we present its key scientific contributions and allow participants to interact with an H2RDF+ deployment over a Cloud infrastructure. Using a web-based GUI we allow users to load different datasets (both real and synthetic), apply any query (custom or predefined) and monitor its execution. By allowing real-time inspection of cluster status, response times and committed resources the audience will evaluate the validity of H2RDF+'s claims and perform direct comparisons to two other state-of-the-art RDF stores.
引用
收藏
页码:909 / 912
页数:4
相关论文
共 50 条
  • [1] H2RDF+: High-performance Distributed Joins over Large-scale RDF Graphs
    Papailiou, Nikolaos
    Konstantinou, Ioannis
    Tsoumakos, Dimitrios
    Karras, Panagiotis
    Koziris, Nectarios
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [2] Efficient indexing RDF query algorithm for big data
    Zeng, Yiqun
    Wang, Jingbin
    [J]. MACHINERY ELECTRONICS AND CONTROL ENGINEERING III, 2014, 441 : 691 - 694
  • [3] Big RDF Data Cleaning
    Tang, Nan
    [J]. 2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2015, : 77 - 79
  • [4] SRX: efficient management of spatial RDF data
    Konstantinos Theocharidis
    John Liagouris
    Nikos Mamoulis
    Panagiotis Bouros
    Manolis Terrovitis
    [J]. The VLDB Journal, 2019, 28 : 703 - 733
  • [5] SRX: efficient management of spatial RDF data
    Theocharidis, Konstantinos
    Liagouris, John
    Mamoulis, Nikos
    Bouros, Panagiotis
    Terrovitis, Manolis
    [J]. VLDB JOURNAL, 2019, 28 (05): : 703 - 733
  • [6] Presto-RDF: SPARQL Querying over Big RDF Data
    Mammo, Mulugeta
    Bansal, Srividya K.
    [J]. DATABASES THEORY AND APPLICATIONS, 2015, 9093 : 281 - 293
  • [7] Efficient RDF Interchange (ERI) Format for RDF Data Streams
    Fernandez, Javier D.
    Llaves, Alejandro
    Corcho, Oscar
    [J]. SEMANTIC WEB - ISWC 2014, PT II, 2014, 8797 : 244 - 259
  • [8] Transforming RDF Data into Property Graphs
    Angles, Renzo
    Garcia, Roberto
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2020, 18 (01) : 130 - 137
  • [9] An RDF Data Management System for Conflict Casualties
    Fatah, Yad
    Nourallah, Mark
    Wahab, Lynn
    Abu Salem, Fatima K.
    Elbassuoni, Shady
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4711 - 4715
  • [10] DiploCloud: Efficient and Scalable Management of RDF Data in the Cloud
    Wylot, Marcin
    Cudre-Mauroux, Philippe
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (03) : 659 - 674