Fast execution of RDF queries using Apache Hadoop

被引:0
|
作者
Mazumdar, Somnath [1 ]
Scionti, Alberto [2 ]
机构
[1] Univ Siena, Dept Informat Engn & Math, Siena, Italy
[2] Ist Super Mario Boella ISMB, Turin, Italy
来源
关键词
SPARQL; ENGINE;
D O I
10.1016/bs.adcom.2020.03.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Map-Reduce (MR) is a distributed programming framework which became very popular since its introduction, due to its ability to process massive data sets. MR provides a robust and straightforward mechanism to implement distributed applications without worrying much about manymanagement aspects of parallel programming (e.g., instantiating jobs, data distribution, job synchronization). On the other hand, the Resource Description Framework (RDF) with its simplicity and flexibility, can represent semistructured and unstructured data which are very important for representing web-semantics. SPARQL is a query language aimed at retrieving and manipulating data stored in RDF format and also supports "Big Data" applications. In this book chapter, we present a framework designed to evaluate complex SPARQL queries fast. To improve the execution of SPARQL queries, we implemented the query engine on the Hadoop framework. The engine can handle large and complex queries involving multiple join variables while running on large RDF data sets. Further execution speedup is obtained by preprocessing the input datawith parallel Bloomfilters. The query engine has been tested on the SP2 benchmark, and the results demonstrate the benefits of the design. In this case, the minimum query improvement is 5% while the maximum improvement has been achieved is 82%.
引用
收藏
页码:1 / 33
页数:33
相关论文
共 50 条
  • [21] Fast and Concurrent RDF Queries with RDMA-based Distributed Graph Exploration
    Shi, Jiaxin
    Yao, Youyang
    Chen, Rong
    Chen, Haibo
    Li, Feifei
    [J]. PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, 2016, : 317 - 332
  • [22] Content Based Audiobooks Indexing using Apache Hadoop Framework
    Shetty, Sonal
    Sabarad, Akash
    Hebballi, Harish
    Husain, Moula
    Meena, S. M.
    Nagaralli, Shiddu
    [J]. PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 496 - 501
  • [23] RDF aggregate queries and views
    Hung, E
    Deng, Y
    Subrahmanian, VS
    [J]. ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 717 - 728
  • [24] Processing of Big Educational Data in the Cloud Using Apache Hadoop
    Machova, Renata
    Komarkova, Jitka
    Lnenicka, Martin
    [J]. INTERNATIONAL CONFERENCE ON INFORMATION SOCIETY (I-SOCIETY 2016), 2016, : 46 - 49
  • [25] Color and Texture Feature Extraction using Apache Hadoop Framework
    Sabarad, Akash K.
    Kankudti, Mohamed Humair
    Meena, S. M.
    Husain, Moula
    [J]. 1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 585 - 588
  • [26] Tidy Tuples and Flying Start: fast compilation and fast execution of relational queries in Umbra
    Timo Kersten
    Viktor Leis
    Thomas Neumann
    [J]. The VLDB Journal, 2021, 30 : 883 - 905
  • [27] Tidy Tuples and Flying Start: fast compilation and fast execution of relational queries in Umbra
    Kersten, Timo
    Leis, Viktor
    Neumann, Thomas
    [J]. VLDB JOURNAL, 2021, 30 (05): : 883 - 905
  • [28] Disambiguating Keyword Queries on RDF Databases Using "Deep" Segmentation
    Fu, Haizhou
    Gao, Sidan
    Anyanwu, Kemafor
    [J]. 2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 236 - 243
  • [29] Clustering Remote RDF Data Using SPARQL Update Queries
    Qi, Letao
    Lin, Harris T.
    Honavar, Vasant
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 236 - 242
  • [30] Optimizing Aggregate SPARQL Queries Using Materialized RDF Views
    Ibragimov, Dilshod
    Hose, Katja
    Pedersen, Torben Bach
    Zimanyi, Esteban
    [J]. SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 341 - 359