Efficient Parallel Processing of Analytical Queries on Linked Data

被引:0
|
作者
Hagedorn, Stefan [1 ]
Sattler, Kai-Uwe [1 ]
机构
[1] Ilmenau Univ Technol, Ilmenau, Germany
关键词
linked data; parallel query processing; micro benchmark; SEMANTIC WEB; JOIN;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Linked data has become one of the most successful movements of the Semantic Web community. RDF and SPARQL have been established as de-facto standards for representing and querying linked data and there exists quite a number of RDF stores and SPARQL engines that can be used to work with the data. However, for many types of queries on linked data these stores are not the best choice regarding query execution times. For example, users are interested in analytical tasks such as profiling or finding correlated entities in their datasets. In this paper we argue that currently available RDF stores are not optimal for such scan-intensive tasks. In order to address this issue, we discuss query evaluation techniques for linked data exploiting the features of modern hardware architectures such as big memory and multi-core processors. Particularly, we describe parallelization techniques as part of our CameLOD system. Furthermore, we compare our system with the well-known linked data stores Virtuoso and RDF-3X by running different analytical queries on the DBpedia dataset and show that we can outperform these systems significantly.
引用
收藏
页码:452 / 469
页数:18
相关论文
共 50 条
  • [31] Processing Analytical Queries over Polystore System for a Large Astronomy Data Repository
    Poudel, Manoj
    Sarode, Rashmi P.
    Watanobe, Yutaka
    Mozgovoy, Maxim
    Bhalla, Subhash
    APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [32] HPPQ: A Parallel Package Queries Processing Approach for Large-Scale Data
    Meihui Shi
    Derong Shen
    Tiezheng Nie
    Yue Kou
    Ge Yu
    Big Data Mining and Analytics, 2018, (02) : 146 - 159
  • [33] An efficient approach for big data processing using spatial Boolean queries
    Dadheech, Pankaj
    Goyal, Dinesh
    Srivastava, Sumit
    Choudhary, C. M.
    JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2018, 21 (04): : 583 - 591
  • [34] An Efficient Filtering Method for Processing Continuous Skyline Queries on Sensor Data
    Jang, Su Min
    Park, Choon Seo
    Seo, Dong Min
    Yo, Jae Soo
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2010, E93B (08) : 2180 - 2183
  • [35] Efficient processing of multiple continuous skyline queries over a data stream
    Lee, Yu Won
    Lee, Ki Yong
    Kim, Myoung Ho
    INFORMATION SCIENCES, 2013, 221 : 316 - 337
  • [36] An efficient processing of queries with joins and aggregate functions in data warehousing environment
    Kim, JH
    Kim, YH
    Kim, SW
    Ok, SH
    13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2002, : 785 - 791
  • [37] DC Proposal: Online Analytical Processing of Statistical Linked Data
    Kaempgen, Benedikt
    SEMANTIC WEB - ISWC 2011, PT II, 2011, 7032 : 301 - 308
  • [38] Parallel dataflow method for optimizing and processing queries on parallel databases
    Li, Jianzhong
    Ruan Jian Xue Bao/Journal of Software, 1998, 9 (03): : 174 - 180
  • [39] Efficient optical reservoir computing for parallel data processing
    Bu, Ting
    Zhang, He
    Kumar, Santosh
    Jin, Mingwei
    Kumar, Prajnesh
    Huang, Yuping
    OPTICS LETTERS, 2022, 47 (15) : 3784 - 3787
  • [40] Parallel Memory-Efficient Processing of BCI Data
    Alexander, Trevor
    Kuh, Anthony
    Hamada, Katsuhiko
    Mori, Hiromu
    Shinoda, Hiroyuki
    Rutkowski, Tomasz
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,