Diversified Stress Testing of RDF Data Management Systems

被引:0
|
作者
Aluc, Gunes [1 ]
Hartig, Olaf [1 ]
Ozsu, M. Tamer [1 ]
Daudjee, Khuzaima [1 ]
机构
[1] David R Cheriton Sch Comp Sci, Waterloo, ON, Canada
来源
关键词
RDF; SPARQL; systems; benchmarking; workload diversity; STORE; WEB;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Resource Description Framework (RDF) is a standard for conceptually describing data on the Web, and SPARQL is the query language for RDF. As RDF data continue to be published across heterogeneous domains and integrated at Web-scale such as in the Linked Open Data (LOD) cloud, RDF data management systems are being exposed to queries that are far more diverse and workloads that are far more varied. The first contribution of our work is an in-depth experimental analysis that shows existing SPARQL benchmarks are not suitable for testing systems for diverse queries and varied workloads. To address these shortcomings, our second contribution is the Waterloo SPARQL Diversity Test Suite (WatDiv) that provides stress testing tools for RDF data management systems. Using WatDiv, we have been able to reveal issues with existing systems that went unnoticed in evaluations using earlier benchmarks. Specifically, our experiments with five popular RDF data management systems show that they cannot deliver good performance uniformly across workloads. For some queries, there can be as much as five orders of magnitude difference between the query execution time of the fastest and the slowest system while the fastest system on one query may unexpectedly time out on another query. By performing a detailed analysis, we pinpoint these problems to specific types of queries and workloads.
引用
收藏
页码:197 / 212
页数:16
相关论文
共 50 条
  • [1] A survey of RDF data management systems
    Ozsu, M. Tamer
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2016, 10 (03) : 418 - 432
  • [2] A survey of RDF data management systems
    M. Tamer Özsu
    [J]. Frontiers of Computer Science, 2016, 10 : 418 - 432
  • [3] A survey of RDF data management systems
    M.Tamer ZSU
    [J]. Frontiers of Computer Science, 2016, 10 (03) : 418 - 432
  • [4] Diversified spatial keyword search on RDF data
    Cai, Zhi
    Kalamatianos, Georgios
    Fakas, Georgios J.
    Mamoulis, Nikos
    Papadias, Dimitris
    [J]. VLDB JOURNAL, 2020, 29 (05): : 1171 - 1189
  • [5] Diversified spatial keyword search on RDF data
    Zhi Cai
    Georgios Kalamatianos
    Georgios J. Fakas
    Nikos Mamoulis
    Dimitris Papadias
    [J]. The VLDB Journal, 2020, 29 : 1171 - 1189
  • [6] RDF for temporal data management - a survey
    Zhang, Fu
    Li, Zhiyin
    Peng, Dunhong
    Cheng, Jingwei
    [J]. EARTH SCIENCE INFORMATICS, 2021, 14 (02) : 563 - 599
  • [7] RDF for temporal data management – a survey
    Fu Zhang
    Zhiyin Li
    Dunhong Peng
    Jingwei Cheng
    [J]. Earth Science Informatics, 2021, 14 : 563 - 599
  • [8] A Survey of Distributed RDF Data Management
    Zou L.
    Peng P.
    [J]. 2017, Science Press (54): : 1213 - 1224
  • [9] Completeness Management for RDF Data Sources
    Darari, Fariz
    Nutt, Werner
    Pirro, Giuseppe
    Razniewski, Simon
    [J]. ACM TRANSACTIONS ON THE WEB, 2018, 12 (03)
  • [10] Research Issues in RDF Management Systems
    Chawla, Tanvi
    Singh, Girdhari
    Pilli, Emmanuel S.
    Govil, M. C.
    [J]. 2016 INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN COMMUNICATION TECHNOLOGIES (ETCT), 2016,