High-performance, Distributed Dictionary Encoding of RDF Datasets

Cited by: 1
Authors
Morari, Alessandro [1]
Weaver, Jesse [1]
Villa, Oreste [2]
Haglin, David [1]
Tumeo, Antonino [1]
Castellana, Vito Giovanni [1]
Feo, John [1]
Affiliations
[1] Pacific NW Natl Lab, Richland, WA 99354 USA
[2] NVIDIA Res, Santa Clara, CA 95051 USA
DOI
10.1109/CLUSTER.2015.44
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this work we propose a novel approach for RDF (Resource Description Framework) dictionary encoding that employs a parallel RDF parser and a distributed dictionary data structure, exploiting RDF-specific optimizations. In contrast with previous solutions, this approach exploits the Partitioned Global Address Space (PGAS) programming model combined with active messages. We evaluate the performance of our dictionary encoder in our RDF database, GEMS (Graph Engine for Multithreaded Systems), and provide an empirical comparison against previous approaches. Our comparison shows that our dictionary encoder scales significantly better and achieves higher performance than the current state of the art, providing a key element for the realization of a more efficient RDF database.
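As a rough illustration of what dictionary encoding of RDF data means, the sketch below maps each RDF term to an integer ID and back, with terms assigned to partitions by hash. It is not the encoder described in the abstract: it runs in a single process, uses neither PGAS nor active messages, and the names `PartitionedDictionary` and `encode_triples`, the CRC32-based placement, and the ID layout are all assumptions made for this example.

```python
import zlib


class PartitionedDictionary:
    """Hypothetical single-process stand-in for a distributed dictionary."""

    def __init__(self, num_partitions):
        self.num_partitions = num_partitions
        # One term->local index map and one reverse list per partition.
        self.term_to_local = [dict() for _ in range(num_partitions)]
        self.local_to_term = [list() for _ in range(num_partitions)]

    def _owner(self, term):
        # Assumption: a term's owning partition is chosen by a stable hash.
        return zlib.crc32(term.encode("utf-8")) % self.num_partitions

    def encode(self, term):
        p = self._owner(term)
        table = self.term_to_local[p]
        if term not in table:
            # First sighting: assign the next local index in partition p.
            table[term] = len(self.local_to_term[p])
            self.local_to_term[p].append(term)
        # Fold the partition number into the ID so decoding needs no lookup
        # to find the owner.
        return table[term] * self.num_partitions + p

    def decode(self, term_id):
        local, p = divmod(term_id, self.num_partitions)
        return self.local_to_term[p][local]


def encode_triples(triples, dictionary):
    """Replace every subject, predicate, and object string with its ID."""
    return [tuple(dictionary.encode(t) for t in triple) for triple in triples]


if __name__ == "__main__":
    d = PartitionedDictionary(num_partitions=4)
    triples = [
        ("<http://example.org/alice>", "<http://example.org/knows>", "<http://example.org/bob>"),
        ("<http://example.org/bob>", "<http://example.org/knows>", "<http://example.org/alice>"),
    ]
    encoded = encode_triples(triples, d)
    decoded = [tuple(d.decode(i) for i in t) for t in encoded]
    print(decoded == triples)
```

In a real distributed setting each partition would live on a different node, and the lookup inside `encode` would become a remote operation on the owning node; this sketch only shows the mapping itself.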
Pages: 250-253
Page count: 4