Statement Hypergraph as Partitioning Model for RDF Data Processing

被引:2
|
作者
Yuan, Pingpeng [1 ]
Zhang, Wenya [1 ]
Jin, Hai [1 ]
Wu, Buwen [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Wuhan 430074, Peoples R China
关键词
ARCHITECTURE;
D O I
10.1109/APSCC.2012.73
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The scale of RDF graph grows very rapidly. Managing huge scale RDF graph distributively is becoming increasingly important. Partitioning RDF graph is a vital pre-processing step for the goal. When applying graph partitioning algorithms developed over past decades to RDF graph represented using well known RDF model such as Directed Labeled Graphs, Bipartite Graph, the vertices which a triple depends on may be in different partitions. Such partitioning on the RDF models induces huge communication overhead during processing queries. We argue in this paper that there is need for a representation of RDF to enable the parallel and distributed computing application on RDF data. We propose statement hypergraph model which avoid this crucial deficiency of the graph model of RDF data. The proposed models reduce the decomposition problem to the well-known hypergraph partitioning problem. In the light of this model, we explore the cases like horizontal partitioning, vertical partitioning, grid partitioning, etc and evaluate their performance.
引用
收藏
页码:138 / 145
页数:8
相关论文
共 50 条
  • [1] System Π: A Native RDF Repository Based on the Hypergraph Representation for RDF Data Model
    Gang Wu
    Juan-Zi Li
    Jian-Qiang Hu
    Ke-Hong Wang
    [J]. Journal of Computer Science and Technology, 2009, 24 : 652 - 664
  • [2] System Ⅱ:A Native RDF Repository Based on the Hypergraph Representation for RDF Data Model
    吴刚
    李涓子
    胡建强
    王克宏
    [J]. Journal of Computer Science & Technology, 2009, 24 (04) : 652 - 664
  • [3] System Π: A Native RDF Repository Based on the Hypergraph Representation for RDF Data Model
    Wu, Gang
    Li, Juan-Zi
    Hu, Jian-Qiang
    Wang, Ke-Hong
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2009, 24 (04): : 652 - 664
  • [4] Linked Data Partitioning for RDF Processing on Apache Spark
    Atashkar, Amir Hossein
    Ghadiri, Nasser
    Joodaki, Mehdi
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2017, : 73 - 77
  • [5] Pre-processing of RDF data for METIS partitioning
    Benhamed, Siham
    Nait-Bahloul, Safia
    [J]. International Journal of Metadata, Semantics and Ontologies, 2023, 16 (02) : 152 - 171
  • [6] Efficient and Customizable Data Partitioning Framework for Distributed Big RDF Data Processing in the Cloud
    Lee, Kisung
    Liu, Ling
    Tang, Yuzhe
    Zhang, Qi
    Zhou, Yang
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 327 - 334
  • [7] Incremental Data Partitioning of RDF Data in SPARK
    Agathangelos, Giannis
    Troullinou, Georgia
    Kondylakis, Haridimos
    Stefanidis, Kostas
    Plexousakis, Dimitris
    [J]. SEMANTIC WEB: ESWC 2018 SATELLITE EVENTS, 2018, 11155 : 50 - 54
  • [8] Hypergraph Partitioning for Big Data Applications
    Yang, Wenyin
    Ma, Li
    Cui, Ruchun
    Wang, Guojun
    [J]. 2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 1705 - 1710
  • [9] RDF partitioning for scalable SPARQL query processing
    Wang, Xiaoyan
    Yang, Tao
    Chen, Jinchuan
    He, Long
    Du, Xiaoyong
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2015, 9 (06) : 919 - 933
  • [10] RDF partitioning for scalable SPARQL query processing
    Xiaoyan Wang
    Tao Yang
    Jinchuan Chen
    Long He
    Xiaoyong Du
    [J]. Frontiers of Computer Science, 2015, 9 : 919 - 933