Efficient Subgraph Matching on Large RDF Graphs Using MapReduce

被引:0
|
作者
Xin Wang
Lele Chai
Qiang Xu
Yajun Yang
Jianxin Li
Junhu Wang
Yunpeng Chai
机构
[1] Tianjin University,College of Intelligence and Computing
[2] Tianjin Key Laboratory of Cognitive Computing and Application,School of Information Technology
[3] Deakin University,School of Information and Communication Technology
[4] Griffith University,School of Information
[5] Renmin University of China,undefined
来源
关键词
Star decomposition; Subgraph matching; MapReduce; RDF graphs;
D O I
暂无
中图分类号
学科分类号
摘要
With the popularity of knowledge graphs growing rapidly, large amounts of RDF graphs have been released, which raises the need for addressing the challenge of distributed subgraph matching queries. In this paper, we propose an efficient distributed method to answer subgraph matching queries on big RDF graphs using MapReduce. In our method, query graphs are decomposed into a set of stars that utilize the semantic and structural information embedded RDF graphs as heuristics. Two optimization techniques are proposed to further improve the efficiency of our algorithms. One algorithm, called RDF property filtering, filters out invalid input data to reduce intermediate results; the other is to improve the query performance by postponing the Cartesian product operations. The extensive experiments on both synthetic and real-world datasets show that our method outperforms the close competitors S2X and SHARD by an order of magnitude on average.
引用
收藏
页码:24 / 43
页数:19
相关论文
共 50 条
  • [31] Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce
    Husain, Mohammad Farhan
    Doshi, Pankil
    Khan, Latifur
    Thuraisingham, Bhavani
    [J]. CLOUD COMPUTING, PROCEEDINGS, 2009, 5931 : 680 - 686
  • [32] An approach for approximate subgraph matching in fuzzy RDF graph
    Li, Guanfeng
    Yan, Li
    Ma, Zongmin
    [J]. FUZZY SETS AND SYSTEMS, 2019, 376 : 106 - 126
  • [33] Spatiotemporal RDF Data Query Based on Subgraph Matching
    Meng, Xiangfu
    Zhu, Lin
    Li, Qing
    Zhang, Xiaoyan
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (12)
  • [34] IRSMG: Accelerating inexact RDF subgraph matching on the GPU
    [J]. Zhang, Xiaowang (xiaowangzhang@tju.edu.cn), 1600, CEUR-WS (1690):
  • [35] GCSM: GPU-Accelerated Continuous Subgraph Matching for Large Graphs
    Wei, Yihua
    Jiang, Peng
    [J]. PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024, 2024, : 1046 - 1057
  • [36] Enhanced subgraph matching for large graphs using candidate region-based decomposition and ordering
    Ansari, Zubair Ali
    Parwez, Md Aslam
    Thoker, Irfan Rashid
    Jahiruddin
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)
  • [37] DualIso: An Algorithm for Subgraph Pattern Matching on Very Large Labeled Graphs
    Saltz, Matthew
    Jain, Ayushi
    Kothari, Abhishek
    Fard, Arash
    Miller, John A.
    Ramaswamy, Lakshmish
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 498 - 505
  • [38] Efficient Keyword Search on Graphs using MapReduce
    Hao, Yifan
    Cao, Huiping
    Qi, Yan
    Hu, Chuan
    Brahma, Sukumar
    Han, Jingyu
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2871 - 2873
  • [39] Sparsifying and Sampling of Large Graphs for Efficient Dense Subgraph Detection
    Cheng, Kai
    [J]. 2016 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2016, : 333 - 336
  • [40] Efficient continual cohesive subgraph search in large temporal graphs
    Yuan Li
    Jinsheng Liu
    Huiqun Zhao
    Jing Sun
    Yuhai Zhao
    Guoren Wang
    [J]. World Wide Web, 2021, 24 : 1483 - 1509