G-Diff: A Grouping Algorithm for RDF Change Detection on MapReduce

被引:1
|
作者
Ahn, Jinhyun [1 ,2 ]
Im, Dong-Hyuk [3 ]
Eom, Jae-Hong [1 ,2 ]
Zong, Nansu [1 ]
Kim, Hong-Gee [1 ,2 ]
机构
[1] Seoul Natl Univ, Biomed Knowledge Engn Lab, Seoul, South Korea
[2] Seoul Natl Univ, Dent Res Inst, Seoul, South Korea
[3] Hoseo Univ, Dept Comp & Informat Engn, Cheonan, South Korea
来源
关键词
D O I
10.1007/978-3-319-15615-6_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linked Data is a collection of RDF data that can grow exponentially and change over time. Detecting changes in RDF data is important to support Linked Data consuming applications with version management. Traditional approaches for change detection are not scalable. This has led researchers to devise algorithms on the MapReduce framework. Most works simply take a URI as a Map key. We observed that it is not efficient to handle RDF data with a large number of distinct URIs since many Reduce tasks have to be created. Even though the Reduce tasks are scheduled to run simultaneously, too many small Reduce tasks would increase the overall running time. In this paper, we propose G-Diff, an efficient MapReduce algorithm for RDF change detection. G-Diff groups triples by URIs during Map phase and sends the triples to a particular Reduce task rather than multiple Reduce tasks. Experiments on real datasets showed that the proposed approach takes less running time than previous works.
引用
下载
收藏
页码:230 / 235
页数:6
相关论文
共 50 条
  • [1] Similarity-based Change Detection for RDF in MapReduce
    Lee, Taewhi
    Im, Dong-Hyuk
    Won, Jongho
    PROMOTING BUSINESS ANALYTICS AND QUANTITATIVE MANAGEMENT OF TECHNOLOGY: 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2016), 2016, 91 : 789 - 797
  • [2] DTD-DIFF: A change detection algorithm for DTDs
    Leonardi, Erwin
    Hoai, Tran T.
    Bhowmick, Sourav S.
    Madria, Sanjay
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 817 - 827
  • [3] DTD-DIFF: A change detection algorithm for DTDs
    Leonardi, Erwin
    Hoai, Tran T.
    Bhowinick, Sourav S.
    Madria, Sanjay
    DATA & KNOWLEDGE ENGINEERING, 2007, 61 (02) : 384 - 402
  • [4] XS-Diff: XML schema change detection algorithm
    Baqasah, Abdullah
    Pardede, Eric
    Rahayu, Wenny
    Holubova, Irena
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2015, 11 (02) : 160 - 192
  • [5] X-Diff: An effective change detection algorithm for XML documents
    Wang, Y
    DeWitt, DJ
    Cai, JY
    19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 519 - 530
  • [6] CX-DIFF: a change detection algorithm for XML content and change visualization for WebVigiL
    Jacob, J
    Sachde, A
    Chakravarthy, S
    DATA & KNOWLEDGE ENGINEERING, 2005, 52 (02) : 209 - 230
  • [7] CX-DIFF: A change detection algorithm for XML content and change presentation issues for WebVigiL
    Jacob, J
    Sachde, A
    Chakravarthy, S
    CONCEPTUAL MODELING FOR NOVEL APPLICATION DOMAINS, PROCEEDINGS, 2003, 2814 : 273 - 284
  • [8] KF-Diff+: Highly efficient change detection algorithm for XML documents
    Xu, HY
    Wu, QY
    Wang, HM
    Yang, GG
    Jia, Y
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2002: COOPLS, DOA, AND ODBASE, 2002, 2519 : 1273 - 1286
  • [9] SX-Diff: A change detection algorithm for multi-version XML documents
    Li, Min
    Wang, Yuanzhen
    Li, Guiling
    2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES: ITESS 2008, VOL 2, 2008, : 480 - 486
  • [10] X-tree Diff+: Efficient change detection algorithm in XML documents
    Lee, Suk Kyoon
    Kim, Dong Ah
    EMBEDDED AND UBIQUITOUS COMPUTING, PROCEEDINGS, 2006, 4096 : 1037 - 1046