XANDY: A scalable change detection technique for ordered XML documents using relational databases

被引:9
|
作者
Leonardi, Erwin [1 ]
Bhowmick, Sourav S. [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
XML; change detection; RDBMS; schema-unconscious approach; performance; result quality;
D O I
10.1016/j.datak.2005.06.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous work in change detection to XML documents is not suitable for detecting the changes to large XML documents as it requires a lot of memory to keep the two versions of XML documents in the memory. In this article, we take a more conservative yet novel approach of using traditional relational database engines for detecting the changes to large ordered XML documents. To this end, we have implemented a prototype system called XANDY that converts XML documents into relational tuples and detects the changes from these tuples by using SQL queries. Our experimental results show that the relational-based approach has better scalability compared to published algorithm like X-Diff. It has comparable efficiency and result quality compared to X-Diff in some cases. Our experimental results also show that, generally, XANDY has better result quality than XyDiff. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:476 / 507
页数:32
相关论文
共 50 条
  • [41] X-Diff: An effective change detection algorithm for XML documents
    Wang, Y
    DeWitt, DJ
    Cai, JY
    [J]. 19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 519 - 530
  • [42] A Reversible Hiding Technique Using LSB Matching for Relational Databases
    Hwang, Min-Shiang
    Xie, Ming-Ru
    Wu, Chia-Chun
    [J]. INFORMATICA, 2020, 31 (03) : 481 - 497
  • [43] Mapping XML data to relational databases using a graph-based clustering mechanism
    Che, D
    Hou, WC
    [J]. IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003, : 239 - 245
  • [44] Generating nested XML documents from unnormalized relational views using a statistically approach
    Nasser, Mohammed
    Ibrahim, Hamidah
    Mamat, Ali
    Sulaiman, Nasir
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 843 - 848
  • [45] Schema-less, semantics-based change detection for XML documents
    Zhang, SH
    Dyreson, C
    Snodgrass, RT
    [J]. WEB INFORMATION SYSTEMS - WISE 2004, PROCEEDINGS, 2004, 3306 : 279 - 290
  • [46] A dataflow approach to efficient change detection of HTML']HTML/XML documents in WebVigiL
    Sanka, Anoop
    Chamakura, Shravan
    Chakravarthy, Sharma
    [J]. COMPUTER NETWORKS, 2006, 50 (10) : 1547 - 1563
  • [47] KF-Diff+: Highly efficient change detection algorithm for XML documents
    Xu, HY
    Wu, QY
    Wang, HM
    Yang, GG
    Jia, Y
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2002: COOPLS, DOA, AND ODBASE, 2002, 2519 : 1273 - 1286
  • [48] SX-Diff: A change detection algorithm for multi-version XML documents
    Li, Min
    Wang, Yuanzhen
    Li, Guiling
    [J]. 2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES: ITESS 2008, VOL 2, 2008, : 480 - 486
  • [49] WebVigiL: User profile-based change detection for HTML']HTML/XML documents
    Pandrangi, N
    Jacob, J
    Sanka, A
    Chakravarthy, S
    [J]. NEW HORIZONS IN INFORMATION MANAGEMENT, 2003, 2712 : 38 - 57
  • [50] X-tree Diff+: Efficient change detection algorithm in XML documents
    Lee, Suk Kyoon
    Kim, Dong Ah
    [J]. EMBEDDED AND UBIQUITOUS COMPUTING, PROCEEDINGS, 2006, 4096 : 1037 - 1046