A change detection system for unordered XML data using a relational model

被引:8
|
作者
Sundaram, Sathya [1 ]
Madria, Sanjay K. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Comp Sci, Rolla, MO 65409 USA
关键词
XML; Change detection; Edit distance; Tree comparison; SQL; ALGORITHM;
D O I
10.1016/j.datak.2011.11.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The dramatic increase in the evolution of XML data available on the Internet requires a change detection system to keep track of important changes occurring during their life time. In this paper, we introduce a novel approach of detecting changes between two versions of unordered XML data stored in a traditional relational database using approaches like XRel. Most of the existing work in the area of XML change detection is mainly focused on detecting changes between two versions of XML data by constructing their Document Object Model (DOM) trees and then comparing these two tree structures based on Longest Common Sequence (LCS) using minimum edit distances. The basic tree comparison approach is not efficient in handling large XML files due to the fact that (1) an equivalent XML DOM tree will be twice as large as the original document and (2) the entire trees of both versions have to be memory resident during the comparison process. These two issues are constrained by the available main memory. In addition, existing approaches fail to detect changes among versions of XML data stored in relational databases as reverse mapping is not loss-less. We propose an efficient algorithm (XRel_Change_SQL) for detecting unordered changes between two XML data files stored in XRel as the underlying relational data model, using Structured Query Language (SQL). We compare the efficiency and quality of our change detection algorithm with existing XML change detection tools like X-Diff, DeltaXML and XANDY. We provide an experimental evaluation of the results obtained from the benchmark datasets as well as some synthetic datasets to show that our approach is highly scalable, and results in a much better efficiency and delta quality than the aforementioned approaches and tools. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:257 / 284
页数:28
相关论文
共 50 条
  • [31] Triggers over XML views of relational data
    Shao, F
    Novak, A
    Shanmugasundaram, J
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 483 - 484
  • [32] Querying and Browsing XML and Relational Data Sources
    Lu, James J.
    Huang, Chia-Hsin
    Chuang, Tyng-Ruey
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 489 - +
  • [33] Using XML in relational database applications
    Malaika, S
    15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 167 - 167
  • [34] Exporting relational data into a native XML store
    Pokorny, Jaroslav
    Reschke, Jakub
    ADVANCES IN INFORMATION SYSTEMS DEVELOPMENT, VOL 1 AND 2: BRIDGING THE GAP BETWEEN ACADEMIA AND INDUSTRY, 2006, : 807 - +
  • [35] Research on store XML data in relational database based on XML schema
    Yi-Wu, Xie
    Chen-Yang, Wang
    Zhi-Ying, Cao
    Yan, Chen
    2007 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING WORKSHOPS, PROCEEDINGS, 2007, : 1001 - +
  • [36] Using relational database to build OWL ontology from XML data sources
    Xu, Jiuyun
    Li, Weichong
    CIS WORKSHOPS 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY WORKSHOPS, 2007, : 124 - +
  • [37] Converting probabilistic relational data to probabilistic XML data tree
    Wang J.
    Hao Z.
    Information Technology Journal, 2010, 9 (08) : 1706 - 1712
  • [38] Fuzzy XML data modeling with the UML and relational data models
    Ma, Z. M.
    Yan, Li
    DATA & KNOWLEDGE ENGINEERING, 2007, 63 (03) : 972 - 996
  • [39] On Change Detection of XML Schemas
    Baqasah, Abdullah
    Pardede, Eric
    Rahayu, Wenny
    Holubova , Irena
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 974 - 982
  • [40] A novel system for publishing relational data in XML Based on ER-DTD method
    Liu, GH
    He, Q
    Chen, ZY
    Wang, Y
    COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2003, : 7 - 10