Incremental validation of XML documents

Cited by: 51
Authors
Balmin, A
Papakonstantinou, Y
Vianu, V
Affiliations
[1] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
[2] Univ Calif San Diego, San Diego, CA 92103 USA
Source
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2004 / Vol. 29 / No. 4
Keywords
update; validation; XML; algorithms; experimentation;
DOI
10.1145/1042046.1042050
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We investigate the incremental validation of XML documents with respect to DTDs, specialized DTDs, and XML Schemas, under updates consisting of element tag renamings, insertions, and deletions. DTDs are modeled as extended context-free grammars. "Specialized DTDs" allow the decoupling of element types from element tags. XML Schemas are abstracted as specialized DTDs with limitations on the type assignment. For DTDs and XML Schemas, we exhibit an O(m log n) incremental validation algorithm using an auxiliary structure of size O(n), where n is the size of the document and m the number of updates. The algorithm does not handle the incremental validation of XML Schemas with respect to renaming of internal nodes, which is handled by the incremental validation algorithm for specialized DTDs. For specialized DTDs, we provide an O(m log² n) incremental algorithm, again using an auxiliary structure of size O(n). This is a significant improvement over brute-force re-validation from scratch. We exhibit a restricted class of DTDs called local that arise commonly in practice and for which incremental validation can be done in practically constant time by maintaining only a list of counters. We present implementations of both general incremental validation and local validation on an XML database built on top of a relational database. Our experimentation includes a study of the applicability of local validation in practice, results on the calibration of parameters of the auxiliary data structure, and results on the performance comparison between the general incremental validation technique, the local validation technique, and brute-force validation from scratch.
Pages: 710-751
Page count: 42
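
Illustrative sketch (not from the paper): the abstract models DTDs as extended context-free grammars, in which each element tag has a regular expression over the tags of its children. The Python below is a minimal sketch of that model under stated assumptions. The `Node` class, the toy `DTD` table, the single-letter tags, and both function names are hypothetical. `validate` is the brute-force check from scratch. `revalidate_after_rename` only shows why a tag renaming under a plain DTD touches two content-model checks (the renamed node and its parent); it rescans the parent's whole child sequence, so it is not the O(m log n) algorithm of the paper, which relies on an auxiliary structure not reproduced here.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """Hypothetical document node: a tag plus an ordered list of children."""
    tag: str
    children: List["Node"] = field(default_factory=list)


# Hypothetical toy DTD: tag -> regular expression over child tags.
# Tags are single letters so that a child sequence is simply a string.
DTD: Dict[str, str] = {
    "r": "a*",   # root: any number of a elements
    "a": "bc?",  # a: one b, optionally followed by one c
    "b": "",     # b: no children
    "c": "",     # c: no children
}


def valid_node(node: Node, dtd: Dict[str, str]) -> bool:
    """Check one node: its child-tag word must match its tag's content model."""
    rule = dtd.get(node.tag)
    if rule is None:  # tag not declared in the DTD
        return False
    word = "".join(child.tag for child in node.children)
    return re.fullmatch(rule, word) is not None


def validate(node: Node, dtd: Dict[str, str]) -> bool:
    """Brute-force validation from scratch: check every node in the subtree."""
    return valid_node(node, dtd) and all(validate(c, dtd) for c in node.children)


def revalidate_after_rename(parent: Optional[Node], node: Node,
                            new_tag: str, dtd: Dict[str, str]) -> bool:
    """Rename `node`, then recheck only the two affected content models.

    Assumes the rest of the document already satisfies the DTD. The parent
    check still rescans all siblings, so this is linear in the parent's
    fan-out rather than logarithmic in the document size.
    """
    node.tag = new_tag
    parent_ok = parent is None or valid_node(parent, dtd)
    return valid_node(node, dtd) and parent_ok
```

For example, in a document `r(a(b, c), a(b))`, renaming the `c` node to `b` makes the first `a` element's child word `bb`, which no longer matches `bc?`, so the local recheck reports the document as invalid without visiting the second `a` subtree.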