Efficient memory representation of XML documents

被引:0
|
作者
Busatto, G [1 ]
Lohrey, M
Maneth, S
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Informat, D-2900 Oldenburg, Germany
[2] Univ Stuttgart, FMI, D-7000 Stuttgart, Germany
来源
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Implementations that load XML documents and give access to them via, e.g., the DOM, suffer from huge memory demands: the space needed to load an XML document is usually many times larger than the size of the document. A considerable amount of memory is needed to store the tree structure of the XML document. Here a technique is presented that allows to represent the tree structure of an XML document in an efficient way. The representation exploits the high regularity in XML documents by "compressing" their tree structure; the latter means to detect and remove repetitions of tree patterns. The functionality of basic tree operations, like traversal along edges, is preserved in the compressed representation. This allows to directly execute queries (and in particular, bulk operations) without prior decompression. For certain tasks like validation against an XML type or checking equality of documents, the representation allows for provably more efficient algorithms than those running on conventional representations.
引用
收藏
页码:199 / 216
页数:18
相关论文
共 50 条
  • [31] Developing an efficient query system for encrypted XML documents
    Chang, Tao-Ku
    Hwang, Gwan-Hwan
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2011, 84 (08) : 1292 - 1305
  • [32] XXS: Efficient XPath Evaluation on Compressed XML Documents
    Brisaboa, Nieves R.
    Cerdeira-Pena, Ana
    Navarro, Gonzalo
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2014, 32 (03)
  • [33] Efficient complex query support for multiversion XML documents
    Chien, SY
    Tsotras, VJ
    Zaniolo, C
    Zhang, DH
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT 2002, 2002, 2287 : 161 - 178
  • [34] Indexing XML documents for XPath query processing in external memory
    Chen, Qun
    Lim, Andrew
    Ong, Kian Win
    Tang, Jiqing
    [J]. DATA & KNOWLEDGE ENGINEERING, 2006, 59 (03) : 681 - 699
  • [35] An Efficient Classification of Fuzzy XML Documents Based on Kernel ELM
    Zhao, Zhen
    Ma, Zongmin
    Yan, Li
    [J]. INFORMATION SYSTEMS FRONTIERS, 2021, 23 (03) : 515 - 530
  • [36] Efficient schema extraction from a large collection of XML documents
    Xing, Guangming
    Parthepan, Vijayeandra
    [J]. PROCEEDINGS OF THE 49TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE (ACMSE '11), 2011, : 92 - 96
  • [37] An efficient similarity-based approach for comparing XML documents
    Oliveira, Alessandreia
    Tessarolli, Gabriel
    Ghiotto, Gleiph
    Pinto, Bruno
    Campello, Fernando
    Marques, Matheus
    Oliveira, Carlos
    Rodrigues, Igor
    Kalinowski, Marcos
    Souza, Ueverton
    Murta, Leonardo
    Braganholo, Vanessa
    [J]. INFORMATION SYSTEMS, 2018, 78 : 40 - 57
  • [38] An Efficient Classification of Fuzzy XML Documents Based on Kernel ELM
    Zhen Zhao
    Zongmin Ma
    Li Yan
    [J]. Information Systems Frontiers, 2021, 23 : 515 - 530
  • [39] Efficient incremental validation of XML documents after composite updates
    Barbosa, Denilson
    Leighton, Gregory
    Smith, Andrew
    [J]. DATABASE AND XML TECHNOLOGIES, PROCEEDINGS, 2006, 4156 : 107 - 121
  • [40] Extracting global policies for efficient access control of XML documents
    Iwaihara, M
    Wang, B
    Chatvichienchai, S
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 161 - 174