Efficient memory representation of XML documents

被引:0
|
作者
Busatto, G [1 ]
Lohrey, M
Maneth, S
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Informat, D-2900 Oldenburg, Germany
[2] Univ Stuttgart, FMI, D-7000 Stuttgart, Germany
来源
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Implementations that load XML documents and give access to them via, e.g., the DOM, suffer from huge memory demands: the space needed to load an XML document is usually many times larger than the size of the document. A considerable amount of memory is needed to store the tree structure of the XML document. Here a technique is presented that allows to represent the tree structure of an XML document in an efficient way. The representation exploits the high regularity in XML documents by "compressing" their tree structure; the latter means to detect and remove repetitions of tree patterns. The functionality of basic tree operations, like traversal along edges, is preserved in the compressed representation. This allows to directly execute queries (and in particular, bulk operations) without prior decompression. For certain tasks like validation against an XML type or checking equality of documents, the representation allows for provably more efficient algorithms than those running on conventional representations.
引用
收藏
页码:199 / 216
页数:18
相关论文
共 50 条
  • [1] Virtual DOM: an efficient virtual memory representation for large XML documents
    Psaila, Giuseppe
    [J]. DEXA 2008: 19TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, : 233 - 237
  • [2] Grove Data Model for Efficient Representation of XML Documents
    Anwar, Yasmin
    Kamel, Amr
    Ahmed, Aziza Saad
    [J]. WOCN: 2009 IFIP INTERNATIONAL CONFERENCE ON WIRELESS AND OPTICAL COMMUNICATIONS NETWORKS, 2009, : 99 - +
  • [3] Efficient memory representation of XML document trees
    Busatto, Giorgio
    Lohrey, Markus
    Maneth, Sebastian
    [J]. INFORMATION SYSTEMS, 2008, 33 (4-5) : 456 - 474
  • [4] A typeful and tagless representation for XML documents
    Zhu, DP
    Xi, HW
    [J]. PROGRAMMING LANGUAGES AND SYSTEMS, PROCEEDINGS, 2003, 2895 : 89 - 104
  • [5] Efficient revalidation of XML documents
    Raghavachari, Mukund
    Shmueli, Oded
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (04) : 554 - 567
  • [6] A Complete Path Representation Method with a Modified Inverted Index for Efficient Retrieval of XML Documents
    Chang, Hsu-Kuang
    Hung, King-Chu
    Jou, I-Chang
    [J]. WSEAS Transactions on Computers, 2011, 10 (10): : 321 - 331
  • [7] Efficient incremental validation of XML documents
    Barbosa, D
    Mendelzon, AO
    Libkin, L
    Mignet, L
    Arenas, M
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 671 - 682
  • [8] Efficient Change Control of XML Documents
    Roennau, Sebastian
    Philipp, Geraint
    Borghoff, Uwe M.
    [J]. DOCENG'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2009, : 3 - 12
  • [9] Efficient extraction of schemas for XML documents
    Min, JK
    Ahn, JY
    Chung, CW
    [J]. INFORMATION PROCESSING LETTERS, 2003, 85 (01) : 7 - 12
  • [10] Efficient fragmentation of large XML documents
    Bonifati, Angela
    Cuzzocrea, Alfredo
    [J]. Database and Expert Systems Applications, Proceedings, 2007, 4653 : 539 - 550