Temporal XML: modeling, indexing, and query processing

被引:46
|
作者
Rizzolo, Flavio [1 ]
Vaisman, Alejandro A. [2 ,3 ]
机构
[1] Univ Toronto, Dept Comp Sci, Bahen Ctr Informat Technol, Toronto, ON M5S 2E4, Canada
[2] Univ Buenos Aires, Buenos Aires, DF, Argentina
[3] Univ Chile, Santiago, Chile
来源
VLDB JOURNAL | 2008年 / 17卷 / 05期
关键词
XML; temporal databases; semistructured data; structural summaries; XPath;
D O I
10.1007/s00778-007-0058-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we address the problem of modeling and implementing temporal data in XML. We propose a data model for tracking historical information in an XML document and for recovering the state of the document as of any given time. We study the temporal constraints imposed by the data model, and present algorithms for validating a temporal XML document against these constraints, along with methods for fixing inconsistent documents. In addition, we discuss different ways of mapping the abstract representation into a temporal XML document, and introduce TXPath, a temporal XML query language that extends XPath 2.0. In the second part of the paper, we present our approach for summarizing and indexing temporal XML documents. In particular we show that by indexing continuous paths, i.e., paths that are valid continuously during a certain interval in a temporal XML graph, we can dramatically increase query performance. To achieve this, we introduce a new class of summaries, denoted TSummary, that adds the time dimension to the well-known path summarization schemes. Within this framework, we present two new summaries: LCP and Interval summaries. The indexing scheme, denoted TempIndex, integrates these summaries with additional data structures. We give a query processing strategy based on TempIndex and a type of ancestor-descendant encoding, denoted temporal interval encoding. We present a persistent implementation of TempIndex, and a comparison against a system based on a non-temporal path index, and one based on DOM. Finally, we sketch a language for updates, and show that the cost of updating the index is compatible with real-world requirements.
引用
收藏
页码:1179 / 1212
页数:34
相关论文
共 50 条
  • [1] Temporal XML: modeling, indexing, and query processing
    Flavio Rizzolo
    Alejandro A. Vaisman
    [J]. The VLDB Journal, 2008, 17 : 1179 - 1212
  • [2] An Efficient Indexing and Compressing Scheme for XML Query Processing
    Liao, I-En
    Hsu, Wen-Chiao
    Chen, Yu-Lin
    [J]. NETWORKED DIGITAL TECHNOLOGIES, PT 1, 2010, 87 : 70 - 84
  • [3] Indexing useful structural patterns for XML query processing
    Lian, W
    Mamoulis, N
    Cheung, DWL
    Yiu, SM
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (07) : 997 - 1009
  • [4] Indexing XML documents for XPath query processing in external memory
    Chen, Qun
    Lim, Andrew
    Ong, Kian Win
    Tang, Jiqing
    [J]. DATA & KNOWLEDGE ENGINEERING, 2006, 59 (03) : 681 - 699
  • [5] Indexing techniques for query of XML documents
    Wang, Y
    Sun, JL
    Dong, JX
    [J]. COMPUTER SCIENCE AND TECHNOLOGY IN NEW CENTURY, 2001, : 581 - 584
  • [6] XIQS: An XML indexing and query system
    Venkatesh, S
    Hu, GZ
    [J]. COMPUTERS AND THEIR APPLICATIONS, 2003, : 180 - 183
  • [7] Optimization of Query Processing in XML Document using TAR and Path Based Indexing
    Karthiga, D.
    Gunasekaran, S.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2013, 13 (10): : 119 - 127
  • [8] XML query processing
    Florescu, D
    Kossmann, D
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 874 - 874
  • [9] Practical indexing XML document for twig query
    Wang, HZ
    Wang, W
    Li, JZ
    Lin, XM
    Wong, R
    [J]. ADVANCES IN COMPUTER SCIENCE - ASIAN 2005, PROCEEDINGS: DATA MANAGEMENT ON THE WEB, 2005, 3818 : 208 - 222
  • [10] XML query processing and optimization
    Zhang, N
    [J]. CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2004 WORKSHOPS, PROCEEDINGS, 2004, 3268 : 121 - 132