Compressing XML documents using recursive finite state automata

Cited by: 0
Authors
Subramanian, H [1 ]
Shankar, P [1 ]
Affiliation
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
We propose a scheme for automatically generating compressors for XML documents from Document Type Definition (DTD) specifications. Our algorithm is lossless and adaptive: the model used for compression and decompression is generated automatically from the DTD and is used in conjunction with an arithmetic compressor to produce a compressed version of the document. The structure of the model mirrors the syntactic specification of the document. Our compression scheme is on-line, that is, it can compress the document as it is being read. We have implemented the compressor generator and report the results of experiments on some large XML databases whose DTDs are specified. We note that the average compression is better than that of XMLPPM, the only other on-line tool we are aware of. The tool is able to compress massive documents on which XMLPPM failed because it ran out of memory. We believe the main appeal of this technique is that the underlying model is so simple and yet so effective.
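As a rough illustration of the idea described in the abstract, and not the authors' implementation, the sketch below shows how an automaton derived from a DTD content model could drive an adaptive probability model. The DTD fragment, the state names, the `TRANSITIONS` table, and the `code_length_bits` function are all hypothetical. Instead of a real arithmetic coder, the sketch only sums the ideal code length, -log2(p), that an arithmetic coder would approach. It also handles a single element, whereas a recursive scheme would keep one such automaton per element type.

```python
import math

# Hypothetical DTD fragment: <!ELEMENT book (title, author+, chapter*)>
# Hand-built automaton for that content model (illustrative only).
# Each state maps an allowed next symbol to the state it leads to.
TRANSITIONS = {
    "start":   {"title": "title"},
    "title":   {"author": "author"},
    "author":  {"author": "author", "chapter": "chapter", "</book>": "end"},
    "chapter": {"chapter": "chapter", "</book>": "end"},
}

def code_length_bits(children):
    """Ideal adaptive code length, in bits, for the children of one <book>."""
    # One frequency table per automaton state, initialised to 1 per choice.
    counts = {s: {sym: 1 for sym in out} for s, out in TRANSITIONS.items()}
    state, bits = "start", 0.0
    for sym in list(children) + ["</book>"]:
        options = counts[state]
        if sym not in options:
            raise ValueError(f"{sym!r} is not allowed in state {state!r}")
        total = sum(options.values())
        # Cost of coding sym with the current estimate for this state.
        bits += -math.log2(options[sym] / total)
        options[sym] += 1  # adaptive update; a decoder would do the same
        state = TRANSITIONS[state][sym]
    return bits

if __name__ == "__main__":
    print(code_length_bits(["title", "author", "author", "author"]))
```

In this sketch, states with a single outgoing transition (here `start` and `title`) cost zero bits, because the DTD already determines the next symbol. Only genuine choices, such as whether another `author` follows, consume code space.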
Pages: 282 - 293
Page count: 12
Related papers
50 in total
  • [21] Turkish lexicon expansion by using finite state automata
    Ozturk, Burak
    Can, Burcu
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1012 - 1027
  • [22] Efficient instruction scheduling using finite state automata
    Bala, V
    Rubin, N
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1997, 25 (02) : 53 - 82
  • [23] An improved algorithm for XML routing based on finite automata
    Chen J.
    Zou Z.
    Pan J.
    Zhai L.
    [J]. Jiangsu Daxue Xuebao (Ziran Kexue Ban)/Journal of Jiangsu University (Natural Science Edition), 2010, 31 (06) : 705 - 709
  • [24] Wavelet-based gender detection on off-line handwritten documents using probabilistic finite state automata
    Akbari, Younes
    Nouri, Kazem
    Sadri, Javad
    Djeddi, Chawki
    Siddiqi, Imran
    [J]. IMAGE AND VISION COMPUTING, 2017, 59 : 17 - 30
  • [25] Validating XML Constraints Using Automata
    Tan, Zijing
    [J]. PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 1205 - 1210
  • [26] Robustness of finite state automata
    Megretski, A
    [J]. MULTIDISCIPLINARY RESEARCH IN CONTROL, 2003, 289 : 147 - 160
  • [27] Residual finite state automata
    Denis, F
    Lemay, A
    Terlutte, A
    [J]. FUNDAMENTA INFORMATICAE, 2002, 51 (04) : 339 - 368
  • [28] Transforming XML Documents using fxt
    Berlea, Alexandru
    Seidl, Helmut
    [J]. Journal of Computing and Information Technology, 2002, 10 (01) : 19 - 35
  • [29] Rendering XML documents using XSL
    [J]. Dr Dobb's J Software Tools Prof Program, 7 (82):
  • [30] Rendering XML documents using XSL
    McGrath, S
    [J]. DR DOBBS JOURNAL, 1998, 23 (07) : 82 - +