Schema-Based Compression of XML Data with Relax NG

被引:6
|
作者
League, Christopher [1 ]
Eng, Kenjone [1 ]
机构
[1] Long Isl Univ, Comp Sci, Brooklyn, NY 11201 USA
关键词
XML; data compression; tree compression; Relax NG; compact binary formats;
D O I
10.4304/jcp.2.10.9-17
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The extensible markup language XML has become indispensable in many areas, but a significant disadvantage is its size: tagging a set of data increases the space needed to store it, the bandwidth needed to transmit it, and the time needed to parse it. We present a new compression technique based on the document type, expressed as a Relax NG schema. Assuming the sender and receiver agree in advance on the document type, conforming documents can be transmitted extremely compactly. On several data sets with high tag density this technique compresses better than other known XML-aware compressors, including those that consider the document type.
引用
收藏
页码:9 / 17
页数:9
相关论文
共 50 条
  • [1] An XML Schema-Based Data Integration
    Ran, Chong-Shan
    Wang, Ma-Chuan
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 7, 2010, : 100 - 102
  • [2] An XML schema-based semantic data integration
    Kim, Dongkwang
    Jeong, Karpjoo
    Shin, Hyoseop
    Hwang, Suntae
    GCC 2005: FIFTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2006, : 522 - +
  • [3] Inferring a Relax NG Schema from XML Documents
    Kim, Guen-Hae
    Ko, Sang-Ki
    Han, Yo-Sub
    LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS, LATA 2016, 2016, 9618 : 400 - 411
  • [4] An efficient schema-based technique for querying XML data
    Kha, DD
    Yoshikawa, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (04): : 1480 - 1489
  • [5] Efficient schema-based revalidation of XML
    Raghavachari, M
    Shmueli, O
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2004, PROCEEDINGS, 2004, 2992 : 639 - 657
  • [6] Xebu: A binary format with schema-based optimizations for XML data
    Kangasharju, J
    Tarkoma, S
    Lindholm, T
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 528 - 535
  • [7] A schema-based XML index structure
    College of Computer Science, Chongqing University, Chongqing 400044, China
    Jisuanji Gongcheng, 2006, 18 (64-66):
  • [8] Efficient schema-based XML-to-relational data mapping
    Atay, Mustafa
    Chebotko, Artem
    Liu, Dapeng
    Lu, Shiyong
    Fotouhi, Farshad
    INFORMATION SYSTEMS, 2007, 32 (03) : 458 - 476
  • [9] Schema-based Constrained XML data indexing and storage Technique
    Chen, Xuebin
    Duan, Guolin
    Yan, Hongcan
    Zhang, Shufen
    Che, Yuee
    2009 INTERNATIONAL CONFERENCE ON NEW TRENDS IN INFORMATION AND SERVICE SCIENCE (NISS 2009), VOLS 1 AND 2, 2009, : 973 - +
  • [10] Schema-Based Independence Analysis for XML Updates
    Benedikt, Michael
    Cheney, James
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (01): : 61 - 72