Schema-Based Compression of XML Data with Relax NG

被引:6
|
作者
League, Christopher [1 ]
Eng, Kenjone [1 ]
机构
[1] Long Isl Univ, Comp Sci, Brooklyn, NY 11201 USA
关键词
XML; data compression; tree compression; Relax NG; compact binary formats;
D O I
10.4304/jcp.2.10.9-17
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The extensible markup language XML has become indispensable in many areas, but a significant disadvantage is its size: tagging a set of data increases the space needed to store it, the bandwidth needed to transmit it, and the time needed to parse it. We present a new compression technique based on the document type, expressed as a Relax NG schema. Assuming the sender and receiver agree in advance on the document type, conforming documents can be transmitted extremely compactly. On several data sets with high tag density this technique compresses better than other known XML-aware compressors, including those that consider the document type.
引用
收藏
页码:9 / 17
页数:9
相关论文
共 50 条
  • [21] Schema-Based Debugging of Federated Data Sources
    Nolle, Andreas
    Meilicke, Christian
    Chekol, Melisachew Wudage
    Nemirovski, German
    Stuckenschmidt, Heiner
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 381 - 389
  • [22] An Efficient Access Control Model for Schema-Based Relational Storage of XML Documents
    Patel, Jigishaben
    Atay, Mustafa
    PROCEEDINGS OF THE 49TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE (ACMSE '11), 2011, : 97 - 102
  • [23] A Schema-Based Approach to Enable Data Integration on the Fly
    Nicklas, Daniela
    Schwarz, Thomas
    Mitschang, Bernhard
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2017, 26 (01)
  • [24] Schema-based Web wrapping
    Bettina Fazzinga
    Sergio Flesca
    Andrea Tagarelli
    Knowledge and Information Systems, 2011, 26 : 127 - 173
  • [25] Schema-based Web wrapping
    Fazzinga, Bettina
    Flesca, Sergio
    Tagarelli, Andrea
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 26 (01) : 127 - 173
  • [26] Schema-Based Automata Determinization
    Niehren, Joachim
    Sakho, Momar
    Al Serhali, Antonio
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2022, (370): : 49 - 65
  • [27] ROLAP Based Data Warehouse Schema to XML Schema Conversion
    Sen, Soumya
    Cortesi, Agostino
    Chaki, Nabendu
    PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2016, : 1736 - 1741
  • [28] Automated database and schema-based data interchange for modeling and simulation
    Harrison, GA
    Maynard, DS
    Pollak, E
    PROCEEDINGS OF THE 2004 WINTER SIMULATION CONFERENCE, VOLS 1 AND 2, 2004, : 191 - 197
  • [29] A mapping scheme of XML documents into relational databases using schema-based path identifiers
    Fujimoto, K
    Shimizu, T
    Kha, D
    Yoshikawa, M
    Amagasa, T
    INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 82 - 90
  • [30] Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data
    Papadakis, George
    Alexiou, George
    Papastefanatos, George
    Koutrika, Georgia
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 9 (04): : 312 - 323