A GML Compression Approach Based on On-line Semantic Clustering

被引:0
|
作者
Wei, Qingting [1 ]
Guan, Jihong [1 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
关键词
GML compression; semantic similarity; clustering; delta encoding;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Geography Markup Language (GML) has become a de facto international encoding standard for exchanging geospatial data among heterogeneous Geographic Information Systems (GIS). Whereas, structurally redundant tags and textual data representation usually inflate the sizes of GML documents substantially, which makes the storage and transport costly. In this paper, we propose an effective compression approach based on on-line semantic clustering of GML documents. The approach deals with a GML document under compression on the fly via separating data from structures, clustering data based on the semantic similarities exploited from tags and texts, dictionary-encoding structures and delta-encoding geometric coordinate data before the general text compression on back end. We conduct extensive experiments on real GML documents to evaluate the performance of the proposed approach. Results show that our approach outperforms the most popular general text compressor gzip, the acknowledged best XML compressor XMill, and the first and up to now the only GML compressor GPress in compression ratio.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] A model-based reinforcement learning approach using on-line clustering
    Tziortziotis, Nikolaos
    Blekas, Konstantinos
    [J]. 2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 712 - 718
  • [2] On-line clustering
    Bouguettaya, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (02) : 333 - 339
  • [3] On-line Semantic Mapping
    Bastianelli, E.
    Bloisi, D. D.
    Capobianco, R.
    Cossu, F.
    Gemignani, G.
    Iocchi, L.
    Nardi, D.
    [J]. 2013 16TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2013,
  • [4] A New and Effective Approach to GML Documents Compression
    Wei, Qingting
    Guan, Jihong
    Zhou, Shuigeng
    Wang, Xin
    [J]. COMPUTER JOURNAL, 2014, 57 (11): : 1723 - 1740
  • [5] On-line hierarchical clustering
    El-Sonbaty, Y
    Ismail, MA
    [J]. PATTERN RECOGNITION LETTERS, 1998, 19 (14) : 1285 - 1291
  • [6] Signal compression approach based on lifting scheme algorithm for an on-line vibration monitoring system
    Wang, Y
    Huang, TS
    Li, G
    Sun, D
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 240 - 243
  • [7] Parsing GML data based on integrative GML syntactic and semantic schemas database
    Miao, Lizhi
    Zhang, Shuliang
    Lu, Guonian
    Gao, Xiaoli
    Jiao, Donglai
    Gan, Jiayan
    [J]. GEOINFORMATICS 2007: GEOSPATIAL INFORMATION SCIENCE, PTS 1 AND 2, 2007, 6753
  • [8] Dynamic on-line clustering and state extraction: An approach to symbolic learning
    Das, S
    Mozer, M
    [J]. NEURAL NETWORKS, 1998, 11 (01) : 53 - 64
  • [9] Document clustering based on semantic smoothing approach
    Liu, Yubao
    Cai, Jiarong
    Yin, Jian
    Huang, Zhilan
    [J]. ADVANCES IN INTELLIGENT WEB MASTERING, 2007, 43 : 217 - +
  • [10] A novelty-based clustering method for on-line documents
    Khy, Sophoin
    Ishikawa, Yoshiharu
    Kitagawa, Hiroyuki
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2008, 11 (01): : 1 - 37