Clustering-based compression for raster time series

被引:0
|
作者
Munoz, Martita [1 ,2 ,3 ]
Fuentes-Sepulveda, Jose [1 ,2 ]
Hernandez, Cecilia [1 ,3 ]
Navarro, Gonzalo [2 ,4 ]
Seco, Diego [5 ]
Silva-Coira, Fernando [5 ]
机构
[1] Univ Concepcion, Dept Comp Sci, Edmundo Larenas 219, Concepcion 4070409, Chile
[2] Millennium Inst Fdn Res Data, Vicuna Mackenna 4860, Santiago 7821093, Chile
[3] Ctr Biotechnol & Bioengn CeBiB, Beaucheff 851, Santiago 8370458, Chile
[4] Univ Chile, Dept Comp Sci, Beaucheff 850, Santiago 8370459, Chile
[5] Univ A Coruna, CITIC, Fac Informat, Campus Elvina S-N, La Coruna 15008, Spain
来源
关键词
COMPACT REPRESENTATION;
D O I
10.1093/comjnl/bxae090
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A raster time series is a sequence of independent rasters arranged chronologically covering the same geographical area. These are commonly used to depict the temporal evolution of represented variables. The $T$-$k<^>{2}$-raster is a compact data structure that performs very well in practice for compact representations for raster time series. This structure classifies each raster as a snapshot or a log and encodes logs concerning their reference snapshots, which are the immediately preceding selected snapshots. An enhanced version of the $T$-$k<^>{2}$-raster, called Heuristic $T$-$k<^>{2}$-raster, incorporates a heuristic for automating the selection of snapshots. In this study, we investigate the optimality of the heuristic employed in Heuristic $T$-$k<^>{2}$-raster by comparing it with a dynamic programming (DP) approach. Our experimental evaluation demonstrates that Heuristic $T$-$k<^>{2}$-raster is a near-optimal solution, achieving compression performance almost identical to the DP method. These results indicate that variations of the structure that maintain the temporal order of the rasters are unlikely to significantly improve compression. Consequently, we explore an alternative approach based on clustering, where rasters are grouped according to their similarity, regardless of their temporal order. Our experimental evaluation reveals that this clustering-based strategy can enhance compression in scenarios characterized by cyclic behaviour.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Time Series Clustering Based on Dynamic Time Warping
    Wang, Weizeng
    Lyu, Gaofan
    Shi, Yuliang
    Liang, Xun
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 487 - 490
  • [42] Time series clustering based on forecast densities
    Alonso, A. M.
    Berrendero, J. R.
    Hernandez, A.
    Justel, A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (02) : 762 - 776
  • [43] MDL-based time series clustering
    Thanawin Rakthanmanon
    Eamonn J. Keogh
    Stefano Lonardi
    Scott Evans
    Knowledge and Information Systems, 2012, 33 : 371 - 399
  • [44] A clustering-based method for fuzzy modeling
    Wong, CC
    Chen, CC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1999, E82D (06) : 1058 - 1065
  • [45] Clustering-Based Incremental Web Crawling
    Tan, Qingzhao
    Mitra, Prasenjit
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2010, 28 (04)
  • [46] Progressive Exponential Clustering-Based Steganography
    Chang-Tsun Li
    Yue Li
    EURASIP Journal on Advances in Signal Processing, 2010
  • [47] Random clustering-based outlier detector
    Kiersztyn A.
    Pylak D.
    Horodelski M.
    Kiersztyn K.
    Urbanovich P.
    Information Sciences, 2024, 667
  • [48] MDL-based time series clustering
    Rakthanmanon, Thanawin
    Keogh, Eamonn J.
    Lonardi, Stefano
    Evans, Scott
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 33 (02) : 371 - 399
  • [49] Clustering time series based on dependence structure
    Zhang, Beibei
    An, Baiguo
    PLOS ONE, 2018, 13 (11):
  • [50] Time Series Forecasting Based on Weighted Clustering
    Lee, Chie-Hong
    Su, Yann-Yean
    Lin, Yu-Chun
    Lee, Shie-Jue
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 421 - 425