A Randomly Accessible Lossless Compression Scheme for Time-Series Data

被引:0
|
作者
Vestergaard, Rasmus [1 ]
Lucani, Daniel E.
Zhang, Qi
机构
[1] Aarhus Univ, DIGIT, Aarhus, Denmark
关键词
D O I
10.1109/infocom41043.2020.9155450
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We detail a practical compression scheme for lossless compression of time-series data, based on the emerging concept of generalized deduplication. As data is no longer stored for just archival purposes, but needs to be continuously accessed in many applications, the scheme is designed for low-cost random access to its compressed data, avoiding decompression. With this method, an arbitrary bit of the original data can be read by accessing only a few hundred bits in the worst case, several orders of magnitude fewer than state-of-the-art compression schemes. Subsequent retrieval of bits requires visiting at most a few tens of bits. A comprehensive evaluation of the compressor on eight real-life data sets from various domains is provided. The cost of this random access capability is a loss in compression ratio compared with the state-of-the-art compression schemes BZIP2 and 7z, which can be as low as 5% depending on the data set. Compared to GZIP, the proposed scheme has a better compression ratio for most of the data sets. Our method has massive potential for applications requiring frequent random accesses, as the only existing approach with comparable random access cost is to store the data without compression.
引用
收藏
页码:2145 / 2154
页数:10
相关论文
共 50 条
  • [1] Lossless Data Compression for Time-Series Sensor Data Based on Dynamic Bit Packing
    Hwang, Sang-Ho
    Kim, Kyung-Min
    Kim, Sungho
    Kwak, Jong Wook
    [J]. SENSORS, 2023, 23 (20)
  • [2] Lossless Compression of Time-Series Data Based on Increasing Average of Neighboring Signals
    Takezawa, Tetsuya
    Asakura, Koichi
    Watanabe, Toyohide
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2010, 93 (08) : 47 - 56
  • [3] A lossless compression method of time-series data based on increasing average of neighboring signals
    Takezawa, Tetsuya
    Asakura, Koichi
    Watanabe, Toyohide
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2008, 128 (02) : 318 - 325
  • [4] Time-series analysis if data are randomly missing
    Broersen, PMT
    Bos, R
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2006, 55 (01) : 79 - 84
  • [5] Lossless Compression of Time Series Data with Generalized Deduplication
    Vestergaard, Rasmus
    Zhang, Qi
    Lucani, Daniel E.
    [J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [6] A framework for predictive compression of time-series data
    Mukherjee, S
    Zobel, J
    [J]. PROCEEDINGS OF THE 21ST AUSTRALASIAN COMPUTER SCIENCE CONFERENCE, ACSC'98, 1998, 20 (01): : 95 - 105
  • [7] A new compression algorithm for spectral and time-series data
    Hawkins, SE
    Darlington, EH
    Cheng, AF
    Hayes, JR
    [J]. ACTA ASTRONAUTICA, 2003, 52 (2-6) : 487 - 492
  • [8] An efficient lossless compression of multichannel time-series signals by MPEG-4 ALS
    Kamamoto, Yutaka
    Harada, Noboru
    Moriya, Takehiro
    Ito, Nobutaka
    Ono, Nobutaka
    Nishimoto, Takuya
    Sagayama, Shigeki
    [J]. ISCE: 2009 IEEE 13TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, VOLS 1 AND 2, 2009, : 901 - +
  • [9] A Lossless Astronomical Data Compression Scheme with FPGA Acceleration
    Zheng, Yu
    Zhu, Yongxin
    Song, Yuefeng
    Nan, Tianhao
    Li, Wanyi
    [J]. 32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 45 - 49
  • [10] IMAGE DATA-COMPRESSION USING AUTOREGRESSIVE TIME-SERIES MODELS
    DELP, EJ
    KASHYAP, RL
    MITCHELL, OR
    [J]. PATTERN RECOGNITION, 1979, 11 (5-6) : 313 - 323