A Randomly Accessible Lossless Compression Scheme for Time-Series Data

被引:0
|
作者
Vestergaard, Rasmus [1 ]
Lucani, Daniel E.
Zhang, Qi
机构
[1] Aarhus Univ, DIGIT, Aarhus, Denmark
关键词
D O I
10.1109/infocom41043.2020.9155450
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We detail a practical compression scheme for lossless compression of time-series data, based on the emerging concept of generalized deduplication. As data is no longer stored for just archival purposes, but needs to be continuously accessed in many applications, the scheme is designed for low-cost random access to its compressed data, avoiding decompression. With this method, an arbitrary bit of the original data can be read by accessing only a few hundred bits in the worst case, several orders of magnitude fewer than state-of-the-art compression schemes. Subsequent retrieval of bits requires visiting at most a few tens of bits. A comprehensive evaluation of the compressor on eight real-life data sets from various domains is provided. The cost of this random access capability is a loss in compression ratio compared with the state-of-the-art compression schemes BZIP2 and 7z, which can be as low as 5% depending on the data set. Compared to GZIP, the proposed scheme has a better compression ratio for most of the data sets. Our method has massive potential for applications requiring frequent random accesses, as the only existing approach with comparable random access cost is to store the data without compression.
引用
收藏
页码:2145 / 2154
页数:10
相关论文
共 50 条
  • [21] Multiresolution lossless compression scheme
    Piscaglia, P
    Macq, B
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL I, 1996, : 69 - 72
  • [22] Simultaneous compression and opacity data from time-series radiography with a Lagrangian marker
    Swift, Damian C.
    Kritcher, Andrea L.
    Hawreliak, James A.
    Gaffney, James
    Lazicki, Amy
    MacPhee, Andrew
    Bachmann, Benjamin
    Doppner, Tilo
    Nilsen, Joseph
    Whitley, Heather D.
    Collins, Gilbert W.
    Glenzer, Siegfried
    Rothman, Stephen D.
    Kraus, Dominik
    Falcone, Roger W.
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2021, 92 (06):
  • [23] Change a Bit to save Bytes: Compression for Floating Point Time-Series Data
    Taurone, Francesco
    Lucani, Daniel E.
    Feher, Marcell
    Zhang, Qi
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3756 - 3761
  • [24] Lossless real-time compression of ultrasonic data
    Wunderlich, J
    Strutz, T
    TECHNISCHES MESSEN, 2000, 67 (11): : 479 - 483
  • [25] Fuzzy data mining for time-series data
    Chen, Chun-Hao
    Hong, Tzung-Pei
    Tseng, Vincent S.
    APPLIED SOFT COMPUTING, 2012, 12 (01) : 536 - 542
  • [26] DEVELOPING AND EVALUATING A LOSSLESS COMPRESSION SCHEME FOR SCIENTIFIC DATA FROM A NANOSATELLITE
    Clark, Spencer
    Makaroff, Dwight
    Stanley, Kevin
    2013 26TH ANNUAL IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2013, : 707 - 710
  • [27] An evaluation of combinations of lossy compression and change-detection approaches for time-series data
    Hollmig, Gregor
    Horne, Matthias
    Leimkuehler, Simon
    Schoell, Frederik
    Strunk, Carsten
    Englhardt, Adrian
    Efros, Pavel
    Buchmann, Erik
    Boehm, Klemens
    INFORMATION SYSTEMS, 2017, 65 : 65 - 77
  • [28] A prediction-based neural network scheme for lossless data compression
    Logeswaran, R
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2002, 32 (04): : 358 - 365
  • [29] Lossless Compression Scheme for Efficient GNSS Data Transmission on IoT Devices
    Perez, Rafael
    Leithardt, Valderi R. Q.
    Correia, Sergio D.
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 1668 - 1673
  • [30] A Neural Data Lossless Compression Scheme Based on Spatial and Temporal Prediction
    Pagin, Matteo
    Ortmanns, Maurits
    2017 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS), 2017,