DZip: improved general-purpose lossless compression based on novel neural network modeling

被引:19
|
作者
Goyal, Mohit [1 ]
Tatwawadi, Kedar [2 ]
Chandak, Shubham [2 ]
Ochoa, Idoia [1 ,3 ]
机构
[1] Univ Illinois, Elect & Comp Engn, Urbana, IL 61801 USA
[2] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[3] Univ Navarra, Dept Elect Engn, Pamplona, Spain
关键词
D O I
10.1109/DCC50243.2021.00023
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We consider lossless compression based on statistical data modeling followed by prediction-based encoding, where an accurate statistical model for the input data leads to substantial improvements in compression. We propose DZip, a general-purpose compressor for sequential data that exploits the well-known modeling capabilities of neural networks (NNs) for prediction, followed by arithmetic coding. DZip uses a novel hybrid architecture based on adaptive and semi-adaptive training. Unlike most NN-based compressors, DZip does not require additional training data and is not restricted to specific data types. The proposed compressor outperforms general-purpose compressors such as Gzip (29% size reduction on average) and 7zip (12% size reduction on average) on a variety of real datasets, achieves near-optimal compression on synthetic datasets, and performs close to specialized compressors for large sequence lengths, without any human input. While the main limitation of NN-based compressors is generally the encoding/decoding speed, we empirically demonstrate that DZip achieves comparable compression ratio to other NN-based compressors while being several times faster. The source code for DZip and links to the datasets are available at https : //github . com/mohit1997/Dzip-torch.
引用
收藏
页码:153 / 162
页数:10
相关论文
共 50 条
  • [21] Neural Acceleration for General-Purpose Approximate Programs
    Esmaeilzadeh, Hadi
    Sampson, Adrian
    Ceze, Luis
    Burger, Doug
    2012 IEEE/ACM 45TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-45), 2012, : 449 - 460
  • [22] DESIGN OF A GENERAL-PURPOSE NEURAL SIGNAL PROCESSOR
    BEICHTER, J
    BRULS, N
    SICHENEDER, E
    RAMACHER, U
    KLAR, H
    NEUROCOMPUTING, 1993, 5 (01) : 17 - 23
  • [23] A GENERAL-PURPOSE SIMULATION ENVIRONMENT FOR NEURAL MODELS
    MESROBIAN, E
    SKRZYPEK, J
    SIMULATION, 1992, 59 (05) : 286 - 299
  • [24] Automatic general-purpose neural hardware generator
    Fábio D. Baptista
    Fernando Morgado-Dias
    Neural Computing and Applications, 2017, 28 : 25 - 36
  • [25] AN ARRAY PROCESSOR FOR GENERAL-PURPOSE DIGITAL IMAGE COMPRESSION
    YATES, RB
    THACKER, NA
    EVANS, SJ
    WALKER, SN
    IVEY, PA
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1995, 30 (03) : 244 - 250
  • [26] Toward a general-purpose analog VLSI neural network with on-chip learning
    Montalvo, AJ
    Gyurcsik, RS
    Paulos, JJ
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (02): : 413 - 423
  • [27] Using general-purpose compression algorithms for music analysis
    Louboutin, Corentin
    Meredith, David
    JOURNAL OF NEW MUSIC RESEARCH, 2016, 45 (01) : 1 - 16
  • [28] On the basis of brain: neural-network-inspired changes in general-purpose chips
    Prytkova, Ekaterina
    Vannuccini, Simone
    INDUSTRIAL AND CORPORATE CHANGE, 2022, 31 (04) : 1031 - 1055
  • [29] General-Purpose User Modeling with Behavioral Logs
    Fang, Qixiang
    Zhou, Zhihan
    Barbieri, Francesco
    Liu, Yozen
    Neves, Leonardo
    Nguyen, Dong
    Oberski, Daniel
    Bos, Maarten
    Dotsch, Ron
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2431 - 2436
  • [30] A GENERAL-PURPOSE COMMUNICATIONS SHELL FOR A NETWORK OF TRANSPUTERS
    WARING, LC
    MICROPROCESSING AND MICROPROGRAMMING, 1990, 29 (02): : 107 - 119