An efficient parallelization of longest prefix match and application on data compression

被引:3
|
作者
Ozsoy, Adnan [1 ,2 ]
机构
[1] Hacettepe Univ, Dept Comp Engn, Ankara, Turkey
[2] Indiana Univ Bloomington, Sch Informat & Comp, Bloomington, IN USA
关键词
Bit vector; lossless data compression; LZSS; longest prefix match; GPU; CUDA; ALGORITHM;
D O I
10.1177/1094342015621367
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we describe a new approach to parallelize longest prefix match (LPM) algorithm through bit parallelism, also known as bit-vector approach. This approach makes use of bit-wise computations and leverages bit parallelism. The proposed parallel algorithm will be demonstrated in dictionary-based lossless data compression on general-purpose graphics processing units (GPGPUs). One of the main contributions of this work is redesigning the core part of the data compression algorithm and replacing it with the newly proposed bit-vector LPM solution. Using bit parallelism is a fundamentally new approach for data compression and promising in performance for hybrid CPU-GPU environments. The implementation of the new compression algorithm on GPUs improves the performance of the compression process compared to the previous attempts. Moreover, the bit-vector approach opens new opportunities for improvement and increases the applicability to popular heterogeneous environments.
引用
收藏
页码:276 / 289
页数:14
相关论文
共 50 条
  • [31] A New Progressive Algorithm for a Multiple Longest Common Subsequences Problem and Its Efficient Parallelization
    Yang, Jiaoyun
    Xu, Yun
    Sun, Guangzhong
    Shang, Yi
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (05) : 862 - 870
  • [32] Efficient subsequence matching using the Longest Common Subsequence with a Dual Match index
    Han, Tae Sik
    Ko, Seung-Kyu
    Kang, Jaewoo
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 : 585 - +
  • [33] Match-length functions for data compression
    Gavish, A
    Lempel, A
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1996, 42 (05) : 1375 - 1380
  • [34] High-performance longest prefix matching supporting high-speed incremental updates and guaranteed compression
    Sundström, M
    Larzon, LA
    IEEE INFOCOM 2005: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-4, PROCEEDINGS, 2005, : 1641 - 1652
  • [35] EFFICIENT APPLICATION OF CODING TECHNIQUE FOR DATA-COMPRESSION OF ECG
    TSUDA, S
    SHIMIZU, K
    MATSUMOTO, G
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (12) : 1425 - 1433
  • [36] Efficient Longest Streak Discovery in Multidimensional Sequence Data
    Wang, Wentao
    Tang, Bo
    Zhu, Min
    WEB AND BIG DATA (APWEB-WAIM 2018), PT II, 2018, 10988 : 166 - 181
  • [37] Efficient Parallelization for Big Data Collaborative Recommendation Decisions
    Aboagye, E. O.
    Gao Jianbin
    Emmanuel, Ampoma Affum
    2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2018, : 268 - 274
  • [38] Multi-value match length functions for data compression
    Khosravifard, SM
    Nasiri-Kenari, M
    2000 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2000, : 345 - 345
  • [39] Efficient compression of scientific floating-point data and an application in structural analysis
    Liu L.
    Ogino M.
    Hagita K.
    1600, Japan Society for Computational Engineering and Science (2017):
  • [40] An efficient image data format for lossless compression and its application to interactive viewing
    Kim, YS
    Kim, WY
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL I, 1996, : 73 - 76