DNA Compression using Referential Compression Algorithm

被引:0
|
作者
Mehta, Kanika [1 ]
Ghrera, Satya Prakash [1 ]
机构
[1] Jaypee Univ Informat Technol, Dept Comp Sci & Engn, Solan 173234, Himachal Prades, India
关键词
Referential Compression; sequences; suffix array; fingerprints;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With rapid technological development and growth of sequencing data, an umpteen gamut of biological data has been generated. As an alternative, Data Compression is employed to reduce the size of data. In this direction, this paper proposes a new reference-based compression approach, which is employed as a solution. Firstly, a reference has been constructed from the common sub strings of randomly selected input sequences. Reference set is a pair of key and value, where key is a fingerprint (or a unique id) and value is a sequence of characters. Next, these given sequences are compressed using referential compression algorithm. This is attained by matching the input with the reference and hence, replacing the match found in input by its fingerprints contained in the reference, thereby achieving better compression. The experimental results of this paper show that the approach proposed herein, outperforms the existing approaches and methodologies applied so far.
引用
收藏
页码:64 / 69
页数:6
相关论文
共 50 条
  • [41] DNA Sequence Compression Using Adaptive Particle Swarm Optimization-Based Memetic Algorithm
    Zhu, Zexuan
    Zhou, Jiarui
    Ji, Zhen
    Shi, Yu-Hui
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2011, 15 (05) : 643 - 658
  • [42] A new signal compression algorithm using AllPass extraction and its use in image compression and coding
    Fahmy, M. F.
    Fahmy, G.
    2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 91 - +
  • [43] A compression method for DNA
    Du, Shengwang
    Li, Junyi
    Bian, Naizheng
    PLOS ONE, 2020, 15 (11):
  • [44] A new efficient referential genome compression technique for FastQ files
    Sanjeev Kumar
    Mukund Pratap Singh
    Soumya Ranjan Nayak
    Asif Uddin Khan
    Anuj Kumar Jain
    Prabhishek Singh
    Manoj Diwakar
    Thota Soujanya
    Functional & Integrative Genomics, 2023, 23
  • [45] DNA sequence compression
    Korodi, Gergely
    Tabus, Ioan
    Rissanen, Jorma
    Astola, Jaakko
    IEEE SIGNAL PROCESSING MAGAZINE, 2007, 24 (01) : 47 - 53
  • [46] A new efficient referential genome compression technique for FastQ files
    Kumar, Sanjeev
    Singh, Mukund Pratap
    Nayak, Soumya Ranjan
    Khan, Asif Uddin
    Jain, Anuj Kumar
    Singh, Prabhishek
    Diwakar, Manoj
    Soujanya, Thota
    FUNCTIONAL & INTEGRATIVE GENOMICS, 2023, 23 (04)
  • [47] Vector quantization using the firefly algorithm for image compression
    Horng, Ming-Huwi
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 1078 - 1091
  • [48] Adjustable Model Compression Using Multiple Genetic Algorithm
    Ople, Jose Jaena Mari
    Huang, Tai-Ming
    Chiu, Ming-Chih
    Chen, Yi-Ling
    Hua, Kai-Lung
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1125 - 1132
  • [49] On system behaviour using complex networks of a compression algorithm
    Walker, David M.
    Correa, Debora C.
    Small, Michael
    CHAOS, 2018, 28 (01)
  • [50] Pyramidal Image Compression Using the Parameterized NEDI Algorithm
    M. V. Gashnikov
    Optical Memory and Neural Networks, 2021, 30 : 187 - 193