DNA Sequence Compression Using Adaptive Particle Swarm Optimization-Based Memetic Algorithm

被引:87
|
作者
Zhu, Zexuan [1 ]
Zhou, Jiarui [2 ]
Ji, Zhen [1 ]
Shi, Yu-Hui [3 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen City Key Lab Embedded Syst Design, Shenzhen 518060, Peoples R China
[2] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Hangzhou 310027, Zhejiang, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China
基金
中国国家自然科学基金;
关键词
Approximate repeat vector; DNA sequence compression; memetic algorithm; particle swarm optimization; SELECTION;
D O I
10.1109/TEVC.2011.2160399
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of high-throughput DNA sequencing technologies, the amount of DNA sequence data is accumulating exponentially. The huge influx of data creates new challenges for storage and transmission. This paper proposes a novel adaptive particle swarm optimization-based memetic algorithm (POMA) for DNA sequence compression. POMA is a synergy of comprehensive learning particle swarm optimization (CLPSO) and an adaptive intelligent single particle optimizer (AdpISPO)-based local search. It takes advantage of both CLPSO and AdpISPO to optimize the design of approximate repeat vector (ARV) codebook for DNA sequence compression. ARV is first introduced in this paper to represent the repeated fragments across multiple sequences in direct, mirror, pairing, and inverted patterns. In POMA, candidate ARV codebooks are encoded as particles and the optimal solution, which covers the most approximate repeated fragments with the fewest base variations, is identified through the exploration and exploitation of POMA. In each iteration of POMA, the leader particles in the swarm are selected based on weighted fitness values and each leader particle is fine-tuned with an AdpISPO-based local search, so that the convergence of the search in local region is accelerated. A detailed comparison study between POMA and the counterpart algorithms is performed on 29 (23 basic and 6 composite) benchmark functions and 11 real DNA sequences. POMA is observed to obtain better or competitive performance with a limited number of function evaluations. POMA also attains lower bits-per-base than other state-of-the-art DNA-specific algorithms on DNA sequence data. The experimental results suggest that the cooperation of CLPSO and AdpISPO in the framework of memetic algorithm is capable of searching the ARV codebook space efficiently.
引用
收藏
页码:643 / 658
页数:16
相关论文
共 50 条
  • [1] A hybrid particle swarm optimization based memetic algorithm for DNA sequence compression
    Li Tan
    Jifeng Sun
    Xueke Tong
    [J]. Soft Computing, 2015, 19 : 1255 - 1268
  • [2] A hybrid particle swarm optimization based memetic algorithm for DNA sequence compression
    Tan, Li
    Sun, Jifeng
    Tong, Xueke
    [J]. SOFT COMPUTING, 2015, 19 (05) : 1255 - 1268
  • [3] A multiobjective memetic algorithm based on particle swarm optimization
    Liu, Dasheng
    Tan, K. C.
    Goh, C. K.
    Ho, W. K.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (01): : 42 - 50
  • [4] A new memetic algorithm using particle swarm optimization and genetic algorithm
    Soak, Sang-Moon
    Lee, Sang-Wook
    Mahalik, N. P.
    Ahn, Byung-Ha
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2006, 4251 : 122 - 129
  • [5] A particle swarm optimization based memetic algorithm for dynamic optimization problems
    Wang, Hongfeng
    Yang, Shengxiang
    Ip, W. H.
    Wang, Dingwei
    [J]. NATURAL COMPUTING, 2010, 9 (03) : 703 - 725
  • [6] A particle swarm optimization based memetic algorithm for dynamic optimization problems
    Hongfeng Wang
    Shengxiang Yang
    W. H. Ip
    Dingwei Wang
    [J]. Natural Computing, 2010, 9 : 703 - 725
  • [7] Intelligent DNA sequence data compression using memetic algorithm
    College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang 310027, China
    不详
    不详
    [J]. Tien Tzu Hsueh Pao, 2013, 3 (513-518):
  • [8] DNA sequence data compression method based on Memetic Algorithm
    Tan, Li
    Sun, Ji-Feng
    Guo, Li-Hua
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2014, 36 (01): : 121 - 127
  • [9] A memetic particle swarm optimization algorithm for solving the DNA fragment assembly problem
    Huang, Ko-Wei
    Chen, Jui-Le
    Yang, Chu-Sing
    Tsai, Chun-Wei
    [J]. NEURAL COMPUTING & APPLICATIONS, 2015, 26 (03): : 495 - 506
  • [10] A memetic particle swarm optimization algorithm for solving the DNA fragment assembly problem
    Ko-Wei Huang
    Jui-Le Chen
    Chu-Sing Yang
    Chun-Wei Tsai
    [J]. Neural Computing and Applications, 2015, 26 : 495 - 506