An accurate DNA sequence assembly algorithm based on MapReduce

Cited by: 1
Authors
Dong, Gaifang [1]
Fu, Xueliang [1]
Li, Honghui [1]
Affiliations
[1] Inner Mongolia Agr Univ, Coll Comp & Informat Engn, Hohhot 010018, Inner Mongolia, Peoples R China
Keywords
Sequence assembly; k-mer; compatibility rule; MapReduce
DOI
10.3233/JCM-160635
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
DNA sequence assembly and gene homology comparison are among the most basic and most frequently used methods in bioinformatics, and can be used to assess genetic integrity, gene structure, and gene function. In recent years, many researchers have worked on sequence assembly algorithms and have put forward some effective methods. However, assembly algorithms that are both accurate and efficient remain rare. To achieve both accuracy and efficiency, this paper proposes a sequence assembly algorithm based on MapReduce. The algorithm makes full use of the advantages of the map and reduce functions, and the experimental results show that it assembles DNA fragments quickly and accurately for both small-scale sequences and medium-scale genome sequences.
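The abstract and keywords name k-mers and the map and reduce functions but do not describe the algorithm itself. As a rough illustration only, the sketch below shows how k-mer counting, a common first step in assembly pipelines, can be phrased as a map step and a reduce step in plain Python. It is not the authors' method and does not implement their compatibility rule; the function names (`map_kmers`, `shuffle`, `reduce_counts`, `count_kmers`) and the sample reads are hypothetical.

```python
from collections import defaultdict
from itertools import chain


def map_kmers(read, k):
    """Map step: emit a (k-mer, 1) pair for every length-k window of one read."""
    return [(read[i:i + k], 1) for i in range(len(read) - k + 1)]


def shuffle(pairs):
    """Group emitted values by key, standing in for the framework's shuffle phase."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(kmer, values):
    """Reduce step: sum the occurrences recorded for one k-mer."""
    return kmer, sum(values)


def count_kmers(reads, k):
    """Run map, shuffle, and reduce in-process over a list of reads."""
    emitted = chain.from_iterable(map_kmers(read, k) for read in reads)
    grouped = shuffle(emitted)
    return dict(reduce_counts(kmer, values) for kmer, values in grouped.items())


# Hypothetical toy reads, chosen only to make the example small.
reads = ["ACGTAC", "GTACGG"]
counts = count_kmers(reads, 3)
```

In a real MapReduce deployment the shuffle is performed by the framework across machines; the in-process `shuffle` here only mimics that grouping so the sketch runs on its own.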
Pages: 519 - 526
Page count: 8
Related papers
50 in total
  • [1] An accurate algorithm for multiple sequence alignment in MapReduce
    Dong, Gaifang
    Fu, Xueliang
    Li, Honghui
    Li, Jianrong
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2018, 18 (01) : 283 - 295
  • [2] An efficient algorithm for DNA fragment assembly in MapReduce
    Xu, Baomin
    Gao, Jin
    Li, Chunyan
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2012, 426 (03) : 395 - 398
  • [3] An Accurate Sequence Assembly Algorithm for Livestock, Plants and Microorganism Based on Spark
    Dong, Gaifang
    Fu, Xueliang
    Li, Honghui
    Pan, Xu
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (08)
  • [4] Research on Parallel Hamilton-Path DNA Sequence Splicing Algorithm Based on MapReduce
    Li, Ming
    Wang, Ning
    Shi, Xiaodong
    Guo, Zhen
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 1522 - 1527
  • [5] An assembly algorithm for DNA sequence with repeats
    Sheng, QH
    Ding, DF
    ACTA BIOCHIMICA ET BIOPHYSICA SINICA, 1998, 30 (01) : 53 - 58
  • [6] Algorithm for DNA sequence assembly by quantum annealing
    Nalecz-Charkiewicz, Katarzyna
    Nowak, Robert M.
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [7] Algorithm for DNA sequence assembly by quantum annealing
    Katarzyna Nałęcz-Charkiewicz
    Robert M. Nowak
    BMC Bioinformatics, 23
  • [8] MapReduce paradigm: DNA sequence clustering based on repeats as features
    Dasari, Chandra Mohan
    Bhukya, Raju
    EXPERT SYSTEMS, 2022, 39 (01)
  • [9] Quantum algorithm for de novo DNA sequence assembly based on quantum walks on graphs
    Varsamis, G. D.
    Karafyllidis, I. G.
    Gilkes, K. M.
    Arranz, U.
    Martin-Cuevas, R.
    Calleja, G.
    Wong, J.
    Jessen, H. C.
    Dimitrakis, P.
    Kolovos, P.
    Sandaltzopoulos, R.
    BIOSYSTEMS, 2023, 233
  • [10] HapCompass: A Fast Cycle Basis Algorithm for Accurate Haplotype Assembly of Sequence Data
    Aguiar, Derek
    Istrail, Sorin
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (06) : 577 - 590