A De Novo Genome Assembly Algorithm for Repeats and Nonrepeats

Cited by: 5
Authors
Lian, Shuaibin [1]
Li, Qingyan [1]
Dai, Zhiming [1,2]
Xiang, Qian [1]
Dai, Xianhua [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou 510006, Guangdong, Peoples R China
[2] SYSU CMU Shunde Int Joint Res Inst, Shunde 528300, Peoples R China
Keywords
SEQUENCING TECHNOLOGIES; STRUCTURAL VARIATION; AMPLIFICATION; DNA; IDENTIFICATION
DOI
10.1155/2014/736473
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background. Next-generation sequencing (NGS) platforms generate shorter reads, deeper coverage, and higher throughput than Sanger sequencing. These short reads may need to be assembled de novo before certain genome analyses. To date, current assemblers perform very poorly when assembling repeats. Results. To address this problem, we propose a new genome assembly algorithm, SWA, which has four properties: (1) it assembles both repeats and nonrepeats; (2) it adopts a new overlapping extension strategy to extend each seed; (3) it adopts a sliding window to filter out sequencing bias; and (4) it provides a compensation mechanism for low-coverage datasets. SWA was evaluated and validated on both simulated and real sequencing datasets. The accuracy of assembling repeats and of estimating copy numbers reaches up to 99% and 100%, respectively. Finally, extensive comparisons with eight other leading assemblers show that SWA outperforms them in the completeness and correctness of assembling repeats and nonrepeats. Conclusions. This paper proposes a new de novo genome assembly method for resolving complex repeats. SWA can not only detect where repeats and nonrepeats are located but also assemble them completely from NGS data, and its main advantage over other assemblers lies in assembling repeats.
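The abstract names two generic ideas, seed extension by read overlap and a sliding window over coverage, without giving their details. The following is only a minimal Python sketch of those generic ideas and is not SWA's actual procedure. The function names `extend_seed` and `windowed_mean`, the `min_overlap` and `window` parameters, and the toy reads are all hypothetical choices made for illustration.

```python
def extend_seed(seed, reads, min_overlap=3):
    """Greedily extend `seed` to the right (illustrative only, not SWA).

    At each step, pick the unused read whose prefix has the longest
    overlap (at least `min_overlap` bases) with the contig's suffix,
    then append the non-overlapping tail of that read.
    """
    contig = seed
    unused = [r for r in reads if r != seed]
    while True:
        best_read, best_len = None, 0
        for read in unused:
            # The overlap must leave at least one new base to append.
            max_len = min(len(read) - 1, len(contig))
            for k in range(max_len, min_overlap - 1, -1):
                if contig.endswith(read[:k]):
                    if k > best_len:
                        best_read, best_len = read, k
                    break  # longest overlap for this read found
        if best_read is None:
            return contig
        contig += best_read[best_len:]
        unused.remove(best_read)


def windowed_mean(depths, window):
    """Mean read depth in each sliding window of `window` positions.

    Smoothing per-base depth this way is one generic means of damping
    local coverage noise; how SWA does it is not stated in the abstract.
    """
    return [
        sum(depths[i:i + window]) / window
        for i in range(len(depths) - window + 1)
    ]


# Toy data (hypothetical): three overlapping reads from one sequence.
toy_reads = ["ACGTAC", "TACGGA", "GGATTC"]
contig = extend_seed("ACGTAC", toy_reads, min_overlap=3)
smoothed = windowed_mean([10, 10, 20, 20, 10], window=2)
```

In this sketch a window whose mean depth is roughly twice the baseline would hint at a two-copy repeat, but any such copy-number rule is an assumption here and not something the abstract specifies.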
Pages: 16