A pilot study for channel catfish whole genome sequencing and de novo assembly

被引:20
|
作者
Jiang, Yanliang [1 ]
Lu, Jianguo [1 ]
Peatman, Eric [1 ]
Kucuktas, Huseyin [1 ]
Liu, Shikai [1 ]
Wang, Shaolin [1 ,2 ]
Sun, Fanyue [1 ]
Liu, Zhanjiang [1 ]
机构
[1] Auburn Univ, Fish Mol Genet & Biotechnol Lab, Dept Fisheries & Allied Aquacultures, Program Cell & Mol Biosci,Aquat Genom Unit, Auburn, AL 36849 USA
[2] Univ Virginia, Dept Psychiat & Neurobiol Sci, Charlottesville, VA 22911 USA
来源
BMC GENOMICS | 2011年 / 12卷
关键词
GENETIC-LINKAGE MAP; BAC-END SEQUENCES; ICTALURUS-PUNCTATUS; MARKER DEVELOPMENT; PHYSICAL MAP; GENERATION; CONSTRUCTION; LIBRARY; DNA;
D O I
10.1186/1471-2164-12-629
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Recent advances in next-generation sequencing technologies have drastically increased throughput and significantly reduced sequencing costs. However, the average read lengths in next-generation sequencing technologies are short as compared with that of traditional Sanger sequencing. The short sequence reads pose great challenges for de novo sequence assembly. As a pilot project for whole genome sequencing of the catfish genome, here we attempt to determine the proper sequence coverage, the proper software for assembly, and various parameters used for the assembly of a BAC physical map contig spanning approximately a million of base pairs. Results: A combination of low sequence coverage of 454 and Illumina sequencing appeared to provide effective assembly as reflected by a high N50 value. Using 454 sequencing alone, a sequencing depth of 18 X was sufficient to obtain the good quality assembly, whereas a 70 X Illumina appeared to be sufficient for a good quality assembly. Additional sequencing coverage after 18 X of 454 or after 70 X of Illumina sequencing does not provide significant improvement of the assembly. Considering the cost of sequencing, a 2 X 454 sequencing, when coupled to 70 X Illumina sequencing, provided an assembly of reasonably good quality. With several software tested, Newbler with a seed length of 16 and ABySS with a K-value of 60 appear to be appropriate for the assembly of 454 reads alone and Illumina paired-end reads alone, respectively. Using both 454 and Illumina pairedend reads, a hybrid assembly strategy using Newbler for initial 454 sequence assembly, Velvet for initial Illumina sequence assembly, followed by a second step assembly using MIRA provided the best assembly of the physical map contig, resulting in 193 contigs with a N50 value of 13,123 bp. Conclusions: A hybrid sequencing strategy using low sequencing depth of 454 and high sequencing depth of Illumina provided the good quality assembly with high N50 value and relatively low cost. A combination of Newbler, Velvet, and MIRA can be used to assemble the 454 sequence reads and the Illumina reads effectively. The assembled sequence can serve as a resource for comparative genome analysis. Additional long reads using the third generation sequencing platforms are needed to sequence through repetitive genome regions that should further enhance the sequence assembly.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Whole Genome Amplification and De novo Assembly of Single Bacterial Cells
    Rodrigue, Sebastien
    Malmstrom, Rex R.
    Berlin, Aaron M.
    Birren, Bruce W.
    Henn, Matthew R.
    Chisholm, Sallie W.
    PLOS ONE, 2009, 4 (09):
  • [32] Whole Genome Analyses of Chinese Population and De Novo Assembly of A Northern Han Genome
    Zhenglin Du
    Liang Ma
    Hongzhu Qu
    Wei Chen
    Bing Zhang
    Xi Lu
    Weibo Zhai
    Xin Sheng
    Yongqiao Sun
    Wenjie Li
    Meng Lei
    Qiuhui Qi
    Na Yuan
    Shuo Shi
    Jingyao Zeng
    Jinyue Wang
    Yadong Yang
    Qi Liu
    Yaqiang Hong
    Lili Dong
    Zhewen Zhang
    Dong Zou
    Yanqing Wang
    Shuhui Song
    Fan Liu
    Xiangdong Fang
    Hua Chen
    Xin Liu
    Jingfa Xiao
    Changqing Zeng
    Genomics,Proteomics & Bioinformatics, 2019, (03) : 229 - 247
  • [33] Whole Genome Analyses of Chinese Population and De Novo Assembly of A Northern Han Genome
    Du, Zhenglin
    Ma, Liang
    Qu, Hongzhu
    Chen, Wei
    Zhang, Bing
    Lu, Xi
    Zhai, Weibo
    Sheng, Xin
    Sun, Yongqiao
    Li, Wenjie
    Lei, Meng
    Qi, Qiuhui
    Yuan, Na
    Shi, Shuo
    Zeng, Jingyao
    Wang, Jinyue
    Yang, Yadong
    Liu, Qi
    Hong, Yaqiang
    Dong, Lili
    Zhang, Zhewen
    Zou, Dong
    Wang, Yanqing
    Song, Shuhui
    Liu, Fan
    Fang, Xiangdong
    Chen, Hua
    Liu, Xin
    Xiao, Jingfa
    Zeng, Changqing
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2019, 17 (03) : 229 - 247
  • [34] Whole Genome Analyses of Chinese Population and De Novo Assembly of A Northern Han Genome
    Zhenglin Du
    Liang Ma
    Hongzhu Qu
    Wei Chen
    Bing Zhang
    Xi Lu
    Weibo Zhai
    Xin Sheng
    Yongqiao Sun
    Wenjie Li
    Meng Lei
    Qiuhui Qi
    Na Yuan
    Shuo Shi
    Jingyao Zeng
    Jinyue Wang
    Yadong Yang
    Qi Liu
    Yaqiang Hong
    Lili Dong
    Zhewen Zhang
    Dong Zou
    Yanqing Wang
    Shuhui Song
    Fan Liu
    Xiangdong Fang
    Hua Chen
    Xin Liu
    Jingfa Xiao
    Changqing Zeng
    Genomics,Proteomics & Bioinformatics, 2019, 17 (03) : 229 - 247
  • [35] Whole-Genome Sequencing and De Novo Assembly of Malassezia pachydermatis Isolated from the Ear Canal of a Dog with Otitis
    D'Andreano, S.
    Vines, J.
    Francino, O.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2021, 10 (21):
  • [36] Whole-Genome Sequencing and Annotation of Aeromonas veronii Isolates from Channel Catfish
    Abernathy, Jason W. W.
    Zhang, Dunhua
    Liles, Mark R. R.
    Lange, Miles D. D.
    Shoemaker, Craig A. A.
    Beck, Benjamin H. H.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2023, 12 (02):
  • [37] USING WHOLE GENOME SEQUENCING TO IDENTIFY DE NOVO VARIATION IN BIPOLAR DISORDER
    Goes, Fernando
    Pirooznia, Mehdi
    Zandi, Peter
    Tehan, Martin
    Pulver, Ann
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2019, 29 : S827 - S828
  • [38] De novo assembly of the cattle reference genome with single-molecule sequencing
    Rosen, Benjamin D.
    Bickhart, Derek M.
    Schnabel, Robert D.
    Koren, Sergey
    Elsik, Christine G.
    Tseng, Elizabeth
    Rowan, Troy N.
    Low, Wai Y.
    Zimin, Aleksey
    Couldrey, Christine
    Hall, Richard
    Li, Wenli
    Rhie, Arang
    Ghurye, Jay
    McKay, Stephanie D.
    Thibaud-Nissen, Francoise
    Hoffman, Jinna
    Murdoch, Brenda M.
    Snelling, Warren M.
    McDaneld, Tara G.
    Hammond, John A.
    Schwartz, John C.
    Nandolo, Wilson
    Hagen, Darren E.
    Dreischer, Christian
    Schultheiss, Sebastian J.
    Schroeder, Steven G.
    Phillippy, Adam M.
    Cole, John B.
    Van Tassell, Curtis P.
    Liu, George
    Smith, Timothy P. L.
    Medrano, Juan F.
    GIGASCIENCE, 2020, 9 (03):
  • [39] Long-read sequencing and de novo assembly of the cynomolgus macaque genome
    Bai, Bing
    Wang, Yi
    Zhu, Ran
    Zhang, Yaolei
    Wang, Hong
    Fan, Guangyi
    Liu, Xin
    Shi, Hong
    Niu, Yuyu
    Ji, Weizhi
    JOURNAL OF GENETICS AND GENOMICS, 2022, 49 (10) : 975 - 978
  • [40] Long-read sequencing and de novo assembly of the cynomolgus macaque genome
    Bing Bai
    Yi Wang
    Ran Zhu
    Yaolei Zhang
    Hong Wang
    Guangyi Fan
    Xin Liu
    Hong Shi
    Yuyu Niu
    Weizhi Ji
    JournalofGeneticsandGenomics, 2022, 49 (10) : 975 - 978