A pilot study for channel catfish whole genome sequencing and de novo assembly

被引:20
|
作者
Jiang, Yanliang [1 ]
Lu, Jianguo [1 ]
Peatman, Eric [1 ]
Kucuktas, Huseyin [1 ]
Liu, Shikai [1 ]
Wang, Shaolin [1 ,2 ]
Sun, Fanyue [1 ]
Liu, Zhanjiang [1 ]
机构
[1] Auburn Univ, Fish Mol Genet & Biotechnol Lab, Dept Fisheries & Allied Aquacultures, Program Cell & Mol Biosci,Aquat Genom Unit, Auburn, AL 36849 USA
[2] Univ Virginia, Dept Psychiat & Neurobiol Sci, Charlottesville, VA 22911 USA
来源
BMC GENOMICS | 2011年 / 12卷
关键词
GENETIC-LINKAGE MAP; BAC-END SEQUENCES; ICTALURUS-PUNCTATUS; MARKER DEVELOPMENT; PHYSICAL MAP; GENERATION; CONSTRUCTION; LIBRARY; DNA;
D O I
10.1186/1471-2164-12-629
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Recent advances in next-generation sequencing technologies have drastically increased throughput and significantly reduced sequencing costs. However, the average read lengths in next-generation sequencing technologies are short as compared with that of traditional Sanger sequencing. The short sequence reads pose great challenges for de novo sequence assembly. As a pilot project for whole genome sequencing of the catfish genome, here we attempt to determine the proper sequence coverage, the proper software for assembly, and various parameters used for the assembly of a BAC physical map contig spanning approximately a million of base pairs. Results: A combination of low sequence coverage of 454 and Illumina sequencing appeared to provide effective assembly as reflected by a high N50 value. Using 454 sequencing alone, a sequencing depth of 18 X was sufficient to obtain the good quality assembly, whereas a 70 X Illumina appeared to be sufficient for a good quality assembly. Additional sequencing coverage after 18 X of 454 or after 70 X of Illumina sequencing does not provide significant improvement of the assembly. Considering the cost of sequencing, a 2 X 454 sequencing, when coupled to 70 X Illumina sequencing, provided an assembly of reasonably good quality. With several software tested, Newbler with a seed length of 16 and ABySS with a K-value of 60 appear to be appropriate for the assembly of 454 reads alone and Illumina paired-end reads alone, respectively. Using both 454 and Illumina pairedend reads, a hybrid assembly strategy using Newbler for initial 454 sequence assembly, Velvet for initial Illumina sequence assembly, followed by a second step assembly using MIRA provided the best assembly of the physical map contig, resulting in 193 contigs with a N50 value of 13,123 bp. Conclusions: A hybrid sequencing strategy using low sequencing depth of 454 and high sequencing depth of Illumina provided the good quality assembly with high N50 value and relatively low cost. A combination of Newbler, Velvet, and MIRA can be used to assemble the 454 sequence reads and the Illumina reads effectively. The assembled sequence can serve as a resource for comparative genome analysis. Additional long reads using the third generation sequencing platforms are needed to sequence through repetitive genome regions that should further enhance the sequence assembly.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A pilot study for channel catfish whole genome sequencing and de novo assembly
    Yanliang Jiang
    Jianguo Lu
    Eric Peatman
    Huseyin Kucuktas
    Shikai Liu
    Shaolin Wang
    Fanyue Sun
    Zhanjiang Liu
    BMC Genomics, 12
  • [2] Whole genome sequencing and de novo genome assembly of the Kazakh native horse Zhabe
    Assanbayev, Tolegen
    Akilzhanov, Rakhmetolla
    Sharapatov, Tlekbol
    Bektayev, Rakhimbek
    Samatkyzy, Diana
    Karabayev, Daniyar
    Gabdulkayum, Aidana
    Daniyarov, Asset
    Rakhimova, Saule
    Kozhamkulov, Ulan
    Sarbassov, Dos
    Akilzhanova, Ainur
    Kairov, Ulykbek
    FRONTIERS IN GENETICS, 2024, 15
  • [3] First de novo whole genome sequencing and assembly of the pink-footed goose
    Pujolar, J. M.
    Dalen, L.
    Olsen, R. A.
    Hansen, M. M.
    Madsen, J.
    GENOMICS, 2018, 110 (02) : 75 - 79
  • [4] First de novo whole genome sequencing and assembly of the bar-headed goose
    Wang, Wen
    Wang, Fang
    Hao, Rongkai
    Wang, Aizhen
    Sharshov, Kirill
    Druzyaka, Alexey
    Lancuo, Zhuoma
    Shi, Yuetong
    Feng, Shuo
    PEERJ, 2020, 8
  • [5] Whole genome sequencing and de novo assembly of three virulent Indian isolates of Leptospira
    Lata, Kumari Snehkant
    Vaghasia, Vibhisha
    Bhairappanavar, Shivarudrappa B.
    Kumar, Swapnil
    Ayachit, Garima
    Patel, Saumya
    Das, Jayashankar
    INFECTION GENETICS AND EVOLUTION, 2020, 85
  • [6] DE NOVO GENOME ASSEMBLY OF THE AFRICAN CATFISH (Clarias gariepinus)
    Kovacs, B.
    Barta, E.
    Pongor, S. L.
    Uri, C. S.
    Patocs, A.
    Orban, L.
    Muller, T.
    Urbanyi, B.
    AQUACULTURE, 2017, 472 : 105 - 105
  • [7] Dataset for genome sequencing and de novo assembly of the Vietnamese bighead catfish (Clarias macrocephalus Gunther, 1864)
    Duong, Thuy-Yen
    Tan, Mun Hua
    Lee, Yin Peng
    Croft, Larry
    Austin, Christopher M.
    DATA IN BRIEF, 2020, 31
  • [8] Current challenges in de novo plant genome sequencing and assembly
    Michael C Schatz
    Jan Witkowski
    W Richard McCombie
    Genome Biology, 13
  • [9] Current challenges in de novo plant genome sequencing and assembly
    Schatz, Michael C.
    Witkowski, Jan
    McCombie, W. Richard
    GENOME BIOLOGY, 2012, 13 (04):
  • [10] De novo genome assembly for third generation sequencing data
    Forc, Mateusz
    Kusmirek, Wiktor
    Nowak, Robert M.
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2018, 2018, 10808