Construction of a new chromosome-scale, long-read reference genome assembly for the Syrian hamster, Mesocricetus auratus

Authors
Harris, R. Alan [1 ,2 ]
Raveendran, Muthuswamy [1 ,2 ]
Lyfoung, Dustin T. [3 ]
Sedlazeck, Fritz J. [1 ,2 ]
Mahmoud, Medhat [1 ,2 ]
Prall, Trent M. [4 ]
Karl, Julie A. [4 ]
Doddapaneni, Harshavardhan [1 ,2 ]
Meng, Qingchang [1 ,2 ]
Han, Yi [1 ,2 ]
Muzny, Donna [1 ,2 ]
Wiseman, Roger W. [3 ,4 ]
O'Connor, David H. [3 ,4 ]
Rogers, Jeffrey [1 ,2 ]
Affiliations
[1] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
[2] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[3] Univ Wisconsin, Wisconsin Natl Primate Res Ctr, 1220 Capitol Court, Madison, WI 53711 USA
[4] Univ Wisconsin, Dept Pathol & Lab Med, 3170 UW Med Fdn Centennial Bldg MFCB, Madison, WI 53711 USA
Source
GIGASCIENCE | 2022 / Vol. 11
Funding
National Institutes of Health (USA);
Keywords
Syrian hamster; Mesocricetus auratus; genome; disease model; COVID-19;
DOI
Not available
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: The Syrian hamster (Mesocricetus auratus) has been suggested as a useful mammalian model for a variety of diseases and infections, including infection with respiratory viruses such as SARS-CoV-2. The MesAur1.0 genome assembly was generated in 2013 using whole-genome shotgun sequencing with short-read sequence data. Current, more advanced sequencing technologies and assembly methods permit the generation of near-complete genome assemblies with higher quality and greater continuity. Findings: Here, we report an improved assembly of the M. auratus genome (BCM_Maur_2.0) using Oxford Nanopore Technologies long-read sequencing to produce a chromosome-scale assembly. The total length of the new assembly is 2.46 Gb, similar to the 2.50-Gb length of the previous assembly of this genome, MesAur1.0. BCM_Maur_2.0 exhibits significantly improved continuity, with a scaffold N50 that is 6.7 times greater than that of MesAur1.0. Furthermore, 21,616 protein-coding genes and 10,459 noncoding genes are annotated in BCM_Maur_2.0, compared to 20,495 protein-coding genes and 4,168 noncoding genes in MesAur1.0. The new assembly also reduces unresolved regions, as measured by nucleotide ambiguities: approximately 17.11% of bases in MesAur1.0 were unresolved, compared with 3.00% in BCM_Maur_2.0. Conclusions: Access to a more complete reference genome with improved accuracy and continuity will facilitate more detailed, comprehensive, and meaningful research results for a wide variety of future studies using Syrian hamsters as models.
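The abstract compares the two assemblies by scaffold N50 and by the share of unresolved bases. As a rough illustration of how such metrics are commonly computed, the sketch below uses the standard N50 definition (the length of the scaffold at which the cumulative length, counted from the longest scaffold down, first reaches half of the total) and assumes that unresolved bases are written as `N`. The function names and toy inputs are hypothetical and are not taken from the paper or its pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    Scaffolds are sorted from longest to shortest; N50 is the length of
    the scaffold at which the running total first reaches half of the
    summed length of all scaffolds.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0  # empty input


def ambiguous_fraction(sequences):
    """Return the fraction of bases written as 'N' (assumed to mean unresolved)."""
    total = sum(len(seq) for seq in sequences)
    if total == 0:
        return 0.0
    n_count = sum(seq.upper().count("N") for seq in sequences)
    return n_count / total


# Toy data for illustration only, not real assembly data.
toy_lengths = [80, 70, 50, 40, 30, 20]
toy_sequences = ["ACGTNNNN", "ACGT"]

print(n50(toy_lengths))
print(ambiguous_fraction(toy_sequences))
```

On real assemblies the lengths and sequences would come from a FASTA file, and the ratio of two N50 values gives a fold improvement of the kind reported above.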
Pages: 8