Hi-C scaffolded short- and long-read genome assemblies of the California sea lion are broadly consistent for syntenic inference across 45 million years of evolution

被引:10
|
作者
Peart, Claire R. [1 ]
Williams, Christina [2 ]
Pophaly, Saurabh D. [1 ,3 ]
Neely, Benjamin A. [4 ]
Gulland, Frances M. D. [5 ]
Adams, David J. [6 ]
Ng, Bee Ling [6 ]
Cheng, William [6 ]
Goebel, Michael E. [7 ]
Fedrigo, Olivier [8 ]
Haase, Bettina [8 ]
Mountcastle, Jacquelyn [8 ]
Fungtammasan, Arkarachai [9 ]
Formenti, Giulio [8 ,10 ]
Collins, Joanna [11 ]
Wood, Jonathan [11 ]
Sims, Ying [11 ]
Torrance, James [11 ]
Tracey, Alan [11 ]
Howe, Kerstin [11 ]
Rhie, Arang [12 ]
Hoffman, Joseph, I [13 ,14 ]
Johnson, Jeremy [15 ]
Jarvis, Erich D. [8 ,16 ]
Breen, Matthew [2 ,17 ]
Wolf, Jochen B. W. [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Div Evolutionary Biol, Fac Biol, Munich, Germany
[2] North Carolina State Univ, Coll Vet Med, Dept Mol Biomed Sci, Raleigh, NC USA
[3] Max Planck Inst Plant Breeding Res, Cologne, Germany
[4] NIST, Charleston, SC USA
[5] Univ Calif Davis, Karen Dryer Wildlife Hlth Ctr, Davis, CA 95616 USA
[6] Wellcome Sanger Inst, Cytometry Core Facil, Cambridge, England
[7] Univ Calif Santa Cruz, Inst Marine Sci, Santa Cruz, CA 95064 USA
[8] Rockefeller Univ, Vertebrate Genome Lab, New York, NY 10021 USA
[9] DNAnexus Inc, Mountain View, CA USA
[10] Rockefeller Univ, Lab Neurogenet Language, New York, NY 10021 USA
[11] Wellcome Sanger Inst, Tree Life Programme, Cambridge, England
[12] NHGRI, Computat & Stat Genom Branch, Genome Informat Sect, NIH, Bethesda, MD USA
[13] Bielefeld Univ, Dept Anim Behav, Bielefeld, Germany
[14] British Antarctic Survey, Cambridge, England
[15] Broad Inst Harvard & MIT, Cambridge, MA USA
[16] Howard Hughes Med Inst, Chevy Chase, MD USA
[17] North Carolina State Univ, Comparat Med Inst, Raleigh, NC USA
关键词
California sea lion (Zalophus californianus); cancer; Carnivora; chromatin interaction mapping; genome assembly; genome evolution; Hi-C; CHROMOSOME EVOLUTION; CONVERGENT EVOLUTION; CARNIVORA; DOG; REVEALS; BROWSER; CAT; CONSERVATION; SEQUENCES; KARYOTYPE;
D O I
10.1111/1755-0998.13443
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With the advent of chromatin-interaction maps, chromosome-level genome assemblies have become a reality for a wide range of organisms. Scaffolding quality is, however, difficult to judge. To explore this gap, we generated multiple chromosome-scale genome assemblies of an emerging wild animal model for carcinogenesis, the California sea lion (Zalophus californianus). Short-read assemblies were scaffolded with two independent chromatin interaction mapping data sets (Hi-C and Chicago), and long-read assemblies with three data types (Hi-C, optical maps and 10X linked reads) following the "Vertebrate Genomes Project (VGP)" pipeline. In both approaches, 18 major scaffolds recovered the karyotype (2n = 36), with scaffold N50s of 138 and 147 Mb, respectively. Synteny relationships at the chromosome level with other pinniped genomes (2n = 32-36), ferret (2n = 34), red panda (2n = 36) and domestic dog (2n = 78) were consistent across approaches and recovered known fissions and fusions. Comparative chromosome painting and multicolour chromosome tiling with a panel of 264 genome-integrated single-locus canine bacterial artificial chromosome probes provided independent evaluation of genome organization. Broad-scale discrepancies between the approaches were observed within chromosomes, most commonly in translocations centred around centromeres and telomeres, which were better resolved in the VGP assembly. Genomic and cytological approaches agreed on near-perfect synteny of the X chromosome, and in combination allowed detailed investigation of autosomal rearrangements between dog and sea lion. This study presents high-quality genomes of an emerging cancer model and highlights that even highly fragmented short-read assemblies scaffolded with Hi-C can yield reliable chromosome-level scaffolds suitable for comparative genomic analyses.
引用
收藏
页码:2455 / 2470
页数:16
相关论文
empty
未找到相关数据