Constructing a draft Indian cattle pangenome using short-read sequencing

被引:0
|
作者
Sarwar Azam [1 ]
Abhisek Sahu [2 ]
Naveen Kumar Pandey [1 ]
Mahesh Neupane [1 ]
Curtis P. Van Tassell [3 ]
Benjamin D. Rosen [3 ]
Ravi Kumar Gandham [3 ]
Subha Narayan Rath [1 ]
Subeer S. Majumdar [2 ]
机构
[1] National Institute of Animal Biotechnology,Animal Genomics and Improvement Laboratory
[2] Indian Institute of Technology Hyderabad,undefined
[3] USDA-ARS,undefined
关键词
D O I
10.1038/s42003-025-07978-0
中图分类号
学科分类号
摘要
Indian desi cattle, known for their adaptability and phenotypic diversity, represent a valuable genetic resource. However, a single reference genome often fails to capture the full extent of their genetic variation. To address this, we construct a pangenome for desi cattle by identifying and characterizing non-reference novel sequences (NRNS). We sequence 68 genomes from seven breeds, generating 48.35 billion short reads. Using the PanGenome Analysis (PanGA) pipeline, we identify 13,065 NRNS (~41 Mbp), with substantial variation across the population. Most NRNS were unique to desi cattle, with minimal overlap (4.1%) with the Chinese indicine pangenome. Approximately 40% of NRNS exhibited ancestral origins within the Bos genus and were enriched in genic regions, suggesting functional roles. These sequences are linked to quantitative trait loci for traits such as milk production. The pangenome approach enhances read mapping accuracy, reduces spurious single nucleotide polymorphism calls, and uncovers novel genetic variants, offering a deeper understanding of desi cattle genomics.
引用
收藏
相关论文
共 50 条
  • [21] The effect of strand bias in Illumina short-read sequencing data
    Guo, Yan
    Li, Jiang
    Li, Chung-I
    Long, Jirong
    Samuels, David C.
    Shyr, Yu
    BMC GENOMICS, 2012, 13
  • [22] Blindspots in short-read genome sequencing for classic chromosomal rearrangements
    Gauthier, Lucas
    Caillot, Claire
    Pujalte, Mathilde
    Till, Marianne
    Sanlaville, Damien
    Chatron, Nicolas
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1572 - 1572
  • [23] Innovations in Short-Read Sequencing Technologies and Their Applications to Clinical Genomics
    Polonis, Katarzyna
    Blommel, Joseph H.
    Hughes, Andrew E. O.
    Spencer, David
    Thompson, Joseph A.
    Schroeder, Molly C.
    CLINICAL CHEMISTRY, 2025, 71 (01) : 97 - 108
  • [24] Sequencing by Binding (SBB): increase accuracy of short-read genomes
    Korlach, Jonas
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 87 - 87
  • [25] The effect of strand bias in Illumina short-read sequencing data
    Yan Guo
    Jiang Li
    Chung-I Li
    Jirong Long
    David C Samuels
    Yu Shyr
    BMC Genomics, 13
  • [26] Indel variant analysis of short-read sequencing data with Scalpel
    Han Fang
    Ewa A Bergmann
    Kanika Arora
    Vladimir Vacic
    Michael C Zody
    Ivan Iossifov
    Jason A O'Rawe
    Yiyang Wu
    Laura T Jimenez Barron
    Julie Rosenbaum
    Michael Ronemus
    Yoon-ha Lee
    Zihua Wang
    Esra Dikoglu
    Vaidehi Jobanputra
    Gholson J Lyon
    Michael Wigler
    Michael C Schatz
    Giuseppe Narzisi
    Nature Protocols, 2016, 11 : 2529 - 2548
  • [27] Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing
    Philippe Lefrançois
    Ghia M Euskirchen
    Raymond K Auerbach
    Joel Rozowsky
    Theodore Gibson
    Christopher M Yellman
    Mark Gerstein
    Michael Snyder
    BMC Genomics, 10
  • [28] Will long-read sequencing technologies replace short-read sequencing technologies in the next 10 years?
    Adewale, Boluwatife A.
    AFRICAN JOURNAL OF LABORATORY MEDICINE, 2020, 9 (01)
  • [29] Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing
    Lefrancois, Philippe
    Euskirchen, Ghia M.
    Auerbach, Raymond K.
    Rozowsky, Joel
    Gibson, Theodore
    Yellman, Christopher M.
    Gerstein, Mark
    Snyder, Michael
    BMC GENOMICS, 2009, 10
  • [30] ProcaryaSV: structural variation detection pipeline for bacterial genomes using short-read sequencing
    Jugas, Robin
    Vitkova, Helena
    BMC BIOINFORMATICS, 2024, 25 (01):