A Strategy of Assessing Gene Copy Number Differentiation Between Populations Using Ultra-Fast De Novo Assembly of Next-Generation Sequencing Data

被引:0
|
作者
Shi, Tao [1 ,2 ]
Gao, Zhiyan [1 ,2 ]
Zhang, Yue [1 ,2 ,3 ]
Rausher, Mark D. [4 ]
Chen, Jinming [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, State Key Lab Plant Divers & Specialty Crops, Wuhan Bot Garden, Wuhan, Peoples R China
[2] Chinese Acad Sci, Hubei Key Lab Wetland Evolut & Ecol Restorat, Wuhan Bot Garden, Wuhan, Peoples R China
[3] Chinese Acad Sci, Aquat Plant Res Ctr, Wuhan Bot Garden, Wuhan, Peoples R China
[4] Duke Univ, Dept Biol, Durham, NC 27708 USA
基金
中国国家自然科学基金;
关键词
de novo assembly; gene copy number variation; Nelumbo; next-generation sequencing; COMPARATIVE GENOMIC HYBRIDIZATION; READ ALIGNMENT; NORTH-AMERICA; EASTERN ASIA; EVOLUTION; DUPLICATION; POLYMORPHISM; DIVERSIFICATION; DIVERSITY; DISCOVERY;
D O I
10.1111/1755-0998.14080
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene duplication and loss play pivotal roles in the evolutionary dynamics of genomes, contributing to species phenotypic diversity and adaptation. However, detecting copy number variations (CNVs) in homoploid populations and newly-diverged species using short reads from next-generation sequencing (NGS) with traditional methods can often be challenging due to uneven read coverage caused by variations in GC content and the presence of repetitive sequences. To address these challenges, we developed a novel pipeline, ST4gCNV, which leverages ultra-fast de novo assemblies of NGS data to detect gene-specific CNVs between populations. The pipeline effectively reduces the variance of read coverage due to technical factors such as GC bias, providing a reliable CNV detection with a minimum sequencing depth of 10. We successfully apply ST4gCNV to the resequencing analysis of homoploid species Nelumbo nucifera and Nelumbo lutea (lotus). We reveal significant CNV-driven differentiation between these species, particularly in genes related to petal colour diversity such as those involved in the anthocyanin pathway. By highlighting the extensive gene duplication and loss events in Nelumbo, our study demonstrates the utility of ST4gCNV in population genomics and underscores its potential of integrating genomic CNV analysis with traditional SNP-based resequencing analysis.
引用
收藏
页数:14
相关论文
共 32 条
  • [21] Clinical implications of copy number alteration detection using panel-based next-generation sequencing data in myelodysplastic syndrome
    Kim, Yoo-Jin
    Jung, Seung-Hyun
    Hur, Eun-Hye
    Choi, Eun-Ji
    Lee, Kyoo-Hyung
    Park, Hyeon-Chun
    Kim, Hye Joung
    Kwon, Yong-Rim
    Park, Silvia
    Lee, Sug Hyung
    Chung, Yeun-Jun
    Lee, Je-Hwan
    LEUKEMIA RESEARCH, 2021, 103
  • [22] RKDOSCNV: A Local Kernel Density-Based Approach to the Detection of Copy Number Variations by Using Next-Generation Sequencing Data
    Liu, Guojun
    Zhang, Junying
    Yuan, Xiguo
    Wei, Chao
    FRONTIERS IN GENETICS, 2020, 11
  • [23] CNV_MCD: Detection of copy number variations based on minimum covariance determinant using next-generation sequencing data
    Li, Yaoyao
    Yang, Fangjia
    Xie, Kun
    DIGITAL SIGNAL PROCESSING, 2024, 154
  • [24] Concurrent Detection of Target Copy Number Variants and Gene Variants (SNV/indels) in CLL Samples Using a Next-Generation Sequencing Panel
    Georgieva, L.
    Uddin, E.
    Heckel, A.
    Pullabhatla, V.
    Reid, J.
    Speight, G.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2019, 21 (06): : 1231 - 1231
  • [25] CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data
    Sinha, Rituparna
    Samaddar, Sandip
    De, Rajat K.
    PLOS ONE, 2015, 10 (08):
  • [26] Identification of a Novel De Novo Heterozygous Deletion in the SOX10 Gene in Waardenburg Syndrome Type II Using Next-Generation Sequencing
    Li, Haonan
    Jin, Peng
    Hao, Qian
    Zhu, Wei
    Chen, Xia
    Wang, Ping
    GENETIC TESTING AND MOLECULAR BIOMARKERS, 2017, 21 (11) : 681 - 685
  • [27] Consistent count region-copy number variation (CCR-CNV): an expandable and robust tool for clinical diagnosis of copy number variation at the exon level using next-generation sequencing data
    Kim, Man Jin
    Lee, Sungyoung
    Yun, Hongseok
    Cho, Sung Im
    Kim, Boram
    Lee, Jee-Soo
    Chae, Jong Hee
    Sun, Choonghyun
    Park, Sung Sup
    Seong, Moon-Woo
    GENETICS IN MEDICINE, 2022, 24 (03) : 663 - 672
  • [28] Comprehensive Analysis of Clinically Relevant Copy Number Alterations (CNAs) Using a 523-Gene Next-Generation Sequencing Panel and NxClinical Software in Solid Tumors
    Gupta, Vivek
    Vashisht, Vishakha
    Vashisht, Ashutosh
    Mondal, Ashis K.
    Alptekin, Ahmet
    Singh, Harmanpreet
    Kolhe, Ravindra
    GENES, 2024, 15 (04)
  • [29] Challenges in detecting genomic copy number aberrations using next-generation sequencing data and the eXome Hidden Markov Model: A clinical exome-first diagnostic approach
    Yamamoto T.
    Shimojima K.
    Ondo Y.
    Imai K.
    Chong P.F.
    Kira R.
    Amemiya M.
    Saito A.
    Okamoto N.
    Human Genome Variation, 3 (1)
  • [30] De Novo Assembly-Based Analysis of RPGR Exon ORF15 in an Indigenous African Cohort Overcomes Limitations of a Standard Next-Generation Sequencing (NGS) Data Analysis Pipeline
    Maggi, Jordi
    Roberts, Lisa
    Koller, Samuel
    Rebello, George
    Berger, Wolfgang
    Ramesar, Rajkumar
    GENES, 2020, 11 (07) : 1 - 17