A Strategy of Assessing Gene Copy Number Differentiation Between Populations Using Ultra-Fast De Novo Assembly of Next-Generation Sequencing Data

Times cited: 0
Authors
Shi, Tao [1, 2]
Gao, Zhiyan [1, 2]
Zhang, Yue [1, 2, 3]
Rausher, Mark D. [4]
Chen, Jinming [1, 2, 3]
Affiliations
[1] Chinese Acad Sci, State Key Lab Plant Divers & Specialty Crops, Wuhan Bot Garden, Wuhan, Peoples R China
[2] Chinese Acad Sci, Hubei Key Lab Wetland Evolut & Ecol Restorat, Wuhan Bot Garden, Wuhan, Peoples R China
[3] Chinese Acad Sci, Aquat Plant Res Ctr, Wuhan Bot Garden, Wuhan, Peoples R China
[4] Duke Univ, Dept Biol, Durham, NC 27708 USA
Funding
National Natural Science Foundation of China;
Keywords
de novo assembly; gene copy number variation; Nelumbo; next-generation sequencing; COMPARATIVE GENOMIC HYBRIDIZATION; READ ALIGNMENT; NORTH-AMERICA; EASTERN ASIA; EVOLUTION; DUPLICATION; POLYMORPHISM; DIVERSIFICATION; DIVERSITY; DISCOVERY;
DOI
10.1111/1755-0998.14080
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010; 081704;
Abstract
Gene duplication and loss play pivotal roles in the evolutionary dynamics of genomes, contributing to phenotypic diversity and adaptation among species. However, detecting copy number variations (CNVs) in homoploid populations and newly diverged species from next-generation sequencing (NGS) short reads is often challenging with traditional methods, because GC-content variation and repetitive sequences cause uneven read coverage. To address these challenges, we developed a novel pipeline, ST4gCNV, which leverages ultra-fast de novo assemblies of NGS data to detect gene-specific CNVs between populations. The pipeline effectively reduces the variance in read coverage caused by technical factors such as GC bias, providing reliable CNV detection at a minimum sequencing depth of 10. We successfully applied ST4gCNV to resequencing data from the homoploid species Nelumbo nucifera and Nelumbo lutea (lotus) and revealed significant CNV-driven differentiation between these species, particularly in genes related to petal colour diversity, such as those involved in the anthocyanin pathway. By highlighting the extensive gene duplication and loss events in Nelumbo, our study demonstrates the utility of ST4gCNV in population genomics and underscores its potential for integrating genomic CNV analysis with traditional SNP-based resequencing analysis.
Pages: 14