Long-read-based human genomic structural variation detection with cuteSV

Cited by: 183
Authors
Jiang, Tao [1]
Liu, Yongzhuang [1]
Jiang, Yue [2]
Li, Junyi [3]
Gao, Yan [1]
Cui, Zhe [1]
Liu, Yadong [1]
Liu, Bo [1]
Wang, Yadong [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Ctr Bioinformat, Harbin 150001, Heilongjiang, Peoples R China
[2] Nebula Genom, Harbin 150030, Heilongjiang, Peoples R China
[3] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Guangdong, Peoples R China
Keywords
Structural variants detection; Long-read sequencing; Scaling performance; PAIRED-END; IMPACT; DISCOVERY; INSERTION; VARIANTS; SEQUENCE;
DOI
10.1186/s13059-020-02107-y
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005; 0836; 090102; 100705
Abstract
Long-read sequencing is promising for the comprehensive discovery of structural variations (SVs). However, it is still non-trivial to achieve high yields and performance simultaneously due to the complex SV signatures implied by noisy long reads. We propose cuteSV, a sensitive, fast, and scalable long-read-based SV detection approach. cuteSV uses tailored methods to collect the signatures of various types of SVs and employs a clustering-and-refinement method to implement sensitive SV detection. Benchmarks on simulated and real long-read sequencing datasets demonstrate that cuteSV has higher yields and scaling performance than state-of-the-art tools. cuteSV is available at https://github.com/tjiangHIT/cuteSV.
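The abstract describes a clustering-and-refinement strategy only at a high level. As a rough illustration of that general idea, and not cuteSV's actual algorithm, the sketch below groups per-read SV signatures by position and then discards signatures whose length disagrees with the cluster. The `Signature` structure, the function names, and the thresholds (`max_gap`, `min_support`, `length_ratio`) are all hypothetical and chosen only for this example.

```python
from dataclasses import dataclass
from statistics import median
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Signature:
    """One SV signature implied by a single read (hypothetical structure)."""
    sv_type: str  # e.g. "DEL" or "INS"
    pos: int      # reference start coordinate
    length: int   # SV length implied by the read (assumed > 0)
    read: str     # read name


def cluster_by_position(sigs: List[Signature], max_gap: int = 200) -> List[List[Signature]]:
    """Group same-type signatures whose neighbouring start positions are within max_gap."""
    clusters: List[List[Signature]] = []
    for sig in sorted(sigs, key=lambda s: (s.sv_type, s.pos)):
        last = clusters[-1] if clusters else None
        if last and last[-1].sv_type == sig.sv_type and sig.pos - last[-1].pos <= max_gap:
            last.append(sig)
        else:
            clusters.append([sig])
    return clusters


def refine(cluster: List[Signature], min_support: int = 3,
           length_ratio: float = 0.7) -> Optional[Dict[str, object]]:
    """Drop signatures whose length disagrees with the cluster median, then
    report a call only if enough distinct reads still support it."""
    med = median(s.length for s in cluster)
    kept = [s for s in cluster
            if min(s.length, med) / max(s.length, med) >= length_ratio]
    support = len({s.read for s in kept})
    if support < min_support:
        return None
    return {
        "type": kept[0].sv_type,
        "pos": int(median(s.pos for s in kept)),
        "length": int(median(s.length for s in kept)),
        "support": support,
    }


def call_svs(sigs: List[Signature]) -> List[Dict[str, object]]:
    """Cluster signatures, refine each cluster, and keep the surviving calls."""
    refined = (refine(cluster) for cluster in cluster_by_position(sigs))
    return [call for call in refined if call is not None]


# Toy input: three reads agree on a ~300 bp deletion, one read is a noisy outlier,
# and a lone insertion signature lacks support.
signatures = [
    Signature("DEL", 1000, 300, "r1"),
    Signature("DEL", 1010, 310, "r2"),
    Signature("DEL", 990, 295, "r3"),
    Signature("DEL", 1005, 50, "r4"),
    Signature("INS", 5000, 120, "r5"),
]
calls = call_svs(signatures)
```

In this toy input the 50 bp outlier is filtered during refinement and the single-read insertion is rejected for lack of support, leaving one deletion call.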
Pages: 24