Mako: A Graph-based Pattern Growth Approach to Detect Complex Structural Variants

被引:4
|
作者
Lin, Jiadong [1 ,2 ,3 ,4 ]
Yang, Xiaofei [2 ,5 ]
Kosters, Walter [4 ]
Xu, Tun [1 ]
Jia, Yanyan [1 ]
Wang, Songbo [1 ]
Zhu, Qihui [6 ]
Ryan, Mallory [6 ]
Guo, Li [2 ]
Zhang, Chengsheng [6 ,7 ]
Lee, Charles [6 ,7 ]
Devine, Scott E. [1 ,8 ]
Eichler, Evan E. [9 ,10 ]
Ye, Kai [1 ,2 ,3 ,11 ]
机构
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Automat Sci & Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, MOE Key Lab Intelligent Networks & Networks Secur, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Affiliated Hosp 1, Genome Inst, Xian 710061, Peoples R China
[4] Leiden Univ, Fac Sci, Leiden Inst Adv Comp Sci, NL-2311 EZ Leiden, Netherlands
[5] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Comp Sci & Technol, Xian 710049, Peoples R China
[6] Jackson Lab Genom Med, Farmington, CT 06032 USA
[7] Xi An Jiao Tong Univ, Affiliated Hosp 1, Precis Med Ctr, Xian 710061, Peoples R China
[8] Univ Maryland, Sch Med, Inst Genome Sci, Baltimore, MD 21201 USA
[9] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98119 USA
[10] Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USA
[11] Xi An Jiao Tong Univ, Sch Life Sci & Technol, Xian 710049, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
Next-generation sequencing; Complex structural variant; Pattern growth; Graph mining; Formation mechanism; MINING SEQUENTIAL PATTERNS; GENOMIC REARRANGEMENTS; PAIRED-END; EVOLUTION; ALIGNMENT; CHROMOTHRIPSIS; ALGORITHMS; MECHANISMS; DISCOVERY; GERMLINE;
D O I
10.1016/j.gpb.2021.03.007
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Complex structural variants (CSVs) are genomic alterations that have more than two breakpoints and are considered as the simultaneous occurrence of simple structural variants. How-ever, detecting the compounded mutational signals of CSVs is challenging through a commonly used model-match strategy. As a result, there has been limited progress for CSV discovery com-pared with simple structural variants. Here, we systematically analyzed the multi-breakpoint con-nection feature of CSVs, and proposed Mako, utilizing a bottom-up guided model-free strategy, to detect CSVs from paired-end short-read sequencing. Specifically, we implemented a graph-based pattern growth approach, where the graph depicts potential breakpoint connections, and pattern growth enables CSV detection without pre-defined models. Comprehensive evaluations on both simulated and real datasets revealed that Mako outperformed other algorithms. Notably, validation rates of CSVs on real data based on experimental and computational validations as well as manual inspections are around 70%, where the medians of experimental and computational breakpoint shift are 13 bp and 26 bp, respectively. Moreover, the Mako CSV subgraph effectively characterized the breakpoint connections of a CSV event and uncovered a total of 15 CSV types, including two novel types of adjacent segment swap and tandem dispersed duplication. Further analysis of these CSVs also revealed the impact of sequence homology on the formation of CSVs. Mako is publicly available at https://github.com/xjtu-omics/Mako.
引用
收藏
页码:205 / 218
页数:14
相关论文
共 50 条
  • [21] Graph-based traceability: a comprehensive approach
    Hannes Schwarz
    Jürgen Ebert
    Andreas Winter
    [J]. Software & Systems Modeling, 2010, 9 : 473 - 492
  • [22] A GRAPH-BASED APPROACH TO SURFACE RECONSTRUCTION
    MENCL, R
    [J]. COMPUTER GRAPHICS FORUM, 1995, 14 (03) : C445 - C456
  • [23] Graph-based traceability: a comprehensive approach
    Schwarz, Hannes
    Ebert, Juergen
    Winter, Andreas
    [J]. SOFTWARE AND SYSTEMS MODELING, 2010, 9 (04): : 473 - 492
  • [24] A graph-based approach to inequality assessment
    Palestini, Arsen
    Pignataro, Giuseppe
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2016, 455 : 65 - 78
  • [25] A GRAPH-BASED APPROACH FOR SEMISUPERVISED CLUSTERING
    Yoshida, Tetsuya
    [J]. COMPUTATIONAL INTELLIGENCE, 2014, 30 (02) : 263 - 284
  • [26] A Graph-based approach for Kite recognition
    Madi, Kamel
    Seba, Hamida
    Kheddouci, Hamamache
    Barge, Olivier
    [J]. PATTERN RECOGNITION LETTERS, 2017, 87 : 186 - 194
  • [27] Grid Smoothing: A Graph-Based Approach
    Noel, Guillaume
    Djouani, Karim
    Hamam, Yskandar
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, 2010, 6419 : 153 - 160
  • [28] Graph-based Approach to Reliability Assessment
    Sinitca, Aleksandr M.
    Shalugin, Evgeniy D.
    [J]. PROCEEDINGS OF THE 2021 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (ELCONRUS), 2021, : 682 - 685
  • [29] A Precedence Graph-Based Approach to Detect Message Injection Attacks in J1939 Based Networks
    Mukherjee, Subhojeet
    Walker, Jacob
    Ray, Indrakshi
    Daily, Jeremy
    [J]. 2017 15TH ANNUAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2017, : 67 - 76
  • [30] A Graph-Based Approach for Image Segmentation
    Le, Thang V.
    Kulikowski, Casimir A.
    Muchnik, Ilya B.
    [J]. ADVANCES IN VISUAL COMPUTING, PT I, PROCEEDINGS, 2008, 5358 : 278 - +