Mako: A Graph-based Pattern Growth Approach to Detect Complex Structural Variants

被引:4
|
作者
Lin, Jiadong [1 ,2 ,3 ,4 ]
Yang, Xiaofei [2 ,5 ]
Kosters, Walter [4 ]
Xu, Tun [1 ]
Jia, Yanyan [1 ]
Wang, Songbo [1 ]
Zhu, Qihui [6 ]
Ryan, Mallory [6 ]
Guo, Li [2 ]
Zhang, Chengsheng [6 ,7 ]
Lee, Charles [6 ,7 ]
Devine, Scott E. [1 ,8 ]
Eichler, Evan E. [9 ,10 ]
Ye, Kai [1 ,2 ,3 ,11 ]
机构
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Automat Sci & Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, MOE Key Lab Intelligent Networks & Networks Secur, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Affiliated Hosp 1, Genome Inst, Xian 710061, Peoples R China
[4] Leiden Univ, Fac Sci, Leiden Inst Adv Comp Sci, NL-2311 EZ Leiden, Netherlands
[5] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Comp Sci & Technol, Xian 710049, Peoples R China
[6] Jackson Lab Genom Med, Farmington, CT 06032 USA
[7] Xi An Jiao Tong Univ, Affiliated Hosp 1, Precis Med Ctr, Xian 710061, Peoples R China
[8] Univ Maryland, Sch Med, Inst Genome Sci, Baltimore, MD 21201 USA
[9] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98119 USA
[10] Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USA
[11] Xi An Jiao Tong Univ, Sch Life Sci & Technol, Xian 710049, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
Next-generation sequencing; Complex structural variant; Pattern growth; Graph mining; Formation mechanism; MINING SEQUENTIAL PATTERNS; GENOMIC REARRANGEMENTS; PAIRED-END; EVOLUTION; ALIGNMENT; CHROMOTHRIPSIS; ALGORITHMS; MECHANISMS; DISCOVERY; GERMLINE;
D O I
10.1016/j.gpb.2021.03.007
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Complex structural variants (CSVs) are genomic alterations that have more than two breakpoints and are considered as the simultaneous occurrence of simple structural variants. How-ever, detecting the compounded mutational signals of CSVs is challenging through a commonly used model-match strategy. As a result, there has been limited progress for CSV discovery com-pared with simple structural variants. Here, we systematically analyzed the multi-breakpoint con-nection feature of CSVs, and proposed Mako, utilizing a bottom-up guided model-free strategy, to detect CSVs from paired-end short-read sequencing. Specifically, we implemented a graph-based pattern growth approach, where the graph depicts potential breakpoint connections, and pattern growth enables CSV detection without pre-defined models. Comprehensive evaluations on both simulated and real datasets revealed that Mako outperformed other algorithms. Notably, validation rates of CSVs on real data based on experimental and computational validations as well as manual inspections are around 70%, where the medians of experimental and computational breakpoint shift are 13 bp and 26 bp, respectively. Moreover, the Mako CSV subgraph effectively characterized the breakpoint connections of a CSV event and uncovered a total of 15 CSV types, including two novel types of adjacent segment swap and tandem dispersed duplication. Further analysis of these CSVs also revealed the impact of sequence homology on the formation of CSVs. Mako is publicly available at https://github.com/xjtu-omics/Mako.
引用
收藏
页码:205 / 218
页数:14
相关论文
共 50 条
  • [1] Mako:A Graph-based Pattern Growth Approach to Detect Complex Structural Variants
    Jiadong Lin
    Xiaofei Yang
    Walter Kosters
    Tun Xu
    Yanyan Jia
    Songbo Wang
    Qihui Zhu
    Mallory Ryan
    Li Guo
    Chengsheng Zhang
    Mark B.Gerstein
    Ashley D.Sanders
    Micheal C.Zody
    Michael E.Talkowski
    Ryan E.Mills
    Jan O.Korbel
    Tobias Marschall
    Peter Ebert
    Peter A.Audano
    Bernardo Rodriguez-Martin
    David Porubsky
    Marc Jan Bonder
    Arvis Sulovari
    Jana Ebler
    Weichen Zhou
    Rebecca Serra Mari
    Feyza Yilmaz
    Xuefang Zhao
    Ping Hsun Hsieh
    Joyce Lee
    Sushant Kumar
    Tobias Rausch
    Yu Chen
    Zechen Chong
    Katherine M.Munson
    Mark J.P.Chaisson
    Junjie Chen
    Xinghua Shi
    Aaron M.Wenger
    William T.Harvey
    Patrick Hansenfeld
    Allison Regier
    Ira M.Hall
    Paul Flicek
    Alex R.Hastie
    Susan Fairely
    Charles Lee
    Scott E.Devine
    Evan E.Eichler
    Kai Ye
    [J]. Genomics,Proteomics & Bioinformatics, 2022, 20 (01) : 205 - 218
  • [2] A Graph-based Approach to Detect DoB Attack
    Thomas, Diya
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2021, : 422 - 423
  • [3] A graph-based approach to detect unexplained sequences in a log
    Cinque, Marcello
    Della Corte, Raffaele
    Moscato, Vincenzo
    Sperli, Giancarlo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 171
  • [4] A Structural Approach to Assess Graph-Based Exercises
    Sousa, Ruben
    Leal, Jose Paulo
    [J]. LANGUAGES, APPLICATIONS AND TECHNOLOGIES, SLATE 2015, 2015, 563 : 182 - 193
  • [5] An efficient and accurate graph-based approach to detect population substructure
    Sridhar, Srinath
    Rao, Satish
    Halperin, Eran
    [J]. RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2007, 4453 : 503 - +
  • [6] A GRAPH-BASED APPROACH TO DETECT ABNORMAL SPATIAL POINTS AND REGIONS
    Lu, Chang-Tien
    Dos Santos, Raimundo F., Jr.
    Liu, Xutong
    Kou, Yufeng
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2011, 20 (04) : 721 - 751
  • [7] Graph-based object trackinig using structural pattern recognition
    Graciano, Ana B. V.
    Cesar, Roberto M., Jr.
    Bloch, Isabelle
    [J]. PROCEEDINGS OF THE XX BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, 2007, : 179 - +
  • [8] Graph-based structural data mining in cognitive pattern interpretation
    Ogiela, Lidia
    Tadeusiewicz, Ryszard
    Ogiela, Marek R.
    [J]. GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2006, 4201 : 349 - 350
  • [9] A Hybrid Pattern Knowledge Graph-Based API Recommendation Approach
    Wang, Guan
    Wang, Weidong
    Li, Dian
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 465 - 476
  • [10] Graph-Based Approaches to Structural Universals and Complex States of Affairs
    Masolo, Claudio
    Vieu, Laure
    [J]. FORMAL ONTOLOGY IN INFORMATION SYSTEMS (FOIS 2018), 2018, 306 : 69 - 82