SAGA: a subgraph matching tool for biological graphs

被引:134
|
作者
Tian, Yuanyuan
McEachin, Richard C.
Santos, Carlos
States, David J.
Patel, Jignesh M. [1 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Natl Ctr Integrat Biomed Informat, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Human Genet, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Bioinformat Program, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btl571
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: With the rapid increase in the availability of biological graph datasets, there is a growing need for effective and efficient graph querying methods. Due to the noisy and incomplete characteristics of these datasets, exact graph matching methods have limited use and approximate graph matching methods are required. Unfortunately, existing graph matching methods are too restrictive as they only allow exact or near exact graph matching. This paper presents a novel approximate graph matching technique called SAGA. This technique employs a flexible model for computing graph similarity, which allows for node gaps, node mismatches and graph structural differences. SAGA employs an indexing technique that allows it to efficiently evaluate queries even against large graph datasets. Results: SAGA has been used to query biological pathways and literature datasets, which has revealed interesting similarities between distinct pathways that cannot be found by existing methods. These matches associate seemingly unrelated biological processes, connect studies in different sub-areas of biomedical research and thus pose hypotheses for new discoveries. SAGA is also orders of magnitude faster than existing methods.
引用
收藏
页码:232 / 239
页数:8
相关论文
共 50 条
  • [1] Subgraph matching on temporal graphs
    Li, Faming
    Zou, Zhaonian
    [J]. INFORMATION SCIENCES, 2021, 578 : 539 - 558
  • [2] Streaming Temporal Graphs: Subgraph Matching
    Goodman, Eric L.
    Grunwald, Dirk
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4977 - 4986
  • [3] Durable Subgraph Matching on Temporal Graphs
    Li, Faming
    Zou, Zhaonian
    Li, Jianzhong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 4713 - 4726
  • [4] Parallel Subgraph Matching on Massive Graphs
    Suo, Bo
    Li, Zhanhuai
    Pan, Wei
    [J]. 2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 1932 - 1937
  • [5] Evolving subgraph matching on temporal graphs
    Li, Faming
    Zou, Zhaonian
    Li, Jianzhong
    Yang, Xiaochun
    Wang, Bin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [6] A survey of continuous subgraph matching for dynamic graphs
    Wang, Xi
    Zhang, Qianzhen
    Guo, Deke
    Zhao, Xiang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (03) : 945 - 989
  • [7] Consistent Subgraph Matching over Large Graphs
    Yuan, Ye
    Ma, Delong
    Zhang, Aoqian
    Wang, Guoren
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2536 - 2548
  • [8] A survey of continuous subgraph matching for dynamic graphs
    Xi Wang
    Qianzhen Zhang
    Deke Guo
    Xiang Zhao
    [J]. Knowledge and Information Systems, 2023, 65 : 945 - 989
  • [9] Efficient Subgraph Matching on Billion Node Graphs
    Sun, Zhao
    Wang, Hongzhi
    Wang, Haixun
    Shao, Bin
    Li, Jianzhong
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (09): : 788 - 799
  • [10] Privacy Preserving Subgraph Matching on Large Graphs in Cloud
    Chang, Zhao
    Zou, Lei
    Li, Feifei
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 199 - 213