Data-driven network alignment

被引:12
|
作者
Gu, Shawn [1 ,2 ,3 ]
Milenkovic, Tijana [1 ,2 ,3 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Notre Dame, Eck Inst Global Hlth, Notre Dame, IN 46556 USA
[3] Univ Notre Dame, Ctr Network & Data Sci, Notre Dame, IN 46556 USA
来源
PLOS ONE | 2020年 / 15卷 / 07期
基金
美国国家科学基金会;
关键词
PROTEIN-INTERACTION NETWORKS; BIOGRID INTERACTION DATABASE; GLOBAL ALIGNMENT; MAXIMIZING ACCURACY; GENE ONTOLOGY; NODE;
D O I
10.1371/journal.pone.0234978
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, we deal with the problem of biological network alignment (NA), which aims to find a node mapping between species' molecular networks that uncovers similar network regions, thus allowing for the transfer of functional knowledge between the aligned nodes. We provide evidence that current NA methods, which assume that topologically similar nodes (i.e., nodes whose network neighborhoods are isomorphic-like) have high functional relatedness, do not actually end up aligning functionally related nodes. That is, we show that the current topological similarity assumption does not hold well. Consequently, we argue that a paradigm shift is needed with how the NA problem is approached. So, we redefine NA as a data-driven framework, called TARA (data-driven NA), which attempts to learn the relationship between topological relatedness and functional relatedness without assuming that topological relatedness corresponds to topological similarity. TARA makes no assumptions about what nodes should be aligned, distinguishing it from existing NA methods. Specifically, TARA trains a classifier to predict whether two nodes from different networks are functionally related based on their network topological patterns (features). We find that TARAisable to make accurate predictions. TARA then takes each pair of nodes that are predicted as related to be part of an alignment. Like traditional NA methods, TARA uses this alignment for the across-species transfer of functional knowledge. TARA as currently implemented uses topological but not protein sequence information for functional knowledge transfer. In this context, we find that TARA outperforms existing state-of-the-art NA methods that also use topological information, WAVE and SANA, and even outperforms or complements a state-of-the-art NA method that uses both topological and sequence information, PrimAlign. Hence, adding sequence information to TARA, which is our future work, is likely to further improve its performance. The software and data are available at.
引用
收藏
页数:30
相关论文
共 50 条
  • [31] Data-Driven Network Visualization for Innovation and Competitive Intelligence
    Sarica, Serhad
    Yan, Bowen
    Bulato, Gerardo
    Jaipurkar, Pratik
    Luo, Jianxi
    PROCEEDINGS OF THE 52ND ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2019, : 127 - 135
  • [32] Research on Data-Driven Distribution Network Planning Method
    Liu, Xingdong
    Xu, Daolin
    Peng, Hui
    Xu, XiaoChuan
    Liu, HuiDeng
    Zhang, Xin
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [33] Data-driven dynamics reconstruction using RBF network
    Du, Cong-Cong
    Wang, Xuan
    Wang, Zhangsen
    Wang, Da-Hui
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04):
  • [34] Network synthesis through data-driven growth and decay
    Ji, CY
    Psaltis, D
    NEURAL NETWORKS, 1997, 10 (06) : 1133 - 1141
  • [35] Data-driven performance metrics for neural network learning
    Alessandri, Angelo
    Gaggero, Mauro
    Sanguineti, Marcello
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2023,
  • [36] Data-Driven Approach for Inferencing Causality and Network Topology
    Sinha, Subhrajit
    Vaidya, Umesh
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 436 - 441
  • [37] Occupational mobility and automation: a data-driven network model
    del Rio-Chanona, R. Maria
    Mealy, Penny
    Beguerisse-Diaz, Mariano
    Lafond, Francois
    Farmer, J. Doyne
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2021, 18 (174)
  • [38] Data-driven statistical optimization of a groundwater monitoring network
    Meggiorin, Mara
    Naranjo-Fernandez, Nuria
    Passadore, Giulia
    Sottani, Andrea
    Botter, Gianluca
    Rinaldo, Andrea
    JOURNAL OF HYDROLOGY, 2024, 631
  • [39] Data-Driven Network Analysis for Anomaly Traffic Detection
    Alam, Shumon
    Alam, Yasin
    Cui, Suxia
    Akujuobi, Cajetan
    SENSORS, 2023, 23 (19)
  • [40] Manufacturing network simulation using a data-driven model
    Essaid M.
    Grimaud F.
    Burlat P.
    International Journal of Simulation and Process Modelling, 2011, 6 (03) : 228 - 237