Data-driven network alignment

被引:12
|
作者
Gu, Shawn [1 ,2 ,3 ]
Milenkovic, Tijana [1 ,2 ,3 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Notre Dame, Eck Inst Global Hlth, Notre Dame, IN 46556 USA
[3] Univ Notre Dame, Ctr Network & Data Sci, Notre Dame, IN 46556 USA
来源
PLOS ONE | 2020年 / 15卷 / 07期
基金
美国国家科学基金会;
关键词
PROTEIN-INTERACTION NETWORKS; BIOGRID INTERACTION DATABASE; GLOBAL ALIGNMENT; MAXIMIZING ACCURACY; GENE ONTOLOGY; NODE;
D O I
10.1371/journal.pone.0234978
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, we deal with the problem of biological network alignment (NA), which aims to find a node mapping between species' molecular networks that uncovers similar network regions, thus allowing for the transfer of functional knowledge between the aligned nodes. We provide evidence that current NA methods, which assume that topologically similar nodes (i.e., nodes whose network neighborhoods are isomorphic-like) have high functional relatedness, do not actually end up aligning functionally related nodes. That is, we show that the current topological similarity assumption does not hold well. Consequently, we argue that a paradigm shift is needed with how the NA problem is approached. So, we redefine NA as a data-driven framework, called TARA (data-driven NA), which attempts to learn the relationship between topological relatedness and functional relatedness without assuming that topological relatedness corresponds to topological similarity. TARA makes no assumptions about what nodes should be aligned, distinguishing it from existing NA methods. Specifically, TARA trains a classifier to predict whether two nodes from different networks are functionally related based on their network topological patterns (features). We find that TARAisable to make accurate predictions. TARA then takes each pair of nodes that are predicted as related to be part of an alignment. Like traditional NA methods, TARA uses this alignment for the across-species transfer of functional knowledge. TARA as currently implemented uses topological but not protein sequence information for functional knowledge transfer. In this context, we find that TARA outperforms existing state-of-the-art NA methods that also use topological information, WAVE and SANA, and even outperforms or complements a state-of-the-art NA method that uses both topological and sequence information, PrimAlign. Hence, adding sequence information to TARA, which is our future work, is likely to further improve its performance. The software and data are available at.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] Data-driven biological network alignment that uses topological, sequence, and functional information
    Gu, Shawn
    Milenkovic, Tijana
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [2] Data-driven biological network alignment that uses topological, sequence, and functional information
    Shawn Gu
    Tijana Milenković
    BMC Bioinformatics, 22
  • [3] Data-driven incentive alignment in capitation schemes
    Braverman, Mark
    Chassang, Sylvain
    JOURNAL OF PUBLIC ECONOMICS, 2022, 207
  • [4] An Ontology Alignment Framework for Data-driven Microservices
    Salvadori, Ivan
    Oliveira, Bruno C. N.
    Huf, Alexis
    Inacio, Eduardo C.
    Siqueira, Frank
    19TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2017), 2017, : 425 - 433
  • [5] Data-driven network loading
    Tsanakas, N.
    Ekstrom, J.
    Gundlegard, D.
    Olstam, J.
    Rydergren, C.
    TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2021, 9 (01) : 237 - 265
  • [6] Data-Driven Tensor Dictionary Learning for Image Alignment
    Yu, Quan
    Bai, Minru
    JOURNAL OF SCIENTIFIC COMPUTING, 2025, 102 (03)
  • [7] Efficient Data-Driven Network Functions
    Yao, Zhiyuan
    Desmouceaux, Yoann
    Cordero-Fuertes, Juan-Antonio
    Townsley, Mark
    Clausen, Thomas
    2022 30TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, MASCOTS, 2022, : 152 - 159
  • [8] Data-Driven Network Neuroscience: On Data Collection and Benchmark
    Xu, Jiaxing
    Yang, Yunhan
    Huang, David Tse Jung
    Gururajapathy, Sophi Shilpa
    Ke, Yiping
    Qiao, Miao
    Wang, Alan
    Kumar, Haribalan
    McGeown, Josh
    Kwon, Eryn
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Data-driven alignment of the HERA-B outer tracker
    Belotelov I.
    Lanyov A.
    Ososkov G.
    Physics of Particles and Nuclei Letters, 2006, 3 (5) : 335 - 339
  • [10] Distributed Data-Driven Control of Network Systems
    Celi, Federico
    Baggio, Giacomo
    Pasqualetti, Fabio
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2023, 2 : 93 - 107