Identification of essential proteins based on edge features and the fusion of multiple-source biological information

被引:4
|
作者
Liu, Peiqiang [1 ]
Liu, Chang [1 ]
Mao, Yanyan [1 ,2 ]
Guo, Junhong [1 ]
Liu, Fanshu [1 ]
Cai, Wangmin [1 ]
Zhao, Feng [1 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] China Univ Petr East China, Coll Oceanog & Space Informat, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
Essential protein; Quasi-clique; Triangle graph; Dynamic protein-protein interaction network; Fusion method; CENTRALITY;
D O I
10.1186/s12859-023-05315-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundA major current focus in the analysis of protein-protein interaction (PPI) data is how to identify essential proteins. As massive PPI data are available, this warrants the design of efficient computing methods for identifying essential proteins. Previous studies have achieved considerable performance. However, as a consequence of the features of high noise and structural complexity in PPIs, it is still a challenge to further upgrade the performance of the identification methods.MethodsThis paper proposes an identification method, named CTF, which identifies essential proteins based on edge features including h-quasi-cliques and uv-triangle graphs and the fusion of multiple-source information. We first design an edge-weight function, named EWCT, for computing the topological scores of proteins based on quasi-cliques and triangle graphs. Then, we generate an edge-weighted PPI network using EWCT and dynamic PPI data. Finally, we compute the essentiality of proteins by the fusion of topological scores and three scores of biological information.ResultsWe evaluated the performance of the CTF method by comparison with 16 other methods, such as MON, PeC, TEGS, and LBCC, the experiment results on three datasets of Saccharomyces cerevisiae show that CTF outperforms the state-of-the-art methods. Moreover, our method indicates that the fusion of other biological information is beneficial to improve the accuracy of identification.
引用
下载
收藏
页数:24
相关论文
共 50 条
  • [21] Distributed Video Coding Based on Multiple-source Correlation Model
    Qing, Linbo
    He, Xiaohai
    Ou, Xianfeng
    Lv, Rui
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): : 1609 - 1614
  • [22] Lower limb locomotion modes recognition based on multiple-source information and general regression neural network
    Liu, Lei
    Yang, Peng
    Liu, Zuojun
    Jiqiren/Robot, 2015, 37 (03): : 310 - 317
  • [23] A Deep Learning Framework for Identifying Essential Proteins by Integrating Multiple Types of Biological Information
    Zeng, Min
    Li, Min
    Fei, Zhihui
    Wu, Fang-Xiang
    Li, Yaohang
    Pan, Yi
    Wang, Jianxin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (01) : 296 - 305
  • [24] An Information-Centric Multiple-Source Routing Scheme for Wireless Sensor Networks
    Wang, Yang
    Wu, Jun
    Han, Weiyi
    Li, Jianhua
    Li, Qiang
    Wang, Shen
    2017 5TH IEEE INTERNATIONAL CONFERENCE ON SMART ENERGY GRID ENGINEERING (SEGE), 2017, : 362 - 366
  • [25] Identification of essential proteins based on a new combination of topological and biological features in weighted protein-protein interaction networks
    Elahi, Abdolkarim
    Babamir, Seyed Morteza
    IET SYSTEMS BIOLOGY, 2018, 12 (06) : 247 - 257
  • [26] CONSTRUCTION OF 4-BAND WAVELET AND ITS APPLICATION IN MULTIPLE-SOURCE IMAGE FUSION
    Shen, Zheng-Wei
    Liu, Yu
    Liao, Fu-Cheng
    Tang, Zhao-Hui
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1 AND 2, 2008, : 128 - 133
  • [27] Information fusion by combining multiple features and classifiers
    Mao, JC
    INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING, 1998, 3545 : 542 - 549
  • [28] A new algorithm for essential proteins identification based on the integration of protein complex co-expression information and edge clustering coefficient
    Luo, Jiawei
    Wu, Juan
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (03) : 257 - 274
  • [29] Multiple-source Domain Adaptation in Rule-based Neural Network
    Zuo, Hua
    Lu, Jie
    Zhang, Guangquan
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [30] Fusion of Multiple Gait Features for Human Identification
    Hong, Sungjun
    Lee, Heesung
    An, Sung Je
    Kim, Euntai
    2008 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, VOLS 1-4, 2008, : 1826 - 1830