Graph convolutional network based virus-human protein-protein interaction prediction for novel viruses

Cited by: 0
Authors
Koca, Mehmet Burak [1]
Nourani, Esmaeil [2]
Abbasoglu, Ferda [1]
Karadeniz, Ilknur [3]
Sevilgen, Fatih Erdogan [1, 4]
Affiliations
[1] Gebze Tech Univ, Fac Engn, Dept Comp Engn, Kocaeli, Turkey
[2] Azarbaijan Shahid Madani Univ, Fac Comp Engn & Informat Technol, Dept Informat Technol, Tabriz, Iran
[3] Isik Univ, Fac Engn & Nat Sci, Dept Comp Engn, Istanbul, Turkey
[4] Bogazici Univ, Inst Data Sci & Artificial Intelligence, Istanbul, Turkey
Keywords
PHI networks; Graph convolutional networks; Protein-protein interaction prediction; HOST; GENE; WEB
DOI
10.1016/j.compbiolchem.2022.107755
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Computational identification of human-virus protein-protein interactions (PHIs) is a worthwhile step towards understanding infection mechanisms. Analysis of PHI networks is important for the determination of pathogenic diseases. Predicting these interactions is a popular problem because experimental detection of PHIs is both time-consuming and expensive. Available methods use biological features such as amino acid sequences, molecular structure, or biological activities for prediction. Recent studies show that the topological properties of proteins in protein-protein interaction (PPI) networks improve prediction performance. Basic network projections, random-walk-based models, or graph neural networks are used to generate topologically enriched (hybrid) protein embeddings. In this study, we propose a three-stage machine learning pipeline that generates and uses hybrid embeddings for PHI prediction. In the first stage, numerical features are extracted from the amino acid sequences using the Doc2Vec and Byte Pair Encoding methods. In the second stage, these amino acid embeddings are used as node features for training a modified GraphSAGE model, which is an improved version of the graph convolutional network. In the final stage, the hybrid protein embeddings are used to train a binary classifier that predicts whether two given proteins interact. The proposed method is evaluated with comprehensive experiments to test its functionality and to compare it with state-of-the-art methods. The experimental results on the benchmark dataset demonstrate the effectiveness of the proposed model, which achieves a 3-23% higher area under the curve (AUC) score than its competitors.
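The data flow through the three stages described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the random vectors stand in for Doc2Vec/Byte Pair Encoding sequence features, the layer is a plain mean-aggregation GraphSAGE-style layer rather than the modified GraphSAGE model, the weights are untrained, and the function names, toy graph, and dimensions are all hypothetical.

```python
import numpy as np

def sage_mean_layer(features, adjacency, w_self, w_neigh):
    """One GraphSAGE-style layer with mean aggregation and ReLU."""
    degree = adjacency.sum(axis=1, keepdims=True)
    # Mean of neighbor features; isolated nodes get a zero vector.
    neigh_mean = adjacency @ features / np.maximum(degree, 1)
    hidden = np.maximum(features @ w_self + neigh_mean @ w_neigh, 0.0)
    # L2-normalize each embedding, as in the original GraphSAGE formulation.
    norms = np.linalg.norm(hidden, axis=1, keepdims=True)
    return hidden / np.maximum(norms, 1e-12)

def pair_features(embeddings, pairs):
    """Concatenate the two protein embeddings for each candidate pair."""
    left = embeddings[[i for i, _ in pairs]]
    right = embeddings[[j for _, j in pairs]]
    return np.concatenate([left, right], axis=1)

rng = np.random.default_rng(0)

# Stage 1 stand-in (assumption): random vectors replace sequence embeddings.
sequence_features = rng.normal(size=(5, 8))

# Toy undirected PPI graph over 5 proteins (hypothetical edges).
edges = [(0, 1), (1, 2), (2, 3)]
adjacency = np.zeros((5, 5))
for i, j in edges:
    adjacency[i, j] = adjacency[j, i] = 1.0

# Stage 2: untrained random weights; a real model would learn these.
w_self = rng.normal(size=(8, 4))
w_neigh = rng.normal(size=(8, 4))
hybrid = sage_mean_layer(sequence_features, adjacency, w_self, w_neigh)

# Stage 3 input: one row per candidate protein pair, ready for any
# binary interaction classifier.
candidate_pairs = [(0, 4), (1, 3)]
pair_matrix = pair_features(hybrid, candidate_pairs)
```

Each row of `pair_matrix` concatenates the two hybrid embeddings of a candidate pair. Concatenation is only one possible way to combine them and is chosen here for simplicity.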
Pages: 14