deepHPI: a comprehensive deep learning platform for accurate prediction and visualization of host-pathogen protein-protein interactions

被引:4
|
作者
Kaundal, Rakesh [1 ,2 ,3 ]
Loaiza, Cristian D. [4 ]
Duhan, Naveen [5 ]
Flann, Nicholas [3 ]
机构
[1] Utah State Univ, Dept Plants Soils & Climate, Coll Agr & Appl Sci, Logan, UT 84322 USA
[2] Utah State Univ, Bioinformat Facil, Ctr Integrated BioSyst, Logan, UT 84322 USA
[3] Utah State Univ, Dept Comp Sci, Coll Sci, Logan, UT 84322 USA
[4] Utah State Univ, Dept Plants Soils & Climate, Logan, UT 84322 USA
[5] Utah State Univ, Dept Plants Soils & Climate, Ctr Integrated BioSyst, Coll Agr & Appl Sci, Logan, UT 84322 USA
基金
美国农业部;
关键词
host-pathogen interactions; deep learning; convolutional neural networks (CNNs); neural networks; prediction; computational modeling; INTERACTION NETWORKS; DATABASE;
D O I
10.1093/bib/bbac125
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Host-pathogen protein interactions (HPPIs) play vital roles in many biological processes and are directly involved in infectious diseases. With the outbreak of more frequent pandemics in the last couple of decades, such as the recent outburst of Covid-19 causing millions of deaths, it has become more critical to develop advanced methods to accurately predict pathogen interactions with their respective hosts. During the last decade, experimental methods to identify HPIs have been used to decipher host-pathogen systems with the caveat that those techniques are labor-intensive, expensive and time-consuming. Alternatively, accurate prediction of HPIs can be performed by the use of data-driven machine learning. To provide a more robust and accurate solution for the HPI prediction problem, we have developed a deepHPI tool based on deep learning. The web server delivers four host-pathogen model types: plant-pathogen, human-bacteria, human-virus and animal-pathogen, leveraging its operability to a wide range of analyses and cases of use. The deepHPI web tool is the first to use convolutional neural network models for HPI prediction. These models have been selected based on a comprehensive evaluation of protein features and neural network architectures. The best prediction models have been tested on independent validation datasets, which achieved an overall Matthews correlation coefficient value of 0.87 for animal-pathogen using the combined pseudo-amino acid composition and conjoint triad (PAAC_CT) features, 0.75 for human-bacteria using the combined pseudo-amino acid composition, conjoint triad and normalized Moreau-Broto feature (PAAC_CT_NMBroto), 0.96 for human-virus using PAAC_CT_NMBroto and 0.94 values for plant-pathogen interactions using the combined pseudo-amino acid composition, composition and transition feature (PAAC_CTDC_CTDT). Our server running deepHPI is deployed on a high-performance computing cluster that enables large and multiple user requests, and it provides more information about interactions discovered. It presents an enriched visualization of the resulting host-pathogen networks that is augmented with external links to various protein annotation resources. We believe that the deepHPI web server will be very useful to researchers, particularly those working on infectious diseases. Additionally, many novel and known host-pathogen systems can be further investigated to significantly advance our understanding of complex disease-causing agents. The developed models are established on a web server, which is freely accessible at http://bioinfo. usu.edu/deepHPI/.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Computational prediction of host-pathogen protein-protein interactions
    Dyer, Matthew D.
    Murali, T. M.
    Sobral, Bruno W.
    [J]. BIOINFORMATICS, 2007, 23 (13) : I159 - I166
  • [2] Comparative mapping of host-pathogen protein-protein interactions
    Shah, Priya S.
    Wojcechowskyj, Jason A.
    Eckhardt, Manon
    Krogan, Nevan J.
    [J]. CURRENT OPINION IN MICROBIOLOGY, 2015, 27 : 62 - 68
  • [3] Comprehensive host-pathogen protein-protein interaction network analysis
    Babak Khorsand
    Abdorreza Savadi
    Mahmoud Naghibzadeh
    [J]. BMC Bioinformatics, 21
  • [4] Comprehensive host-pathogen protein-protein interaction network analysis
    Khorsand, Babak
    Savadi, Abdorreza
    Naghibzadeh, Mahmoud
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [5] Multitask learning for host-pathogen protein interactions
    Kshirsagar, Meghana
    Carbonell, Jaime
    Klein-Seetharaman, Judith
    [J]. BIOINFORMATICS, 2013, 29 (13) : 217 - 226
  • [6] A Comparative Approach to Characterize the Landscape of Host-Pathogen Protein-Protein Interactions
    Muller, Mandy
    Cassonnet, Patricia
    Favre, Michel
    Jacob, Yves
    Demeret, Caroline
    [J]. JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2013, (77):
  • [7] A framework towards data analytics on host-pathogen protein-protein interactions
    Chen, Huaming
    Shen, Jun
    Wang, Lei
    Song, Jiangning
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (11) : 4667 - 4679
  • [8] Elucidation of host-pathogen protein-protein interactions to uncover mechanisms of host cell rewiring
    Nicod, Charlotte
    Banaei-Esfahani, Amir
    Collins, Ben C.
    [J]. CURRENT OPINION IN MICROBIOLOGY, 2017, 39 : 7 - 15
  • [9] High throughput methods to study protein-protein interactions during host-pathogen interactions
    Chandrasekharan, Giridhar
    Unnikrishnan, Meera
    [J]. EUROPEAN JOURNAL OF CELL BIOLOGY, 2024, 103 (02)
  • [10] STRUCTURAL MODELS FOR HOST-PATHOGEN PROTEIN-PROTEIN INTERACTIONS: ASSESSING COVERAGE AND BIAS
    Franzosa, Eric A.
    Xia, Yu
    [J]. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2012, 2012, : 287 - 298