PepMNet: a hybrid deep learning model for predicting peptide properties using hierarchical graph representations

被引:0
|
作者
Otero, Daniel Garzon [1 ]
Akbari, Omid [1 ]
Bilodeau, Camille [1 ]
机构
[1] Univ Virginia, Chem Engn Dept, 385 McCormick Rd, Charlottesville, VA 22903 USA
来源
关键词
RETENTION TIME PREDICTION; ANTIMICROBIAL PEPTIDES; IDENTIFICATION;
D O I
10.1039/d4me00172a
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Peptides are a powerful class of molecules that can be applied to a range of problems including biomaterials development and drug design. Currently, machine learning-based property prediction models for peptides primarily rely on amino acid sequence, resulting in two key limitations: first, they are not compatible with non-natural peptide features like modified sidechains or staples, and second, they use human-crafted features to describe the relationships between different amino acids, which reduces the model's flexibility and generalizability. To address these challenges, we have developed PepMNet, a deep learning model that integrates atom-level and amino acid-level information through a hierarchical graph approach. The model first learns from an atom-level graph and then generates amino acid representations based on the atomic information captured in the first stage. These amino acid representations are then combined using graph convolutions on an amino acid-level graph to produce a molecular-level representation, which is then passed to a fully connected neural network for property prediction. We evaluated this architecture by predicting two peptide properties: chromatographic retention time (RT) as a regression task and antimicrobial peptide (AMP) activity as a classification task. For the regression task, PepMNet achieved an average R2 of 0.980 across eight datasets, which spanned different dataset sizes and three liquid chromatography (LC) methods. For the classification task, we developed an ensemble of five models to reduce overfitting and ensure robust classification performance, achieving an area under the receiver operating curve (AUC-ROC) of 0.978 and an average precision of 0.981. Overall, our model illustrates the potential for hierarchical deep learning models to learn peptide properties without relying on human engineering amino acid features.
引用
收藏
页码:205 / 218
页数:14
相关论文
共 50 条
  • [1] An accurate and interpretable deep learning model for environmental properties prediction using hybrid molecular representations
    Zhang, Jun
    Wang, Qin
    Su, Yang
    Jin, Saimeng
    Ren, Jingzheng
    Eden, Mario
    Shen, Weifeng
    AICHE JOURNAL, 2022, 68 (06)
  • [2] Learning Hierarchical Review Graph Representations for Recommendation
    Liu, Yong
    Yang, Susen
    Zhang, Yinan
    Miao, Chunyan
    Nie, Zaiqing
    Zhang, Juyong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 658 - 671
  • [3] Predicting Learning Behaviour of Online Course Learners' using Hybrid Deep Learning Model
    Kavitha, S.
    Mohanavalli, S.
    Bharathi, B.
    PROCEEDINGS OF THE 2018 IEEE 6TH INTERNATIONAL CONFERENCE ON MOOCS, INNOVATION AND TECHNOLOGY IN EDUCATION (MITE 2018), 2018, : 98 - 102
  • [4] Learning Deep Representations for Graph Clustering
    Tian, Fei
    Gao, Bin
    Cui, Qing
    Chen, Enhong
    Liu, Tie-Yan
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1293 - 1299
  • [5] Deep Neural Networks for Learning Graph Representations
    Cao, Shaosheng
    Lu, Wei
    Xu, Qiongkai
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1145 - 1152
  • [6] Deep Learning Hierarchical Representations for Image Steganalysis
    Ye, Jian
    Ni, Jiangqun
    Yi, Yang
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (11) : 2545 - 2557
  • [7] DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
    Al-Rfou, Rami
    Zelle, Dustin
    Perozzi, Bryan
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 37 - 48
  • [8] A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites
    Long, Haixia
    Liao, Bo
    Xu, Xingyu
    Yang, Jialiang
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (09)
  • [9] Attack Graph Model for Cyber-Physical Power Systems Using Hybrid Deep Learning
    Presekal, Alfan
    Stefanov, Alexandru
    Rajkumar, Vetrivel Subramaniam
    Palensky, Peter
    IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (05) : 4007 - 4020
  • [10] Predicting systemic risk in financial systems using Deep Graph Learning
    Balmaseda, Vicente
    Coronado, Maria
    de Cadenas-Santiago, Gonzalo
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 19