PepMNet: a hybrid deep learning model for predicting peptide properties using hierarchical graph representations

被引:0
|
作者
Otero, Daniel Garzon [1 ]
Akbari, Omid [1 ]
Bilodeau, Camille [1 ]
机构
[1] Univ Virginia, Chem Engn Dept, 385 McCormick Rd, Charlottesville, VA 22903 USA
来源
关键词
RETENTION TIME PREDICTION; ANTIMICROBIAL PEPTIDES; IDENTIFICATION;
D O I
10.1039/d4me00172a
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Peptides are a powerful class of molecules that can be applied to a range of problems including biomaterials development and drug design. Currently, machine learning-based property prediction models for peptides primarily rely on amino acid sequence, resulting in two key limitations: first, they are not compatible with non-natural peptide features like modified sidechains or staples, and second, they use human-crafted features to describe the relationships between different amino acids, which reduces the model's flexibility and generalizability. To address these challenges, we have developed PepMNet, a deep learning model that integrates atom-level and amino acid-level information through a hierarchical graph approach. The model first learns from an atom-level graph and then generates amino acid representations based on the atomic information captured in the first stage. These amino acid representations are then combined using graph convolutions on an amino acid-level graph to produce a molecular-level representation, which is then passed to a fully connected neural network for property prediction. We evaluated this architecture by predicting two peptide properties: chromatographic retention time (RT) as a regression task and antimicrobial peptide (AMP) activity as a classification task. For the regression task, PepMNet achieved an average R2 of 0.980 across eight datasets, which spanned different dataset sizes and three liquid chromatography (LC) methods. For the classification task, we developed an ensemble of five models to reduce overfitting and ensure robust classification performance, achieving an area under the receiver operating curve (AUC-ROC) of 0.978 and an average precision of 0.981. Overall, our model illustrates the potential for hierarchical deep learning models to learn peptide properties without relying on human engineering amino acid features.
引用
收藏
页码:205 / 218
页数:14
相关论文
共 50 条
  • [41] Learning Graph Pooling and Hybrid Convolutional Operations for Text Representations
    Gao, Hongyang
    Chen, Yongjun
    Ji, Shuiwang
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2743 - 2749
  • [42] Predicting progression to AD using a deep-learning model
    Kelsey R.
    Nature Reviews Neurology, 2019, 15 (9) : 492 - 492
  • [43] Email spam detection using hierarchical attention hybrid deep learning method
    Zavrak, Sultan
    Yilmaz, Seyhmus
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [44] Hybrid approach using deep learning and graph comparison for building change detection
    Park, Seula
    Song, Ahram
    GISCIENCE & REMOTE SENSING, 2023, 60 (01)
  • [45] Comparative Deep Learning of Hybrid Representations for Image Recommendations
    Lei, Chenyi
    Liu, Dong
    Li, Weiping
    Zha, Zheng-Jun
    Li, Houqiang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2545 - 2553
  • [46] A Hybrid Model For Predicting The Degradation Trend Of Hydropower Units Based On Deep Learning
    Hu, Xin
    Li, Chaoshun
    Tang, Geng
    2019 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-QINGDAO), 2019,
  • [47] Hierarchical Graph Augmented Deep Collaborative Dictionary Learning for Classification
    Gou, Jianping
    Yuan, Xia
    Du, Lan
    Xia, Shuyin
    Yi, Zhang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25308 - 25322
  • [48] Deep Reinforcement Learning for Autonomous Driving using High-Level Heterogeneous Graph Representations
    Schier, Maximilian
    Reinders, Christoph
    Rosenhahn, Bodo
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 7147 - 7153
  • [49] A General Model for Learning Node and Graph Representations Jointly
    Chen, Chaofan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2867 - 2873
  • [50] A lightweight hierarchical graph convolutional model for knowledge graph representation learning
    Zhang, Jinglin
    Shen, Bo
    APPLIED INTELLIGENCE, 2024, 54 (21) : 10695 - 10708