Benchmarking protein structure predictors to assist machine learning-guided peptide discovery

被引:2
|
作者
Aldas-Bulos, Victor Daniel [1 ]
Plisson, Fabien [1 ,2 ]
机构
[1] Natl Polytech Inst CINVESTAV IPN, Ctr Res & Adv Studies, Adv Genom Unit, Natl Lab Genom Biodivers LANGEBIO, Irapuato 36824, Guanajuato, Mexico
[2] Natl Polytech Inst CINVESTAV IPN, Ctr Res & Adv Studies, Dept Biotechnol & Biochem, Irapuato Unit, Irapuato 36824, Guanajuato, Mexico
来源
DIGITAL DISCOVERY | 2023年 / 2卷 / 04期
关键词
SECONDARY STRUCTURE PREDICTION; ARTIFICIAL-INTELLIGENCE; ANTIMICROBIAL PEPTIDES; DESIGN; LANGUAGE; DATABASE;
D O I
10.1039/d3dd00045a
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Machine learning models provide an informed and efficient strategy to create novel peptide and protein sequences with the desired profiles. Nevertheless, they are primarily trained on sequences where the tridimensional structures of peptides and proteins are often overlooked. We need a fast and reliable approach to estimate the structural diversity of medium-large training sets before building models. This study benchmarked four protein structure prediction methods (Jpred4, PEP2D, PSIPRED, AlphaFold2) using 261 curated and experimentally known structures from the PDBe database. We applied our best predictor to map the structural landscape of GRAMPA, the giant and vastly uncharted repository of 5980 antimicrobial peptides. The dataset was predominantly made of loose helices (65.1%), followed by random coils (17.8%), and beta-stranded and mixed structures accounted for the rest. Machine learning models provide an informed and efficient strategy to create novel peptide and protein sequences with the desired profiles.
引用
收藏
页码:981 / 993
页数:13
相关论文
共 50 条
  • [1] Machine learning-guided peptide discovery: are we there yet?
    Mausa, Goran
    Kalafatovic, Daniela
    JOURNAL OF PEPTIDE SCIENCE, 2024, 30
  • [2] Machine Learning-Guided Protein Engineering
    Kouba, Petr
    Kohout, Pavel
    Haddadi, Faraneh
    Bushuiev, Anton
    Samusevich, Raman
    Sedlar, Jiri
    Damborsky, Jiri
    Pluskal, Tomas
    Sivic, Josef
    Mazurenko, Stanislav
    ACS CATALYSIS, 2023, 13 (21) : 13863 - 13895
  • [3] Nucleotide augmentation for machine learning-guided protein engineering
    Minot, Mason
    Reddy, Sai T.
    BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [4] Overcoming the challenges in machine learning-guided antimicrobial peptide design
    Plisson, Fabien
    JOURNAL OF PEPTIDE SCIENCE, 2022, 28
  • [5] Machine learning-guided discovery and design of non-hemolytic peptides
    Plisson, Fabien
    Ramirez-Sanchez, Obed
    Martinez-Hernandez, Cristina
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [6] Machine learning-guided discovery of gas evolving electrode bubble inactivation
    Lake, Jack R.
    Rufer, Simon
    James, Jim
    Pruyne, Nathan
    Scourtas, Aristana
    Schwarting, Marcus
    Ambadkar, Aadit
    Foster, Ian
    Blaiszik, Ben
    Varanasi, Kripa K.
    NANOSCALE, 2025, 17 (03)
  • [7] Machine Learning-Guided Discovery of AcrB and MexB Efflux Pump Inhibitors
    Bera, Abhishek
    Roy, Rakesh Kumar
    Joshi, Pritish
    Patra, Niladri
    JOURNAL OF PHYSICAL CHEMISTRY B, 2024, 128 (03): : 648 - 663
  • [8] Machine learning-guided discovery and design of non-hemolytic peptides
    Fabien Plisson
    Obed Ramírez-Sánchez
    Cristina Martínez-Hernández
    Scientific Reports, 10
  • [9] Challenges and opportunities in machine learning-guided plant protein engineering
    Shukla, Diwakar
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 455A - 455A
  • [10] Machine Learning-Guided Prediction of Hydroformylation
    Shi, Haonan
    Shen, Chaoren
    Huang, Zheng
    Dong, Kaiwu
    CHEMPHYSCHEM, 2025, 26 (03)