A global machine learning based scoring function for protein structure prediction

Cited by: 17
Authors
Faraggi, Eshel [1 ,2 ,3 ]
Kloczkowski, Andrzej [2 ,4 ]
Affiliations
[1] Indiana Univ Sch Med, Dept Biochem & Mol Biol, Indianapolis, IN 46202 USA
[2] Nationwide Childrens Hosp, Battelle Ctr Math Med, Columbus, OH 43215 USA
[3] Res & Informat Syst LLC, Div Phys, Carmel, IN 46032 USA
[4] Ohio State Univ, Dept Pediat, Columbus, OH 43215 USA
Funding
National Institutes of Health (NIH); National Science Foundation (NSF);
Keywords
tertiary protein structure; protein knowledge potentials; protein potential energy; protein scoring functions; neural network; global features; KNOWLEDGE-BASED POTENTIALS; RESIDUE FORCE-FIELD; STATISTICAL POTENTIALS; BIOMOLECULAR SYSTEMS; ENERGY FUNCTIONS; LATTICE MODEL; WEB-SERVER; DATA-BANK; SIMULATIONS; ORIENTATION;
DOI
10.1002/prot.24454
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
We present a knowledge-based function to score protein decoys based on their similarity to native structure. A set of features is constructed to describe the structure and sequence of the entire protein chain. Furthermore, a qualitative relationship is established between the calculated features and the underlying electromagnetic interaction that dominates this scale. The features we use are associated with residue-residue distances, residue-solvent distances, pairwise knowledge-based potentials and a four-body potential. In addition, we introduce a new target to be predicted, the fitness score, which measures the similarity of a model to the native structure. This new approach enables us to obtain information both from decoys and from native structures. It is also devoid of previous problems associated with knowledge-based potentials. These features were obtained for a large set of native and decoy structures and a back-propagating neural network was trained to predict the fitness score. Overall this new scoring potential proved to be superior to the knowledge-based scoring functions used as its inputs. In particular, in the latest CASP (CASP10) experiment our method was ranked third for all targets, and second for freely modeled hard targets among about 200 groups for top model prediction. Ours was the only method ranked in the top three for all targets and for hard targets. This shows that initial results from the novel approach are able to capture details that were missed by a broad spectrum of protein structure prediction approaches. Source codes and executable from this work are freely available at http://mathmed.org/#Software and http://mamiris.com/. (C) 2013 Wiley Periodicals, Inc.
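The training step described in the abstract (global features in, predicted fitness score out, weights fitted by back-propagation) can be illustrated with a minimal sketch. This is not the authors' implementation: the four input features, the single hidden layer of four units, the learning rate, and the synthetic data are all assumptions made for illustration, and the fitness score is assumed to lie in [0, 1].

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def init_network(n_features, n_hidden, seed=0):
    """Small random weights for one hidden layer (hypothetical architecture)."""
    rng = random.Random(seed)
    return {
        "w_hidden": [[rng.uniform(-0.5, 0.5) for _ in range(n_features)]
                     for _ in range(n_hidden)],
        "b_hidden": [0.0] * n_hidden,
        "w_out": [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)],
        "b_out": 0.0,
    }


def forward(net, x):
    """Return hidden activations and the predicted fitness score in (0, 1)."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(net["w_hidden"], net["b_hidden"])]
    score = sigmoid(sum(w * h for w, h in zip(net["w_out"], hidden))
                    + net["b_out"])
    return hidden, score


def train(net, samples, targets, lr=0.1, epochs=200):
    """Online back-propagation on the squared error (factor 2 folded into lr)."""
    for _ in range(epochs):
        for x, t in zip(samples, targets):
            hidden, score = forward(net, x)
            # Error signal at the output, through the sigmoid derivative.
            delta_out = (score - t) * score * (1.0 - score)
            for j, h in enumerate(hidden):
                # Propagate back through the old output weight and tanh.
                delta_h = delta_out * net["w_out"][j] * (1.0 - h * h)
                net["w_out"][j] -= lr * delta_out * h
                net["b_hidden"][j] -= lr * delta_h
                for i, xi in enumerate(x):
                    net["w_hidden"][j][i] -= lr * delta_h * xi
            net["b_out"] -= lr * delta_out
    return net


def mean_squared_error(net, samples, targets):
    return sum((forward(net, x)[1] - t) ** 2
               for x, t in zip(samples, targets)) / len(samples)


# Synthetic stand-in data: each row plays the role of one decoy's global
# features; the target plays the role of its fitness score (assumed in [0, 1]).
rng = random.Random(1)
samples = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(60)]
targets = [sigmoid(x[0] - 0.5 * x[1] + 0.25 * x[2]) for x in samples]

net = init_network(n_features=4, n_hidden=4)
error_before = mean_squared_error(net, samples, targets)
train(net, samples, targets)
error_after = mean_squared_error(net, samples, targets)
```

Ranking decoys with such a model would amount to calling `forward(net, features)[1]` for each candidate and sorting by the predicted score; in the paper's setting the inputs would be the distance-based and knowledge-based potential features named in the abstract rather than random numbers.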
Pages: 752-759
Number of pages: 8
Related papers
50 in total
  • [21] LoCo: a novel main chain scoring function for protein structure prediction based on local coordinates
    Stewart E Moughon
    Ram Samudrala
    [J]. BMC Bioinformatics, 12
  • [22] Protein secondary structure prediction using machine learning
    Zhang, BF
    Chen, ZH
    Murphey, YL
    [J]. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Vols 1-5, 2005, : 532 - 537
  • [23] Protein Secondary Structure Prediction Using Machine Learning
    Saha, Sriparna
    Ekbal, Asif
    Sharma, Sidharth
    Bandyopadhyay, Sanghamitra
    Maulik, Ujjwal
    [J]. INTELLIGENT INFORMATICS, 2013, 182 : 57 - +
  • [24] LoCo: a novel main chain scoring function for protein structure prediction based on local coordinates
    Moughon, Stewart E.
    Samudrala, Ram
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [25] Integrating Bonded and Nonbonded Potentials in the Knowledge-Based Scoring Function for Protein Structure Prediction
    Wang, Xinxiang
    Huang, Sheng-You
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (06) : 3080 - 3090
  • [26] Machine learning for prediction of protein function and elucidation of enzyme function and control
    Kankanamge, Lakindu Pathira
    Ruffner, Lydia A.
    Shafique, Atif
    Iyengar, Suhasini M.
    Barnsley, Kelly K.
    Beuning, Penny
    Ondrechen, Mary Jo
    [J]. BIOPHYSICAL JOURNAL, 2024, 123 (03) : 431A - 431A
  • [27] Protein Structure Prediction Without Optimizing Weighting Factors For Scoring Function
    Yang, Yifeng
    Park, Changsoon
    Kihara, Daisuke
    [J]. BIOPHYSICAL JOURNAL, 2009, 96 (03) : 653A - 653A
  • [28] Tailoring Contact Based Scoring Functions for Protein Structure Prediction
    Zaman, Rianon
    Newton, M. A. Hakim
    Mataeimoghadam, Fereshteh
    Sattar, Abdul
    [J]. AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 155 - 168
  • [29] Development of a machine learning-based target-specific scoring function for structure-based binding affinity prediction for human dihydroorotate dehydrogenase inhibitors
    Meng, Jinhui
    Zhang, Li
    He, Zhe
    Hu, Mengfeng
    Liu, Jinhan
    Bao, Wenzhuo
    Tian, Qifeng
    Feng, Huawei
    Liu, Hongsheng
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2024,
  • [30] Function Prediction for the Orphan GPCR based on Machine Learning
    Tamura, Takashi
    Abe, Fuyuto
    Mineta, Katsuhiko
    Endo, Toshinori
    [J]. GENES & GENETIC SYSTEMS, 2010, 85 (06) : 428 - 428