A global machine learning based scoring function for protein structure prediction

被引:17
|
作者
Faraggi, Eshel [1 ,2 ,3 ]
Kloczkowski, Andrzej [2 ,4 ]
机构
[1] Indiana Univ Sch Med, Dept Biochem & Mol Biol, Indianapolis, IN 46202 USA
[2] Nationwide Childrens Hosp, Battelle Ctr Math Med, Columbus, OH 43215 USA
[3] Res & Informat Syst LLC, Div Phys, Carmel, IN 46032 USA
[4] Ohio State Univ, Dept Pediat, Columbus, OH 43215 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
tertiary protein structure; protein knowledge potentials; protein potential energy; protein scoring functions; neural network; global features; KNOWLEDGE-BASED POTENTIALS; RESIDUE FORCE-FIELD; STATISTICAL POTENTIALS; BIOMOLECULAR SYSTEMS; ENERGY FUNCTIONS; LATTICE MODEL; WEB-SERVER; DATA-BANK; SIMULATIONS; ORIENTATION;
D O I
10.1002/prot.24454
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a knowledge-based function to score protein decoys based on their similarity to native structure. A set of features is constructed to describe the structure and sequence of the entire protein chain. Furthermore, a qualitative relationship is established between the calculated features and the underlying electromagnetic interaction that dominates this scale. The features we use are associated with residue-residue distances, residue-solvent distances, pairwise knowledge-based potentials and a four-body potential. In addition, we introduce a new target to be predicted, the fitness score, which measures the similarity of a model to the native structure. This new approach enables us to obtain information both from decoys and from native structures. It is also devoid of previous problems associated with knowledge-based potentials. These features were obtained for a large set of native and decoy structures and a back-propagating neural network was trained to predict the fitness score. Overall this new scoring potential proved to be superior to the knowledge-based scoring functions used as its inputs. In particular, in the latest CASP (CASP10) experiment our method was ranked third for all targets, and second for freely modeled hard targets among about 200 groups for top model prediction. Ours was the only method ranked in the top three for all targets and for hard targets. This shows that initial results from the novel approach are able to capture details that were missed by a broad spectrum of protein structure prediction approaches. Source codes and executable from this work are freely available at http://mathmed.org/#Software and http://mamiris.com/. (C) 2013 Wiley Periodicals, Inc.
引用
收藏
页码:752 / 759
页数:8
相关论文
共 50 条
  • [41] A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening
    Scantlebury, Jack
    Vost, Lucy
    Carbery, Anna
    Hadfield, Thomas E.
    Turnbull, Oliver M.
    Brown, Nathan
    Chenthamarakshan, Vijil
    Das, Payel
    Grosjean, Harold
    von Delft, Frank
    Deane, Charlotte M.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (10) : 2960 - 2974
  • [42] Information Theory-Based Scoring Function for the Structure-Based Prediction of Protein-Ligand Binding Affinity
    Kulharia, Mahesh
    Goody, Rooer S.
    Jackson, Richard N.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (10) : 1990 - 1998
  • [43] Human Protein Function Prediction Enhancement Using Decision Tree Based Machine Learning Approach
    Sharma, Sunny
    Singh, Gurvinder
    Singh, Rajinder
    [J]. INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY (ICICCT 2019), 2019, 1025 : 279 - 293
  • [44] Derivation of a scoring function for crystal structure prediction
    Apostolakis, J
    Hofmann, DWM
    Lengauer, T
    [J]. ACTA CRYSTALLOGRAPHICA SECTION A, 2001, 57 : 442 - 450
  • [45] Global machine learning model predicting activity level of any GPCRs based on protein structure
    Mollaei, Parisa
    Farimani, Amir Barati
    [J]. BIOPHYSICAL JOURNAL, 2023, 122 (03) : 181A - 181A
  • [46] Machine Learning Prediction of the Transmission Function for Protein Sequencing with Graphene Nanoslit
    Mittal, Sneha
    Manna, Souvik
    Pathak, Biswarup
    [J]. ACS APPLIED MATERIALS & INTERFACES, 2022, 14 (46) : 51645 - 51655
  • [47] DLScore: New deep learning based scoring function for reliable protein-ligand scoring
    Sirimulla, Suman
    Muela, Gerardo
    Fuentes, Olac
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [48] CINF 117-Exploring the feasibility of a protein structure prediction metaserver based on the AMBER/PBSA scoring function
    Chen, Hai-Feng
    Hsieh, M. J.
    Luo, Ray
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 232
  • [49] COMP 265-Validating the network-based scoring function for helical membrane protein structure prediction
    Gao, Jun
    Li, Zhijun
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2008, 236
  • [50] PharmRF: A machine-learning scoring function to identify the best protein-ligand complexes for structure-based pharmacophore screening with high enrichments
    Kumar, Sivakumar Prasanth
    Dixit, Nandan Y.
    Patel, Chirag N.
    Rawal, Rakesh M.
    Pandya, Himanshu A.
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2022, 43 (12) : 847 - 863