Motivation: There are over 30 sequence-based predictors of the protein-binding residues (PBRs). They use either structure-annotated or disorder-annotated training datasets, potentially creating a dichotomy where the structure-/disorder-specific models may not be able to cross-over to accurately predict the other type. Moreover, the structure-trained predictors were shown to substantially cross-predict PBRs among residues that interact with non-protein partners (nucleic acids and small ligands). We address these issues by performing first-of-its-kind comparative study of a representative collection of disorder- and structure-trained predictors using a comprehensive benchmark set with the structure- and disorder-derived annotations of PBRs (to analyze the cross-over) and the protein-, nucleic acid- and small ligand-binding proteins (to study the cross-predictions). Results: Three predictors provide accurate results: SCRIBER, ANCHOR and disoRDPbind. Some of the structure-trained methods make accurate predictions on the structure-annotated proteins. Similarly, the disorder-trained predictors predict well on the disorder-annotated proteins. However, the considered predictors generally fail to crossover, with the exception of SCRIBER. Our study also reveals that virtually all methods substantially cross-predict PBRs, except for SCRIBER for the structure-annotated proteins and disoRDPbind for the disorder-annotated proteins. We formulate a novel hybrid predictor, hybridPBRpred, that combines results produced by disoRDPbind and SCRIBER to accurately predict disorder- and structure-annotated PBRs. HybridPBRpred generates accurate results that cross-over structure- and disorder-annotated proteins and produces relatively low amount of cross-predictions, offering an accurate alternative to predict PBRs.
机构:
Indian Inst Technol Kharagpur, Sch Bio Sci, Kharagpur, India
Indian Inst Technol Kharagpur, Dept Biotechnol, Computat Struct Biol Lab, Kharagpur, IndiaIndian Inst Technol Kharagpur, Sch Bio Sci, Kharagpur, India
Agarwal, Ankita
Kant, Shri
论文数: 0引用数: 0
h-index: 0
机构:
Indian Inst Technol Kharagpur, Dept Biotechnol, Computat Struct Biol Lab, Kharagpur, IndiaIndian Inst Technol Kharagpur, Sch Bio Sci, Kharagpur, India
机构:
Univ Kansas, Ctr Bioinformat, Lawrence, KS 66047 USA
Univ Kansas, Dept Mol Biosci, Lawrence, KS 66047 USA
Univ Calif San Diego, Ctr Res Biol Syst, La Jolla, CA 92093 USAUniv Michigan, Ctr Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
Wu, Sitao
Szilagyi, Andras
论文数: 0引用数: 0
h-index: 0
机构:
Univ Kansas, Ctr Bioinformat, Lawrence, KS 66047 USA
Univ Kansas, Dept Mol Biosci, Lawrence, KS 66047 USA
Hungarian Acad Sci, Inst Enzymol, H-1113 Budapest, HungaryUniv Michigan, Ctr Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
Szilagyi, Andras
Zhang, Yang
论文数: 0引用数: 0
h-index: 0
机构:
Univ Michigan, Ctr Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
Univ Kansas, Ctr Bioinformat, Lawrence, KS 66047 USA
Univ Kansas, Dept Mol Biosci, Lawrence, KS 66047 USAUniv Michigan, Ctr Computat Med & Bioinformat, Ann Arbor, MI 48109 USA