diSBPred: A machine learning based approach for disulfide bond prediction

被引:8
|
作者
Mishra, Avdesh [1 ]
Ul Kabir, Md Wasi [2 ]
Hoque, Md Tamjidul [2 ]
机构
[1] Texas A&M Univ, Dept Elect Engn & Comp Sci, Kingsville, TX USA
[2] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
关键词
Machine learning; Disulfide bond prediction; Protein structure; Protein sequence; ACCESSIBLE SURFACE-AREA; CONNECTIVITY PREDICTION; NEURAL-NETWORKS; WEB SERVER; PROTEINS; CYSTEINE; STATE; INFORMATION; KNOWLEDGE; OXIDATION;
D O I
10.1016/j.compbiolchem.2021.107436
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The protein disulfide bond is a covalent bond that forms during post-translational modification by the oxidation of a pair of cysteines. In protein, the disulfide bond is the most frequent covalent link between amino acids after the peptide bond. It plays a significant role in three-dimensional (3D) ab initio protein structure prediction (aiPSP), stabilizing protein conformation, post-translational modification, and protein folding. In aiPSP, the location of disulfide bonds can strongly reduce the conformational space searching by imposing geometrical constraints. Existing experimental techniques for the determination of disulfide bonds are time-consuming and expensive. Thus, developing sequence-based computational methods for disulfide bond prediction becomes indispensable. This study proposed a stacking-based machine learning approach for disulfide bond prediction (diSBPred). Various useful sequence and structure-based features are extracted for effective training, including conservation profile, residue solvent accessibility, torsion angle flexibility, disorder probability, a sequential distance between cysteines, and more. The prediction of disulfide bonds is carried out in two stages: first, individual cysteines are predicted as either bonding or non-bonding; second, the cysteine-pairs are predicted as either bonding or non-bonding by including the results from cysteine bonding prediction as a feature. The examination of the relevance of the features employed in this study and the features utilized in the existing nearest neighbor algorithm (NNA) method shows that the features used in this study improve about 7.39 % in jackknife validation balanced accuracy. Moreover, for individual cysteine bonding prediction and cysteine-pair bonding prediction, diSBPred provides a 10-fold cross-validation balanced accuracy of 82.29 % and 94.20 %, respectively. Altogether, our predictor achieves an improvement of 43.25 % based on balanced accuracy compared to the existing NNA based approach. Thus, diSBPred can be utilized to annotate the cysteine bonding residues of protein sequences whose structures are unknown as well as improve the accuracy of the aiPSP method, which can further aid in experimental studies of the disulfide bond and structure determination.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Prediction of disulfide bond engineering sites using a machine learning method
    Gao, Xiang
    Dong, Xiaoqun
    Li, Xuanxuan
    Liu, Zhijie
    Liu, Haiguang
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [2] Prediction of Bond Dissociation Energy for Organic Molecules Based on a Machine-Learning Approach
    Liu, Yidi
    Li, Yao
    Yang, Qi
    Yang, Jin-Dong
    Zhang, Long
    Luo, Sanzhong
    [J]. CHINESE JOURNAL OF CHEMISTRY, 2024, 42 (17) : 1967 - 1974
  • [3] Author Correction: Prediction of disulfide bond engineering sites using a machine learning method
    Xiang Gao
    Xiaoqun Dong
    Xuanxuan Li
    Zhijie Liu
    Haiguang Liu
    [J]. Scientific Reports, 10
  • [4] Prediction of FRCM-Concrete Bond Strength with Machine Learning Approach
    Kumar, Aman
    Arora, Harish Chandra
    Kumar, Krishna
    Mohammed, Mazin Abed
    Majumdar, Arnab
    Khamaksorn, Achara
    Thinnukool, Orawit
    [J]. SUSTAINABILITY, 2022, 14 (02)
  • [5] A Novel Machine Learning Based Approach for Rainfall Prediction
    Solanki, Niharika
    Panchal, Gaurang
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 1, 2018, 83 : 314 - 319
  • [6] Prediction of FRP-concrete interfacial bond strength based on machine learning
    Zhang, Feng
    Wang, Chenxin
    Liu, Jun
    Zou, Xingxing
    Sneed, Lesley H.
    Bao, Yi
    Wang, Libin
    [J]. ENGINEERING STRUCTURES, 2023, 274
  • [7] Prediction of Nucleophilicity and Electrophilicity Based on a Machine-Learning Approach
    Liu, Yidi
    Yang, Qi
    Cheng, Junjie
    Zhang, Long
    Luo, Sanzhong
    Cheng, Jin-Pei
    [J]. CHEMPHYSCHEM, 2023, 24 (14)
  • [8] Anomaly prediction approach in business process based on machine learning
    Wei Y.
    Cao J.
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (04): : 864 - 872
  • [9] BROKEN RAIL PREDICTION WITH MACHINE LEARNING-BASED APPROACH
    Zhang, Zhipeng
    Zhou, Kang
    Liu, Xiang
    [J]. PROCEEDINGS OF THE JOINT RAIL CONFERENCE (JRC2020), 2020,
  • [10] Prediction of harbour vessel emissions based on machine learning approach
    Chen, Zhong Shuo
    Lam, Jasmine Siu Lee
    Xiao, Zengqi
    [J]. TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2024, 131