An updated dataset and a structure-based prediction model for protein-RNA binding affinity

被引:3
|
作者
Hong, Xu [1 ]
Tong, Xiaoxue [1 ]
Xie, Juan [1 ]
Liu, Pinyu [1 ]
Liu, Xudong [1 ]
Song, Qi [2 ]
Liu, Sen [2 ]
Liu, Shiyong [1 ,3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Phys, Wuhan, Hubei, Peoples R China
[2] Hubei Univ Technol, Key Lab Fermentat Engn, Minist Educ, Wuhan, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Phys, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
binding affinity; feature selection; protein-RNA interaction; regression model; structural features; FREE-ENERGY; COLLECTION; COMPLEXES;
D O I
10.1002/prot.26503
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Understanding the process of protein-RNA interaction is essential for structural biology. The thermodynamic process is an important part to uncover the protein-RNA interaction mechanism. The regulatory networks between protein and RNA in organisms are dominated by the binding or dissociation in the cells. Therefore, determining the binding affinity for protein-RNA complexes can help us to understand the regulation mechanism of protein-RNA interaction. Since it is time-consuming and labor-intensive to determine the binding affinity for protein-RNA complexes by experimental methods, it is necessary and urgent to develop computational methods to predict that. To develop a binding affinity prediction model, first we update the dataset of protein-RNA binding affinity benchmark (PRBAB), which includes 145 complexes now. Second, we extract the structural features based on complex structure, and then we analyze and select the representative structural features to train the regression model. Third, we random select the subset from the PRBAB2.0 to fit the protein-RNA binding affinity determined by experiment. In the end, we tested our model on the nonredundant PDBbind dataset, and the results showed that Pearson correlation coefficient r = .57 and RMSE = 2.51 kcal/mol. The Pearson correlation coefficient achieves 0.7 while removing 5 complex structures with modified residues/nucleotides and metal ions. While testing on ProNAB, the results showed that 71.60% of the prediction achieves Pearson correlation coefficient r = .61 and RMSE = 1.56 kcal/mol with experiment values.
引用
收藏
页码:1245 / 1253
页数:9
相关论文
共 50 条
  • [1] A structure-based model for the prediction of protein-RNA binding affinity
    Nithin, Chandran
    Mukherjee, Sunandan
    Bahadur, Ranjit Prasad
    [J]. RNA, 2019, 25 (12) : 1628 - 1645
  • [2] PRA-Pred: Structure-based prediction of protein-RNA binding affinity
    Harini, K.
    Sekijima, M.
    Gromiha, M. Michael
    [J]. INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2024, 259
  • [3] The dataset for protein-RNA binding affinity
    Yang, Xiufeng
    Li, Haotian
    Huang, Yangyu
    Liu, Shiyong
    [J]. PROTEIN SCIENCE, 2013, 22 (12) : 1808 - 1811
  • [4] Individually double minimum-distance definition of protein-RNA binding residues and application to structure-based prediction
    Hu, Wen
    Qin, Liu
    Li, Menglong
    Pu, Xuemei
    Guo, Yanzhi
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2018, 32 (12) : 1363 - 1373
  • [5] Structure-based prediction of protein–protein binding affinity with consideration of allosteric effect
    Feifei Tian
    Yonggang Lv
    Li Yang
    [J]. Amino Acids, 2012, 43 : 531 - 543
  • [6] Structure-based prediction and characterization of photo-crosslinking in native protein-RNA complexes
    Feng, Huijuan
    Lu, Xiang-Jun
    Maji, Suvrajit
    Liu, Linxi
    Ustianenko, Dmytro
    Rudnick, Noam D.
    Zhang, Chaolin
    [J]. NATURE COMMUNICATIONS, 2024, 15 (01)
  • [7] Structure-based prediction of protein-protein binding affinity with consideration of allosteric effect
    Tian, Feifei
    Lv, Yonggang
    Yang, Li
    [J]. AMINO ACIDS, 2012, 43 (02) : 531 - 543
  • [8] Genetic perturbations of RNA reveal structure-based recognition in protein-RNA interaction
    Cho, H
    Otten, S
    Schneider, J
    McClain, WH
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2002, 324 (04) : 573 - 576
  • [9] Structure-based protein-ligand interaction fingerprints for binding affinity prediction
    Wang, Debby D.
    Chan, Moon-Tong
    Yan, Hong
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 6291 - 6300
  • [10] PredPRBA: Prediction of Protein-RNA Binding Affinity Using Gradient Boosted Regression Trees
    Deng, Lei
    Yang, Wenyi
    Liu, Hui
    [J]. FRONTIERS IN GENETICS, 2019, 10