Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences

被引:0
|
作者
Basu, Sushmita [1 ]
Yu, Jing [1 ]
Kihara, Daisuke [2 ,3 ]
Kurgan, Lukasz [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, 401 West Main St, Richmond, VA 23284 USA
[2] Purdue Univ, Dept Biol Sci, 915 Mitch Daniels Blvd, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Comp Sci, 305 N Univ St, W Lafayette, IN 47907 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
protein-DNA interaction; protein-RNA interaction; nucleic acid-binding; DNA-binding residue; RNA-binding residue; intrinsic disorder; sequence-based prediction; machine learning; deep learning; INTRINSICALLY DISORDERED PROTEINS; SECONDARY STRUCTURE PREDICTION; COMPUTATIONAL PREDICTION; AMINO-ACIDS; WEB SERVER; RNA RECOGNITION; DNA; SITES; EVOLUTIONARY; COMPLEXES;
D O I
10.1093/bib/bbaf016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Computational prediction of nucleic acid-binding residues in protein sequences is an active field of research, with over 80 methods that were released in the past 2 decades. We identify and discuss 87 sequence-based predictors that include dozens of recently published methods that are surveyed for the first time. We overview historical progress and examine multiple practical issues that include availability and impact of predictors, key features of their predictive models, and important aspects related to their training and assessment. We observe that the past decade has brought increased use of deep neural networks and protein language models, which contributed to substantial gains in the predictive performance. We also highlight advancements in vital and challenging issues that include cross-predictions between deoxyribonucleic acid (DNA)-binding and ribonucleic acid (RNA)-binding residues and targeting the two distinct sources of binding annotations, structure-based versus intrinsic disorder-based. The methods trained on the structure-annotated interactions tend to perform poorly on the disorder-annotated binding and vice versa, with only a few methods that target and perform well across both annotation types. The cross-predictions are a significant problem, with some predictors of DNA-binding or RNA-binding residues indiscriminately predicting interactions with both nucleic acid types. Moreover, we show that methods with web servers are cited substantially more than tools without implementation or with no longer working implementations, motivating the development and long-term maintenance of the web servers. We close by discussing future research directions that aim to drive further progress in this area.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues
    Yang, Xiaoxia
    Wang, Jia
    Sun, Jun
    Liu, Rong
    PLOS ONE, 2015, 10 (07):
  • [2] An ABA-binding protein with nucleic acid-binding property
    Zhifu Zheng
    Zhenhua Jin
    Xie Zhou
    Kai Xia
    Cheng Ma
    Science in China Series C: Life Sciences, 1998, 41 : 209 - 216
  • [3] An ABA-binding protein with nucleic acid-binding property
    郑志富
    金振华
    周燮
    夏凯
    马诚
    Science in China(Series C:Life Sciences) , 1998, (02) : 209 - 216
  • [4] An ABA-binding protein with nucleic acid-binding property
    Zheng, ZF
    Jin, ZH
    Zhou, X
    Xia, K
    Ma, C
    SCIENCE IN CHINA SERIES C-LIFE SCIENCES, 1998, 41 (02): : 209 - 216
  • [5] Nucleic acid-binding specificity of human FUS protein
    Wang, Xueyin
    Schwartz, Jacob C.
    Cech, Thomas R.
    NUCLEIC ACIDS RESEARCH, 2015, 43 (15) : 7535 - 7543
  • [6] Advances in the Application of Protein Language Modeling for Nucleic Acid Protein Binding Site Prediction
    Wang, Bo
    Li, Wenjin
    GENES, 2024, 15 (08)
  • [7] NesT-NABind: a Nested Transformer for Nucleic Acid-Binding Site Prediction on Protein Surface
    Ma, Xinyue
    Li, Fenglei
    Chen, Qianyu
    Gao, Shenghua
    Bai, Fang
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2025, 65 (03) : 1166 - 1177
  • [8] Inhibitory properties of nucleic acid-binding ligands on protein synthesis
    Malina, A
    Khan, S
    Carlson, CB
    Svitkin, Y
    Harvey, I
    Sonenberg, N
    Beal, PA
    Pelletier, J
    FEBS LETTERS, 2005, 579 (01): : 79 - 89
  • [9] ORGANIZATION OF THE GENE ENCODING CELLULAR NUCLEIC ACID-BINDING PROTEIN
    FLINK, IL
    MORKIN, E
    GENE, 1995, 163 (02) : 279 - 282
  • [10] Annotating nucleic acid-binding function based on protein structure
    Stawiski, EW
    Gregoret, LM
    Mandel-Gutfreund, Y
    JOURNAL OF MOLECULAR BIOLOGY, 2003, 326 (04) : 1065 - 1079