Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences

被引:0
|
作者
Basu, Sushmita [1 ]
Yu, Jing [1 ]
Kihara, Daisuke [2 ,3 ]
Kurgan, Lukasz [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, 401 West Main St, Richmond, VA 23284 USA
[2] Purdue Univ, Dept Biol Sci, 915 Mitch Daniels Blvd, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Comp Sci, 305 N Univ St, W Lafayette, IN 47907 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
protein-DNA interaction; protein-RNA interaction; nucleic acid-binding; DNA-binding residue; RNA-binding residue; intrinsic disorder; sequence-based prediction; machine learning; deep learning; INTRINSICALLY DISORDERED PROTEINS; SECONDARY STRUCTURE PREDICTION; COMPUTATIONAL PREDICTION; AMINO-ACIDS; WEB SERVER; RNA RECOGNITION; DNA; SITES; EVOLUTIONARY; COMPLEXES;
D O I
10.1093/bib/bbaf016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Computational prediction of nucleic acid-binding residues in protein sequences is an active field of research, with over 80 methods that were released in the past 2 decades. We identify and discuss 87 sequence-based predictors that include dozens of recently published methods that are surveyed for the first time. We overview historical progress and examine multiple practical issues that include availability and impact of predictors, key features of their predictive models, and important aspects related to their training and assessment. We observe that the past decade has brought increased use of deep neural networks and protein language models, which contributed to substantial gains in the predictive performance. We also highlight advancements in vital and challenging issues that include cross-predictions between deoxyribonucleic acid (DNA)-binding and ribonucleic acid (RNA)-binding residues and targeting the two distinct sources of binding annotations, structure-based versus intrinsic disorder-based. The methods trained on the structure-annotated interactions tend to perform poorly on the disorder-annotated binding and vice versa, with only a few methods that target and perform well across both annotation types. The cross-predictions are a significant problem, with some predictors of DNA-binding or RNA-binding residues indiscriminately predicting interactions with both nucleic acid types. Moreover, we show that methods with web servers are cited substantially more than tools without implementation or with no longer working implementations, motivating the development and long-term maintenance of the web servers. We close by discussing future research directions that aim to drive further progress in this area.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Binding Of Immune Serine Proteases To Nucleic Acids Enhances Their Nuclear Localization and Promotes Their Cleavage Of Nucleic Acid-Binding Protein Substrates
    Whangbo, Jennifer
    Thomas, Marshall
    McCrossan, Geoffrey
    Deutsch, Aaron
    Martinod, Kimberly
    Walch, Michael
    Lieberman, Judy
    BLOOD, 2013, 122 (21)
  • [32] Prediction of binding sites in protein-nucleic acid complexes
    Han, N
    Han, K
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 2, PROCEEDINGS, 2004, 3037 : 309 - 316
  • [33] NUCLEIC ACID-BINDING SPECIFICITIES OF TOBACCO CHLOROPLAST RIBONUCLEOPROTEINS
    LI, YQ
    SUGIURA, M
    NUCLEIC ACIDS RESEARCH, 1991, 19 (11) : 2893 - 2896
  • [34] Nucleic acid-binding properties of the RRM-containing protein RDM1
    Hamimes, S
    Bourgeon, D
    Stasiak, AZ
    Stasiak, A
    Van Dyck, E
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2006, 344 (01) : 87 - 94
  • [35] Modem tools for identification of nucleic acid-binding proteins
    Hegarat, Nadia
    Francois, Jean-Christophe
    Praseuth, Daniele
    BIOCHIMIE, 2008, 90 (09) : 1265 - 1272
  • [36] Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins
    Katuwawala, Akila
    Kurgan, Lukasz
    BIOMOLECULES, 2020, 10 (12) : 1 - 18
  • [37] ORF 5 of Grapevine Virus A Encodes a Nucleic Acid-Binding Protein and Affects Pathogenesis
    Nurbol Galiakparov
    Edna Tanne
    Munir Mawassi
    Rony Gafny
    Ilan Sela
    Virus Genes, 2003, 27 : 257 - 262
  • [38] THE SUBUNITS OF INTERMEDIATE FILAMENTS ARE NUCLEIC ACID-BINDING PROTEINS
    TRAUB, P
    NELSON, WJ
    VORGIAS, CE
    KUHN, S
    JOURNAL OF CELL BIOLOGY, 1982, 95 (02): : A229 - A229
  • [39] NUCLEIC ACID-BINDING PROPERTIES OF ADENOVIRUS STRUCTURAL POLYPEPTIDES
    RUSSELL, WC
    PRECIOUS, B
    JOURNAL OF GENERAL VIROLOGY, 1982, 63 (NOV): : 69 - 79
  • [40] ORF 5 of grapevine virus A encodes a nucleic acid-binding protein and affects pathogenesis
    Galiakparov, N
    Tanne, E
    Mawassi, M
    Gafny, R
    Sela, I
    VIRUS GENES, 2003, 27 (03) : 257 - 262