Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences

被引:0
|
作者
Basu, Sushmita [1 ]
Yu, Jing [1 ]
Kihara, Daisuke [2 ,3 ]
Kurgan, Lukasz [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, 401 West Main St, Richmond, VA 23284 USA
[2] Purdue Univ, Dept Biol Sci, 915 Mitch Daniels Blvd, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Comp Sci, 305 N Univ St, W Lafayette, IN 47907 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
protein-DNA interaction; protein-RNA interaction; nucleic acid-binding; DNA-binding residue; RNA-binding residue; intrinsic disorder; sequence-based prediction; machine learning; deep learning; INTRINSICALLY DISORDERED PROTEINS; SECONDARY STRUCTURE PREDICTION; COMPUTATIONAL PREDICTION; AMINO-ACIDS; WEB SERVER; RNA RECOGNITION; DNA; SITES; EVOLUTIONARY; COMPLEXES;
D O I
10.1093/bib/bbaf016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Computational prediction of nucleic acid-binding residues in protein sequences is an active field of research, with over 80 methods that were released in the past 2 decades. We identify and discuss 87 sequence-based predictors that include dozens of recently published methods that are surveyed for the first time. We overview historical progress and examine multiple practical issues that include availability and impact of predictors, key features of their predictive models, and important aspects related to their training and assessment. We observe that the past decade has brought increased use of deep neural networks and protein language models, which contributed to substantial gains in the predictive performance. We also highlight advancements in vital and challenging issues that include cross-predictions between deoxyribonucleic acid (DNA)-binding and ribonucleic acid (RNA)-binding residues and targeting the two distinct sources of binding annotations, structure-based versus intrinsic disorder-based. The methods trained on the structure-annotated interactions tend to perform poorly on the disorder-annotated binding and vice versa, with only a few methods that target and perform well across both annotation types. The cross-predictions are a significant problem, with some predictors of DNA-binding or RNA-binding residues indiscriminately predicting interactions with both nucleic acid types. Moreover, we show that methods with web servers are cited substantially more than tools without implementation or with no longer working implementations, motivating the development and long-term maintenance of the web servers. We close by discussing future research directions that aim to drive further progress in this area.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] CLONING OF THE NUCLEIC ACID-BINDING DOMAIN OF THE RAT HNRNP C-TYPE PROTEIN
    SHARP, ZD
    SMITH, KP
    CAO, Z
    HELSEL, S
    BIOCHIMICA ET BIOPHYSICA ACTA, 1990, 1048 (2-3) : 306 - 309
  • [42] A novel nucleic acid-binding protein that interacts with human Rad51 recombinase
    Kovalenko, OV
    Golub, EI
    Bray-Ward, P
    Ward, DC
    Radding, CM
    NUCLEIC ACIDS RESEARCH, 1997, 25 (24) : 4946 - 4953
  • [43] METAL-BINDING, NUCLEIC ACID-BINDING FINGER SEQUENCES IN THE CDC16-GENE OF SACCHAROMYCES-CEREVISIAE
    ICHO, T
    WICKNER, RB
    NUCLEIC ACIDS RESEARCH, 1987, 15 (20) : 8439 - 8450
  • [44] NUCLEIC ACID-BINDING PROTEINS - MORE METAL-BINDING FINGERS
    BERG, JM
    NATURE, 1986, 319 (6051) : 264 - 265
  • [45] COMPLETE AMINO-ACID-SEQUENCE OF THE NUCLEIC ACID-BINDING PROTEIN OF BOVINE LEUKEMIA-VIRUS
    COPELAND, TD
    MORGAN, MA
    OROSZLAN, S
    FEBS LETTERS, 1983, 156 (01) : 37 - 40
  • [46] PURIFICATION OF MURINE ADIPOCYTE LIPID-BINDING PROTEIN - CHARACTERIZATION AS A FATTY ACID-BINDING AND RETINOIC ACID-BINDING PROTEIN
    MATARESE, V
    BERNLOHR, DA
    JOURNAL OF BIOLOGICAL CHEMISTRY, 1988, 263 (28) : 14544 - 14551
  • [47] Prediction of heme binding residues from protein sequences with integrative sequence profiles
    Xiong, Yi
    Liu, Juan
    Zhang, Wen
    Zeng, Tao
    PROTEOME SCIENCE, 2012, 10
  • [48] Amino Acid Composition in Various Types of Nucleic Acid-Binding Proteins
    Bartas, Martin
    Cerven, Jiri
    Guziurova, Simona
    Slychko, Kristyna
    Pecinka, Petr
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (02) : 1 - 12
  • [49] Prediction of heme binding residues from protein sequences with integrative sequence profiles
    Yi Xiong
    Juan Liu
    Wen Zhang
    Tao Zeng
    Proteome Science, 10
  • [50] PiRaNhA: a server for the computational prediction of RNA-binding residues in protein sequences
    Murakami, Yoichi
    Spriggs, Ruth V.
    Nakamura, Haruki
    Jones, Susan
    NUCLEIC ACIDS RESEARCH, 2010, 38 : W412 - W416