Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences

被引:0
|
作者
Basu, Sushmita [1 ]
Yu, Jing [1 ]
Kihara, Daisuke [2 ,3 ]
Kurgan, Lukasz [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, 401 West Main St, Richmond, VA 23284 USA
[2] Purdue Univ, Dept Biol Sci, 915 Mitch Daniels Blvd, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Comp Sci, 305 N Univ St, W Lafayette, IN 47907 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
protein-DNA interaction; protein-RNA interaction; nucleic acid-binding; DNA-binding residue; RNA-binding residue; intrinsic disorder; sequence-based prediction; machine learning; deep learning; INTRINSICALLY DISORDERED PROTEINS; SECONDARY STRUCTURE PREDICTION; COMPUTATIONAL PREDICTION; AMINO-ACIDS; WEB SERVER; RNA RECOGNITION; DNA; SITES; EVOLUTIONARY; COMPLEXES;
D O I
10.1093/bib/bbaf016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Computational prediction of nucleic acid-binding residues in protein sequences is an active field of research, with over 80 methods that were released in the past 2 decades. We identify and discuss 87 sequence-based predictors that include dozens of recently published methods that are surveyed for the first time. We overview historical progress and examine multiple practical issues that include availability and impact of predictors, key features of their predictive models, and important aspects related to their training and assessment. We observe that the past decade has brought increased use of deep neural networks and protein language models, which contributed to substantial gains in the predictive performance. We also highlight advancements in vital and challenging issues that include cross-predictions between deoxyribonucleic acid (DNA)-binding and ribonucleic acid (RNA)-binding residues and targeting the two distinct sources of binding annotations, structure-based versus intrinsic disorder-based. The methods trained on the structure-annotated interactions tend to perform poorly on the disorder-annotated binding and vice versa, with only a few methods that target and perform well across both annotation types. The cross-predictions are a significant problem, with some predictors of DNA-binding or RNA-binding residues indiscriminately predicting interactions with both nucleic acid types. Moreover, we show that methods with web servers are cited substantially more than tools without implementation or with no longer working implementations, motivating the development and long-term maintenance of the web servers. We close by discussing future research directions that aim to drive further progress in this area.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] STRUCTURES OF SOME NUCLEIC ACID-BINDING DRUGS
    ACHARI, A
    JONES, TA
    NEIDLE, S
    ACTA CRYSTALLOGRAPHICA SECTION A, 1975, 31 : S51 - S51
  • [22] Prediction of fatty acid-binding residues on protein surfaces with three-dimensional probability distributions of interacting atoms
    Mahalingam, Rajasekaran
    Peng, Hung-Pin
    Yang, An-Suei
    BIOPHYSICAL CHEMISTRY, 2014, 192 : 10 - 19
  • [23] A human gene coding for a membrane-associated nucleic acid-binding protein
    Siess, DC
    Vedder, CT
    Merksen, LS
    Tanaka, T
    Freed, AC
    McCoy, SL
    Heinrich, MC
    Deffebach, ME
    Bennett, RM
    Hefeneider, SH
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (43) : 33655 - 33662
  • [24] Diverse roles of the nucleic acid-binding protein KHSRP in cell differentiation and disease
    Briata, Paola
    Bordo, Domenico
    Puppo, Margherita
    Gorlero, Franco
    Rossi, Martina
    Perrone-Bizzozero, Nora
    Gherzi, Roberto
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2016, 7 (02) : 227 - 240
  • [25] Zebrafish cellular nucleic acid-binding protein: gene structure and developmental behaviour
    Armas, P
    Cachero, S
    Lombardo, VA
    Weiner, A
    Allende, ML
    Calcaterra, NB
    GENE, 2004, 337 : 151 - 161
  • [26] A transcript encoding a nucleic acid-binding protein specifically expressed in maize seeds
    A. Heyl
    J. Muth
    G. Santandrea
    T. O'Connell
    A. Serna
    R. Thompson
    Molecular Genetics and Genomics, 2001, 266 : 180 - 189
  • [27] A transcript encoding a nucleic acid-binding protein specifically expressed in maize seeds
    Heyl, A
    Muth, J
    Santandrea, G
    O'Connell, T
    Serna, A
    Thompson, RD
    MOLECULAR GENETICS AND GENOMICS, 2001, 266 (02) : 180 - 189
  • [28] Cellular nucleic acid-binding protein is vital to testis development and spermatogenesis in mice
    Zheng, Bo
    Yu, Jun
    Guo, Yueshuai
    Gao, Tingting
    Shen, Cong
    Zhang, Xi
    Li, Hong
    Huang, Xiaoyan
    REPRODUCTION, 2018, 156 (01) : 59 - 69
  • [29] Cooperative binding to nucleic acids by barley yellow mosaic bymovirus coat protein and characterization of a nucleic acid-binding domain
    Reichel, C
    Maas, C
    Schulze, S
    Schell, J
    Steinbiss, HH
    JOURNAL OF GENERAL VIROLOGY, 1996, 77 : 587 - 592
  • [30] Biochemical Roles for Conserved Residues in the Bacterial Fatty Acid-binding Protein Family
    Broussard, Tyler C.
    Miller, Darcie J.
    Jackson, Pamela
    Nourse, Amanda
    White, Stephen W.
    Rock, Charles O.
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2016, 291 (12) : 6292 - 6303