Predicting and annotating catalytic residues: An information theoretic approach

被引:20
|
作者
Sterner, Beckett
Singh, Rohit
Berger, Bonnie [1 ]
机构
[1] MIT, Dept Math, Cambridge, MA 02139 USA
[2] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA USA
关键词
algorithms; computational molecular biology; protein folding; multiple sequence alignment; information theory; FUNCTIONAL SITES; PROTEIN FUNCTION; ACTIVE-SITES; SIMILARITY ANALYSIS; PATTERN SIMILARITY; CRYSTAL-STRUCTURE; SEQUENCE; IDENTIFICATION; TOOL; ENZYMES;
D O I
10.1089/cmb.2007.0042
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We introduce a computational method to predict and annotate the catalytic residues of a protein using only its sequence information, so that we describe both the residues' sequence locations (prediction) and their specific biochemical roles in the catalyzed reaction (annotation). While knowing the chemistry of an enzyme's catalytic residues is essential to understanding its function, the challenges of prediction and annotation have remained difficult, especially when only the enzyme's sequence and no homologous structures are available. Our sequence-based approach follows the guiding principle that catalytic residues performing the same biochemical function should have similar chemical environments; it detects specific conservation patterns near in sequence to known catalytic residues and accordingly constrains what combination of amino acids can be present near a predicted catalytic residue. We associate with each catalytic residue a short sequence profile and define a Kullback-Leibler (KL) distance measure between these profiles, which, as we show, effectively captures even subtle biochemical variations. We apply the method to the class of glycohydrolase enzymes. This class includes proteins from 96 families with very different sequences and folds, many of which perform important functions. In a cross-validation test, our approach correctly predicts the location of the enzymes' catalytic residues with a sensitivity of 80% at a specificity of 99.4%, and in a separate cross-validation we also correctly annotate the biochemical role of 80% of the catalytic residues. Our results compare favorably to existing methods. Moreover, our method is more broadly applicable because it relies on sequence and not structure information; it may, furthermore, be used in conjunction with structure-based methods.
引用
收藏
页码:1058 / 1073
页数:16
相关论文
共 50 条
  • [31] Information theoretic approach to ADC circuits
    Dighe, AM
    Kelkar, AR
    JOURNAL OF THE INSTITUTION OF ELECTRONICS AND TELECOMMUNICATION ENGINEERS, 1995, 41 (5-6): : 325 - 328
  • [32] An information theoretic approach to optical interferometry
    Hope, D
    Prasad, S
    INTERFEROMETRY FOR OPTICAL ASTRONOMY II, PTS 1 AND 2, 2003, 4838 : 398 - 403
  • [33] Interval estimation: An information theoretic approach
    Golan, Amos
    Ullah, Aman
    ECONOMETRIC REVIEWS, 2017, 36 (6-9) : 781 - 795
  • [34] A constraint-based approach for annotating music scores with gestural information
    Radicioni, Daniele P.
    Lombardo, Vincenzo
    CONSTRAINTS, 2007, 12 (04) : 405 - 428
  • [35] Random Access with Imperfect State Information: An Information Theoretic Approach
    Boyer, Colby
    Dastangoo, Siamak
    2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, : 1817 - 1822
  • [36] An information-theoretic approach to statistical dependence: Copula information
    Calsaverini, R. S.
    Vicente, R.
    EPL, 2009, 88 (06)
  • [37] On channel uncertainty modeling - An information theoretic approach
    Krusevac, ZB
    Rapajic, PB
    Kennedy, RA
    GLOBECOM '04: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6, 2004, : 410 - 414
  • [38] A group-theoretic approach to information inequalities
    Chan, HL
    Yeung, RW
    2000 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2000, : 492 - 492
  • [39] A geometric approach to information-theoretic private information retrieval
    Woodruff, David
    Yekhanin, Sergey
    SIAM JOURNAL ON COMPUTING, 2007, 37 (04) : 1046 - 1056
  • [40] A geometric approach to information-theoretic private information retrieval
    Woodruff, D
    Yekhanin, S
    TWENTIETH ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2005, : 275 - 284