HydPred: a novel method for the identification of protein hydroxylation sites that reveals new insights into human inherited disease

被引:8
|
作者
Li, Shuyan [1 ]
Lu, Jun [2 ]
Li, Jiazhong [3 ]
Chen, Ximing [4 ]
Yao, Xiaojun [1 ]
Xi, Lili [5 ]
机构
[1] Lanzhou Univ, Coll Chem & Chem Engn, Lanzhou 730000, Peoples R China
[2] Lanzhou Univ, Sch Basic Med Sci, Lanzhou 730000, Peoples R China
[3] Lanzhou Univ, Sch Pharm, Lanzhou 730000, Peoples R China
[4] Chinese Acad Sci, Cold & Arid Regions Environm & Engn Res Inst, Key Lab Desert & Desertificat, Lanzhou 730000, Peoples R China
[5] Lanzhou Univ, Hosp 1, Dept Pharm, Lanzhou 730000, Peoples R China
基金
中国国家自然科学基金;
关键词
COLLAGEN PROLYL 3-HYDROXYLATION; ESTROGEN HYDROXYLATION; PROLIDASE ACTIVITY; RANDOM FOREST; PREDICTION; HYDROXYPROLINE; HYDROPHOBICITY; EXPRESSION; PEPTIDES; MACHINE;
D O I
10.1039/c5mb00681c
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The disruption of protein hydroxylation is highly associated with several serious diseases and consequently the identification of protein hydroxylation sites has attracted significant attention recently. Here, we report the development of an improved method, called HydPred, to identify protein hydroxylation sites (hydroxyproline and hydroxylysine) based on the synthetic minority over-sampling technique (SMOTE), the random forest (RF) algorithm and four blocks of newly composed features that are derived from the protein primary sequence. The HydPred method achieved the best prediction performance reported until now with Matthew's correlation coefficient values of 0.770 and 0.857 for hydroxyproline and hydroxylysine, respectively, according to jack-knife cross-validation. This represents an improvement of 8% for hydroxyproline and 19% for hydroxylysine compared to the best results of available predictors. The prediction performance of HydPred for the external validation of hydroxyproline and hydroxylysine was also improved compared with other published methods. We subsequently applied HydPred to study the association of disruption of hydroxylation sites with human inherited disease. The analyses suggested that the loss of hydroxylation sites is more likely to cause disease instead of the gain of hydroxylation sites and 52 different human inherited diseases were found to be highly associated with the loss of hydroxylation sites. Therefore, HydPred represents a new strategy to discover the molecular basis of pathogenesis associated with abnormal hydroxylation. HydPred is now available online as a user-friendly web server at http://lishuyan.lzu.edu.cn/hydpred/.
引用
收藏
页码:490 / 498
页数:9
相关论文
共 50 条
  • [21] Single-molecule fluorescence-based approach reveals novel mechanistic insights into human small heat shock protein chaperone function
    Johnston C.L.
    Marzano N.R.
    Paudel B.P.
    Wright G.
    Benesch J.L.P.
    van Oijen A.M.
    Ecroyd H.
    Journal of Biological Chemistry, 2021, 296
  • [22] Single-molecule fluorescence-based approach reveals novel mechanistic insights into human small heat shock protein chaperone function
    Johnston, Caitlin L.
    Marzano, Nicholas R.
    Paudel, Bishnu P.
    Wright, George
    Benesch, Justin L. P.
    van Oijen, Antoine M.
    Ecroyd, Heath
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2021, 296
  • [23] Identification of a novel human DDX40 gene, a new member of the DEAH-box protein family
    J. Xu
    H. Wu
    C. Zhang
    Y. Cao
    L. Wang
    L. Zeng
    X. Ye
    Q. Wu
    J. Dai
    Y. Xie
    Y. Mao
    Journal of Human Genetics, 2002, 47 : 681 - 683
  • [24] Identification of a novel human DDX40 gene, a new member of the DEAH-box protein family
    Xu, J
    Wu, H
    Zhang, CZ
    Cao, JQ
    Wang, L
    Zeng, L
    Ye, X
    Wu, QH
    Dai, JF
    Xie, Y
    Mao, YM
    JOURNAL OF HUMAN GENETICS, 2002, 47 (12) : 681 - 683
  • [25] LINE-1 Display (LID) Mapping, a novel method for identifying insertion sites, reveals frequent insertion site polymorphisms in the human population.
    Swergold, GD
    Sheen, FM
    AMERICAN JOURNAL OF HUMAN GENETICS, 1997, 61 (04) : A212 - A212
  • [26] New Insights into the Impact of Human Papillomavirus on Oral Cancer in Young Patients: Proteomic Approach Reveals a Novel Role for S100A8
    Miranda-Galvis, Marisol
    Soares, Carolina Carneiro
    Carnielli, Carolina Moretto
    Buttura, Jaqueline Ramalho
    de Sa, Raisa Sales
    Kaminagakura, Estela
    Marchi, Fabio Albuquerque
    Leme, Adriana Franco Paes
    Pinto, Clovis A. Lopes
    Santos-Silva, Alan Roger
    Castilho, Rogerio Moraes
    Kowalski, Luiz Paulo
    Squarize, Cristiane Helena
    CELLS, 2023, 12 (09)
  • [27] Identification of Novel Protein Kinase A Phosphorylation Sites in the M-domain of Human and Murine Cardiac Myosin Binding Protein-C Using Mass Spectrometry Analysis
    Jia, Weitao
    Shaffer, Justin F.
    Harris, Samantha P.
    Leary, Julie A.
    JOURNAL OF PROTEOME RESEARCH, 2010, 9 (04) : 1843 - 1853
  • [28] Identification of the human YVH1 protein-tyrosine phosphatase orthologue reveals a novel zinc binding domain essential for in vivo function
    Muda, M
    Manning, ER
    Orth, K
    Dixon, JE
    JOURNAL OF BIOLOGICAL CHEMISTRY, 1999, 274 (34) : 23991 - 23995
  • [29] Identification of a novel nonstructural protein, VP9, from white spot syndrome virus: Its structure reveals a ferredoxin fold with specific metal binding sites
    Liu, Yang
    Wu, Jinlu
    Song, Jianxing
    Sivaraman, J.
    Hew, Choy L.
    JOURNAL OF VIROLOGY, 2006, 80 (21) : 10419 - 10427
  • [30] Inactivation of the lysine binding sites of human plasminogen (hPg) reveals novel structural requirements for the tight hPg conformation, M-protein binding, and rapid activation
    Ayinuola, Yetunde A. A.
    Castellino, Francis J. J.
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2023, 10