HydPred: a novel method for the identification of protein hydroxylation sites that reveals new insights into human inherited disease

Cited by: 8
Authors
Li, Shuyan [1]
Lu, Jun [2]
Li, Jiazhong [3]
Chen, Ximing [4]
Yao, Xiaojun [1]
Xi, Lili [5]
Affiliations
[1] Lanzhou Univ, Coll Chem & Chem Engn, Lanzhou 730000, Peoples R China
[2] Lanzhou Univ, Sch Basic Med Sci, Lanzhou 730000, Peoples R China
[3] Lanzhou Univ, Sch Pharm, Lanzhou 730000, Peoples R China
[4] Chinese Acad Sci, Cold & Arid Regions Environm & Engn Res Inst, Key Lab Desert & Desertificat, Lanzhou 730000, Peoples R China
[5] Lanzhou Univ, Hosp 1, Dept Pharm, Lanzhou 730000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
COLLAGEN PROLYL 3-HYDROXYLATION; ESTROGEN HYDROXYLATION; PROLIDASE ACTIVITY; RANDOM FOREST; PREDICTION; HYDROXYPROLINE; HYDROPHOBICITY; EXPRESSION; PEPTIDES; MACHINE;
DOI
10.1039/c5mb00681c
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
The disruption of protein hydroxylation is strongly associated with several serious diseases, and the identification of protein hydroxylation sites has consequently attracted significant attention. Here, we report the development of an improved method, called HydPred, to identify protein hydroxylation sites (hydroxyproline and hydroxylysine) based on the synthetic minority over-sampling technique (SMOTE), the random forest (RF) algorithm, and four blocks of newly composed features derived from the protein primary sequence. According to jack-knife cross-validation, HydPred achieved the best prediction performance reported to date, with Matthews correlation coefficient values of 0.770 for hydroxyproline and 0.857 for hydroxylysine. This represents an improvement of 8% for hydroxyproline and 19% for hydroxylysine over the best results of available predictors. The prediction performance of HydPred in external validation for hydroxyproline and hydroxylysine was also improved compared with other published methods. We subsequently applied HydPred to study the association between the disruption of hydroxylation sites and human inherited disease. The analyses suggested that the loss of hydroxylation sites is more likely to cause disease than the gain of hydroxylation sites, and 52 different human inherited diseases were found to be highly associated with the loss of hydroxylation sites. HydPred therefore represents a new strategy for discovering the molecular basis of pathogenesis associated with abnormal hydroxylation. HydPred is available online as a user-friendly web server at http://lishuyan.lzu.edu.cn/hydpred/.
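The abstract names two building blocks that are easy to illustrate: SMOTE-style oversampling of the minority class and the Matthews correlation coefficient used to score predictions. The following is a minimal sketch only, not the HydPred implementation: the function names, the neighbour count `k`, the fixed `seed`, and the toy two-dimensional feature vectors are all assumptions made for illustration, and the random forest step and the four feature blocks are not reproduced here.

```python
import math
import random


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when the denominator is zero (a common convention).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Generate n_new synthetic minority samples (hypothetical helper).

    Each synthetic point lies on the line segment between a randomly
    chosen minority sample and one of its k nearest minority neighbours.
    Requires at least two minority samples.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Nearest minority neighbours of the chosen sample, excluding itself.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


if __name__ == "__main__":
    # Toy feature vectors standing in for encoded sequence windows.
    positives = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(len(smote_like_oversample(positives, n_new=5)))
    print(matthews_corrcoef(tp=10, tn=10, fp=0, fn=0))
```

In a pipeline like the one the abstract describes, the synthetic samples would be added to the training set before fitting a classifier, and the coefficient would be computed on held-out predictions.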
Pages: 490 - 498
Page count: 9
Related papers
50 in total
  • [1] In Silico Identification of Protein S-Palmitoylation Sites and Their Involvement in Human Inherited Disease
    Li, Shuyan
    Li, Jiazhong
    Ning, Lulu
    Wang, Shaopeng
    Niu, Yuzhen
    Jin, Nengzhi
    Yao, Xiaojun
    Liu, Huanxiang
    Xi, Lili
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (09) : 2015 - 2025
  • [2] New Insights into Protein Hydroxylation and Its Important Role in Human Diseases
    Zurlo, Giada
    Guo, Jianping
    Takada, Mamoru
    Wei, Wenyi
    Zhang, Qing
    BIOCHIMICA ET BIOPHYSICA ACTA-REVIEWS ON CANCER, 2016, 1866 (02) : 208 - 220
  • [3] The human splicing code reveals new insights into the genetic determinants of disease
    Xiong, Hui Y.
    Alipanahi, Babak
    Lee, Leo J.
    Bretschneider, Hannes
    Merico, Daniele
    Yuen, Ryan K. C.
    Hua, Yimin
    Gueroussov, Serge
    Najafabadi, Hamed S.
    Hughes, Timothy R.
    Morris, Quaid
    Barash, Yoseph
    Krainer, Adrian R.
    Jojic, Nebojsa
    Scherer, Stephen W.
    Blencowe, Benjamin J.
    Frey, Brendan J.
    SCIENCE, 2015, 347 (6218)
  • [4] Identification of Sequence Variants within Experimentally Validated Protein Interaction Sites Provides New Insights into Molecular Mechanisms of Disease Development
    Skrlj, Blaz
    Konc, Janez
    Kunej, Tanja
    MOLECULAR INFORMATICS, 2017, 36 (09)
  • [5] Mouse models for human epithelial disease: novel insights and new horizons
    Fong, Peying
    EXPERIMENTAL PHYSIOLOGY, 2009, 94 (02) : 169 - 170
  • [6] Systematic Analysis of Non-Protein Coding Sequence Variation Reveals Putative Pathogenic Mutations Causing Inherited Human Retinal Disease
    Soens, Zachry
    Zaneveld, Jacques E.
    Gelowani, Violet
    Jiang, Lichun
    Sui, Ruifang
    Koenekoop, Robert K.
    Chen, Rui
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2014, 55 (13)
  • [7] IDENTIFICATION OF CLEAVAGE FRAGMENTS FROM A NOVEL, HUMAN PROTEIN IN SYNOVIAL-FLUID - A NEW, POTENTIAL INDEX FOR INFLAMMATORY JOINT DISEASE
    KERR, GS
    LANGLOIS, PF
    HAMMER, CH
    FRANK, MM
    CLINICAL RESEARCH, 1990, 38 (03) : A787 - A787
  • [8] Open Modification Searching of SARS-CoV-2-Human Protein Interaction Data Reveals Novel Viral Modification Sites
    Adams, Charlotte
    Boonen, Kurt
    Laukens, Kris
    Bittremieux, Wout
    MOLECULAR & CELLULAR PROTEOMICS, 2022, 21 (12)
  • [9] Serum Proteomic Signature of Human Chagasic Patients for the Identification of Novel Potential Protein Biomarkers of Disease
    Wen, Jian-Jun
    Paola Zago, M.
    Nunez, Sonia
    Gupta, Shivali
    Nunez Burgos, Federico
    Jain Garg, Nisha
    MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (08) : 435 - 452
  • [10] A novel computational model of swine ventricular myocyte reveals new insights into disease mechanisms and therapeutic approaches in Timothy Syndrome
    Trancuccio, Alessandro
    Tarifa, Carmen
    Bongianino, Rossana
    Priori, Silvia G.
    Santiago, Demetrio J.
    SCIENTIFIC REPORTS, 2024, 14 (01)