Genome-Wide Prediction of DNA Methylation Using DNA Composition and Sequence Complexity in Human

被引:7
|
作者
Wu, Chengchao [1 ]
Yao, Shixin [2 ]
Li, Xinghao [2 ]
Chen, Chujia [1 ]
Hu, Xuehai [1 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Agr Bioinformat Key Lab Hubei Prov, Wuhan 430070, Peoples R China
[2] Huazhong Agr Univ, Coll Sci, Wuhan 430070, Peoples R China
基金
中国国家自然科学基金;
关键词
DNA methylation; predicted model; sequence complexity; S-NITROSYLATION SITES; LYSINE SUCCINYLATION SITES; WEB SERVER; PSEUDO COMPONENTS; CPG ISLANDS; PROTEINS; ENTROPY; PSEKNC; PSEAAC; MODES;
D O I
10.3390/ijms18020420
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DNA methylation plays a significant role in transcriptional regulation by repressing activity. Change of the DNA methylation level is an important factor affecting the expression of target genes and downstream phenotypes. Because current experimental technologies can only assay a small proportion of CpG sites in the human genome, it is urgent to develop reliable computational models for predicting genome-wide DNA methylation. Here, we proposed a novel algorithm that accurately extracted sequence complexity features (seven features) and developed a support-vector-machine-based prediction model with integration of the reported DNA composition features (trinucleotide frequency and GC content, 65 features) by utilizing the methylation profiles of embryonic stem cells in human. The prediction results from 22 human chromosomes with size-varied windows showed that the 600-bp window achieved the best average accuracy of 94.7%. Moreover, comparisons with two existing methods further showed the superiority of our model, and cross-species predictions on mouse data also demonstrated that our model has certain generalization ability. Finally, a statistical test of the experimental data and the predicted data on functional regions annotated by ChromHMM found that six out of 10 regions were consistent, which implies reliable prediction of unassayed CpG sites. Accordingly, we believe that our novel model will be useful and reliable in predicting DNA methylation.
引用
下载
收藏
页数:21
相关论文
共 50 条
  • [1] Genome-wide DNA methylation changes in human
    Siebert-Kuss, Lara M.
    Dietrich, Verena
    Di Persio, Sara
    Bhaskaran, Jahnavi
    Stehling, Martin
    Cremers, Jann-Frederik
    Sandmann, Sarah
    Varghese, Julian
    Kliesch, Sabine
    Schlatt, Stefan
    Vaquerizas, Juan M.
    Neuhaus, Nina
    Laurentino, Sandra
    AMERICAN JOURNAL OF HUMAN GENETICS, 2024, 111 (06) : 1125 - 1139
  • [2] Prediction of genome-wide DNA methylation in repetitive elements
    Zheng, Yinan
    Joyce, Brian T.
    Liu, Lei
    Zhang, Zhou
    Kibbe, Warren A.
    Zhang, Wei
    Hou, Lifang
    NUCLEIC ACIDS RESEARCH, 2017, 45 (15) : 8697 - 8711
  • [3] Genome-wide DNA methylation in human heart failure
    Movassagh, Mehregan
    Vujic, Ana
    Foo, Roger
    EPIGENOMICS, 2011, 3 (01) : 103 - 109
  • [4] Genome-wide analysis of DNA methylation in human atherosclerosis
    Oguri, M.
    Sawabe, M.
    Horibe, H.
    Murohara, T.
    Kato, K.
    Nishida, T.
    Yamada, Y.
    EUROPEAN HEART JOURNAL, 2013, 34 : 488 - 488
  • [5] Genome-Wide Analysis of DNA Methylation in Human Amnion
    Kim, Jinsil
    Pitlick, Mitchell M.
    Christine, Paul J.
    Schaefer, Amanda R.
    Saleme, Cesar
    Comas, Belen
    Cosentino, Viviana
    Gadow, Enrique
    Murray, Jeffrey C.
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [6] Phenotype prediction based on genome-wide DNA methylation data
    Wilhelm, Thomas
    BMC BIOINFORMATICS, 2014, 15
  • [7] Phenotype prediction based on genome-wide DNA methylation data
    Thomas Wilhelm
    BMC Bioinformatics, 15
  • [8] Profiling genome-wide DNA methylation
    Wai-Shin Yong
    Fei-Man Hsu
    Pao-Yang Chen
    Epigenetics & Chromatin, 9
  • [9] Profiling genome-wide DNA methylation
    Yong, Wai-Shin
    Hsu, Fei-Man
    Chen, Pao-Yang
    EPIGENETICS & CHROMATIN, 2016, 9
  • [10] Genome-wide DNA methylation profiling
    Bibikova, Marina
    Fan, Jian-Bing
    WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE, 2010, 2 (02) : 210 - 223