Genome-Wide Prediction of DNA Methylation Using DNA Composition and Sequence Complexity in Human

被引:7
|
作者
Wu, Chengchao [1 ]
Yao, Shixin [2 ]
Li, Xinghao [2 ]
Chen, Chujia [1 ]
Hu, Xuehai [1 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Agr Bioinformat Key Lab Hubei Prov, Wuhan 430070, Peoples R China
[2] Huazhong Agr Univ, Coll Sci, Wuhan 430070, Peoples R China
基金
中国国家自然科学基金;
关键词
DNA methylation; predicted model; sequence complexity; S-NITROSYLATION SITES; LYSINE SUCCINYLATION SITES; WEB SERVER; PSEUDO COMPONENTS; CPG ISLANDS; PROTEINS; ENTROPY; PSEKNC; PSEAAC; MODES;
D O I
10.3390/ijms18020420
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DNA methylation plays a significant role in transcriptional regulation by repressing activity. Change of the DNA methylation level is an important factor affecting the expression of target genes and downstream phenotypes. Because current experimental technologies can only assay a small proportion of CpG sites in the human genome, it is urgent to develop reliable computational models for predicting genome-wide DNA methylation. Here, we proposed a novel algorithm that accurately extracted sequence complexity features (seven features) and developed a support-vector-machine-based prediction model with integration of the reported DNA composition features (trinucleotide frequency and GC content, 65 features) by utilizing the methylation profiles of embryonic stem cells in human. The prediction results from 22 human chromosomes with size-varied windows showed that the 600-bp window achieved the best average accuracy of 94.7%. Moreover, comparisons with two existing methods further showed the superiority of our model, and cross-species predictions on mouse data also demonstrated that our model has certain generalization ability. Finally, a statistical test of the experimental data and the predicted data on functional regions annotated by ChromHMM found that six out of 10 regions were consistent, which implies reliable prediction of unassayed CpG sites. Accordingly, we believe that our novel model will be useful and reliable in predicting DNA methylation.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Challenges in understanding genome-wide DNA methylation
    Zhang M.Q.
    Smith A.D.
    Journal of Computer Science and Technology, 2010, 25 (1) : 26 - 34
  • [22] Genome-wide DNA Methylation Mapping in Cancer
    Shen, Lanlan
    TUMOR BIOLOGY, 2008, 29 : 16 - 16
  • [23] Genome-Wide Mapping of DNA Methylation in Chicken
    Li, Qinghe
    Li, Ning
    Hu, Xiaoxiang
    Li, Jinxiu
    Du, Zhuo
    Chen, Li
    Yin, Guangliang
    Duan, Jinjie
    Zhang, Haichao
    Zhao, Yaofeng
    Wang, Jun
    Li, Ning
    PLOS ONE, 2011, 6 (05):
  • [24] Genome-wide DNA methylation profile in mungbean
    Kang, Yang Jae
    Bae, Ahra
    Shim, Sangrea
    Lee, Taeyoung
    Lee, Jayern
    Satyawan, Dani
    Kim, Moon Young
    Lee, Suk-Ha
    SCIENTIFIC REPORTS, 2017, 7
  • [25] Challenges in Understanding Genome-Wide DNA Methylation
    张伟奇
    Andrew D. Smith
    Journal of Computer Science & Technology, 2010, 25 (01) : 26 - 34
  • [26] Genome-wide DNA methylation profile in mungbean
    Yang Jae Kang
    Ahra Bae
    Sangrea Shim
    Taeyoung Lee
    Jayern Lee
    Dani Satyawan
    Moon Young Kim
    Suk-Ha Lee
    Scientific Reports, 7
  • [27] A genome-wide DNA methylation study in azoospermia
    Ferfouri, F.
    Boitrelle, F.
    Ghout, I.
    Albert, M.
    Gomes, D. Molina
    Wainer, R.
    Bailly, M.
    Selva, J.
    Vialard, F.
    ANDROLOGY, 2013, 1 (06) : 815 - 821
  • [28] Nightshift work and genome-wide DNA methylation
    Bhatti, Parveen
    Zhang, Yuzheng
    Song, Xiaoling
    Makar, Karen W.
    Sather, Cassandra L.
    Kelsey, Karl T.
    Houseman, E. Andres
    Wang, Pei
    CHRONOBIOLOGY INTERNATIONAL, 2015, 32 (01) : 103 - 112
  • [29] Challenges in Understanding Genome-Wide DNA Methylation
    Zhang, Michael Q.
    Smith, Andrew D.
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (01) : 26 - 34
  • [30] Genome-wide analysis of DNA methylation patterns
    Zilberman, Daniel
    Henikoff, Steven
    DEVELOPMENT, 2007, 134 (22): : 3959 - 3965