CpGIMethPred: computational model for predicting methylation status of CpG islands in human genome

Cited by: 35
Authors
Zheng, Hao [1]
Wu, Hongwei [1]
Li, Jinping [2]
Jiang, Shi-Wen [2]
Affiliations
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Mercer Univ, Sch Med, Dept Biomed Sci, Macon, GA USA
Source
BMC MEDICAL GENOMICS | 2013 / Volume 6
Keywords
DNA METHYLATION; HISTONE ACETYLATION; BROWSER DATABASE; CANCER; VERTEBRATE; SEQUENCES; GENES
DOI
10.1186/1755-8794-6-S1-S13
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
DNA methylation is a heritable chemical modification of cytosine and represents one of the most important epigenetic events. Computational prediction of DNA methylation status can be used to speed up genome-wide methylation profiling and to identify the key features that are correlated with various methylation patterns. Here, we develop CpGIMethPred, a set of support vector machine-based models that predict the methylation status of CpG islands in the human genome under normal conditions. The features used for prediction include those previously demonstrated to be effective (CpG island-specific attributes, DNA sequence composition patterns, DNA structure patterns, distribution patterns of conserved transcription factor binding sites and conserved elements, and histone methylation status) as well as those that have not been extensively explored but are likely to contribute additional information from a biological point of view (nucleosome positioning propensities, gene functions, and histone acetylation status). Statistical tests are performed to identify the features that are significantly correlated with the methylation status of the CpG islands, and principal component analysis is then performed to decorrelate the selected features. Data from the Human Epigenome Project (HEP) are used to train, validate and test the predictive models. Specifically, the models are trained and validated using the DNA methylation data obtained in CD4 lymphocytes, and are then tested for generalizability using the DNA methylation data obtained in the other 11 normal tissues and cell types.
Our experiments have shown that (1) an eight-dimensional feature space that is selected via principal component analysis and that combines all categories of information is effective for predicting CpG island methylation status, (2) by incorporating information on nucleosome positioning, gene functions, and histone acetylation, the models achieve higher specificity and accuracy than the existing models while maintaining comparable sensitivity, (3) the histone modification (methylation and acetylation) information contributes significantly to the prediction, and the performance of the models deteriorates without it, and (4) the predictive models generalize well to different tissues and cell types. The developed program CpGIMethPred is freely available at http://users.ece.gatech.edu/~hzheng7/CGIMetPred.zip.
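The abstract compares models by sensitivity, specificity, and accuracy. As a reading aid, the following minimal sketch shows how these three measures are conventionally computed for a binary classifier. It is not taken from CpGIMethPred: the function name `binary_metrics`, the label coding (1 = methylated as the positive class, 0 = unmethylated), and the toy labels are all assumptions made for illustration.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute sensitivity, specificity, and accuracy for binary labels.

    Assumption (not from the paper): 1 marks a methylated CpG island
    (positive class) and 0 marks an unmethylated one (negative class).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one label is required")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    # A ratio is undefined (NaN) when its class is absent from y_true.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    accuracy = (tp + tn) / len(pairs)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
    }


# Hypothetical toy labels for eight CpG islands (not real HEP data).
toy_true = [1, 1, 1, 0, 0, 0, 0, 0]
toy_pred = [1, 1, 0, 0, 0, 0, 0, 1]
toy_metrics = binary_metrics(toy_true, toy_pred)
print(toy_metrics)
```

When one class is much rarer than the other, accuracy can stay high even if the rare class is predicted poorly, which is one reason to report sensitivity and specificity alongside it.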
Pages: 12
Related papers
50 in total
  • [41] CpG Islands' Clustering Uncovers Early Development Genes in the Human Genome
    Babenko, Vladimir N.
    Bogomolov, Anton G.
    Babenko, Roman O.
    Galieva, Elvira R.
    Orlov, Yuriy L.
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2018, 15 (02) : 473 - 485
  • [42] An evaluation of new criteria for CpG islands in the human genome as gene markers
    Wang, Y
    Leung, FCC
    BIOINFORMATICS, 2004, 20 (07) : 1170 - 1177
  • [43] Developmental features of DNA methylation in CpG islands of human gametes and preimplantation embryos
    Huang, Yuling
    Liu, Haiying
    Du, Hongzhi
    Zhang, Wenhong
    Kang, Xianjing
    Luo, Yang
    Zhou, Xueliang
    Li, Lei
    EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2019, 17 (06) : 4447 - 4456
  • [44] DNA Methylation Levels of the ELMO Gene Promoter CpG Islands in Human Glioblastomas
    Michaelsen, Signe Regner
    Aslan, Derya
    Urup, Thomas
    Poulsen, Hans Skovgaard
    Gronbaek, Kirsten
    Broholm, Helle
    Kristensen, Lasse Sommer
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (03)
  • [45] Detection of CRISPR-mediated genome modifications through altered methylation patterns of CpG islands
    Farris, M. Heath
    Texter, Pamela A.
    Mora, Agustin A.
    Wiles, Michael, V
    Mac Garrigle, Ellen F.
    Klaus, Sybil A.
    Rosfjord, Kristine
    BMC GENOMICS, 2020, 21 (01)
  • [46] Detection of CRISPR-mediated genome modifications through altered methylation patterns of CpG islands
    M. Heath Farris
    Pamela A. Texter
    Agustin A. Mora
    Michael V. Wiles
    Ellen F. Mac Garrigle
    Sybil A. Klaus
    Kristine Rosfjord
    BMC Genomics, 21
  • [47] Genome-wide methylation analysis of retrocopy-associated CpG islands and their genomic environment
    Grothaus, Katrin
    Kanber, Deniz
    Gellhaus, Alexandra
    Mikat, Barbara
    Kolarova, Julia
    Siebert, Reiner
    Wieczorek, Dagmar
    Horsthemke, Bernhard
    EPIGENETICS, 2016, 11 (03) : 216 - 226
  • [48] Expression of mRNA for DNA methyltransferases and methyl-CpG-binding proteins and DNA methylation status on CpG islands and pericentromeric satellite regions during human hepatocarcinogenesis
    Saito, Y
    Kanai, Y
    Sakamoto, M
    Saito, H
    Ishii, H
    Hirohashi, S
    HEPATOLOGY, 2001, 33 (03) : 561 - 568
  • [49] Allelic methylation status of CpG islands on chromosome 21q in patients with Trisomy 21
    Xia, Yin-Yin
    Ding, Yu-Bing
    Liu, Xue-Qing
    Chen, Xue-Mei
    Cheng, Shu-Qun
    Li, Lian-Bing
    Ma, Ming-Fu
    He, Jun-Lin
    Wang, Ying-Xiong
    MOLECULAR MEDICINE REPORTS, 2014, 9 (05) : 1681 - 1688
  • [50] METHYLATION STATUS OF CPG-RICH ISLANDS ON ACTIVE AND INACTIVE MOUSE X-CHROMOSOMES
    NORRIS, DP
    BROCKDORFF, N
    RASTAN, S
    MAMMALIAN GENOME, 1991, 1 (02) : 78 - 83