On the prediction of non-CG DNA methylation using machine learning

被引:2
|
作者
Sereshki, Saleh [1 ]
Lee, Nathan [1 ]
Omirou, Michalis [2 ]
Fasoula, Dionysia [3 ]
Lonardi, Stefano [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[2] Agr Res Inst, Dept Agrobiotechnol, Agr Microbiol Lab, CY-1516 Nicosia, Cyprus
[3] Agr Res Inst, Dept Plant Breeding, CY-1516 Nicosia, Cyprus
关键词
GENOME;
D O I
10.1093/nargab/lqad045
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
DNA methylation can be detected and measured using sequencing instruments after sodium bisulfite conversion, but experiments can be expensive for large eukaryotic genomes. Sequencing nonuniformity and mapping biases can leave parts of the genome with low or no coverage, thus hampering the ability of obtaining DNA methylation levels for all cytosines. To address these limitations, several computational methods have been proposed that can predict DNA methylation from the DNA sequence around the cytosine or from the methylation level of nearby cytosines. However, most of these methods are entirely focused on CG methylation in humans and other mammals. In this work, we study, for the first time, the problem of predicting cytosine methylation for CG, CHG and CHH contexts on six plant species, either from the DNA primary sequence around the cytosine or from the methylation levels of neighboring cytosines. In this framework, we also study the cross-species prediction problem and the cross-context prediction problem (within the same species). Finally, we show that providing gene and repeat annotations allows existing classifiers to significantly improve their prediction accuracy. We introduce a new classifier called AMPS (annotation-based methylation prediction from sequence) that takes advantage of genomic annotations to achieve higher accuracy.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] RNAi, DRD1, and histone methylation actively target developmentally important non-CG DNA methylation in Arabidopsis
    Chan, Simon W. -L.
    Henderson, Ian R.
    Zhang, Xiaoyu
    Shah, Govind
    Chien, Jason S. -C.
    Jacobsen, Steven E.
    PLOS GENETICS, 2006, 2 (06): : 791 - 797
  • [22] Non-CG DNA methylation is a biomarker for assessing endodermal differentiation capacity in pluripotent stem cells
    Lee M. Butcher
    Mitsuteru Ito
    Minodora Brimpari
    Tiffany J. Morris
    Filipa A. C. Soares
    Lars Ährlund-Richter
    Nessa Carey
    Ludovic Vallier
    Anne C. Ferguson-Smith
    Stephan Beck
    Nature Communications, 7
  • [23] Non-CG DNA methylation is a biomarker for assessing endodermal differentiation capacity in pluripotent stem cells
    Butcher, Lee M.
    Ito, Mitsuteru
    Brimpari, Minodora
    Morris, Tiffany J.
    Soares, Filipa A. C.
    Ahrlund-Richter, Lars
    Carey, Nessa
    Vallier, Ludovic
    Ferguson-Smith, Anne C.
    Beck, Stephan
    NATURE COMMUNICATIONS, 2016, 7
  • [24] Caloric restriction mitigates age-associated hippocampal differential CG and non-CG methylation
    Hadad, Niran
    Unnikrishnan, Archana
    Jackson, Jordan A.
    Masser, Dustin R.
    Otalora, Laura
    Stanford, David R.
    Richardson, Arlan
    Freeman, Willard M.
    NEUROBIOLOGY OF AGING, 2018, 67 : 53 - 66
  • [25] Neuronal non-CG methylation is an essential target for MeCP2 function
    Tillotson, Rebekah
    Cholewa-Waclaw, Justyna
    Chhatbar, Kashyap
    Connelly, John C.
    Kirschner, Sophie A.
    Webb, Shaun
    Koerner, Martha V.
    Selfridge, Jim
    Kelly, David A.
    De Sousa, Dina
    Brown, Kyla
    Lyst, Matthew J.
    Kriaucionis, Skirmantas
    Bird, Adrian
    MOLECULAR CELL, 2021, 81 (06) : 1260 - 1275.e12
  • [26] Allele-specific non-CG DNA methylation marks domains of active chromatin in female mouse brain
    Keown, Christopher L.
    Berletch, Joel B.
    Castanon, Rosa
    Nery, Joseph R.
    Disteche, Christine M.
    Ecker, Joseph R.
    Mukamel, Eran A.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (14) : E2882 - E2890
  • [27] Hyper non-CG methylation of expanded plant disease resistance NLR genes
    Qianqian Cao
    Shichao Wang
    Huizhen Ma
    Sha Luo
    Hai He
    Yu Zhang
    Plant Cell Reports, 2023, 42 : 1251 - 1254
  • [28] Tandem repeats upstream of the Arabidopsis endogene SDC recruit non-CG DNA methylation and initiate siRNA spreading
    Henderson, Ian R.
    Jacobsen, Steven E.
    GENES & DEVELOPMENT, 2008, 22 (12) : 1597 - 1606
  • [29] Hyper non-CG methylation of expanded plant disease resistance NLR genes
    Cao, Qianqian
    Wang, Shichao
    Ma, Huizhen
    Luo, Sha
    He, Hai
    Zhang, Yu
    PLANT CELL REPORTS, 2023, 42 (07) : 1251 - 1254
  • [30] Human non-CG methylation Are human stem cells plant-like?
    Dyachenko, Olga V.
    Shevchuk, TarasV.
    Kretzner, Leo
    Buryanov, Yaroslav I.
    Smith, Steven S.
    EPIGENETICS, 2010, 5 (07) : 569 - 572