On the prediction of non-CG DNA methylation using machine learning

被引:2
|
作者
Sereshki, Saleh [1 ]
Lee, Nathan [1 ]
Omirou, Michalis [2 ]
Fasoula, Dionysia [3 ]
Lonardi, Stefano [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[2] Agr Res Inst, Dept Agrobiotechnol, Agr Microbiol Lab, CY-1516 Nicosia, Cyprus
[3] Agr Res Inst, Dept Plant Breeding, CY-1516 Nicosia, Cyprus
关键词
GENOME;
D O I
10.1093/nargab/lqad045
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
DNA methylation can be detected and measured using sequencing instruments after sodium bisulfite conversion, but experiments can be expensive for large eukaryotic genomes. Sequencing nonuniformity and mapping biases can leave parts of the genome with low or no coverage, thus hampering the ability of obtaining DNA methylation levels for all cytosines. To address these limitations, several computational methods have been proposed that can predict DNA methylation from the DNA sequence around the cytosine or from the methylation level of nearby cytosines. However, most of these methods are entirely focused on CG methylation in humans and other mammals. In this work, we study, for the first time, the problem of predicting cytosine methylation for CG, CHG and CHH contexts on six plant species, either from the DNA primary sequence around the cytosine or from the methylation levels of neighboring cytosines. In this framework, we also study the cross-species prediction problem and the cross-context prediction problem (within the same species). Finally, we show that providing gene and repeat annotations allows existing classifiers to significantly improve their prediction accuracy. We introduce a new classifier called AMPS (annotation-based methylation prediction from sequence) that takes advantage of genomic annotations to achieve higher accuracy.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Transgenerational establishment of CG and non-CG DNA methylation patterns in Arabidopsis
    Taiko, To
    Tarutani, Yoshiaki
    Kato, Kae
    Inagaki, Soichi
    Ito, Tasuku
    Takahashi, Mayumi
    Toyoda, Atsushi
    Fujiyama, Asao
    Vincent, Colot
    Kakutani, Tetsuji
    GENES & GENETIC SYSTEMS, 2016, 91 (06) : 348 - 348
  • [2] Transposon age and non-CG methylation
    Zhengming Wang
    David C. Baulcombe
    Nature Communications, 11
  • [3] Developmental remodelling of non-CG methylation a satellite DNA repeats
    Ross, Samuel E.
    Angeloni, Allegra
    Geng, Fan-Suo
    de Mendoza, Alex
    Bogdanovic, Ozren
    NUCLEIC ACIDS RESEARCH, 2020, 48 (22) : 12675 - 12688
  • [4] Non-CG Methylation in the Human Genome
    He, Yupeng
    Ecker, Joseph R.
    ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 16, 2015, 16 : 55 - 77
  • [5] Transposon age and non-CG methylation
    Wang, Zhengming
    Baulcombe, David C.
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [6] Role of CG and non-CG methylation in transposon immobilization
    Kato, M
    Miura, A
    Kakutani, T
    PLANT AND CELL PHYSIOLOGY, 2003, 44 : S13 - S13
  • [7] Role of CG and non-CG methylation in immobilization of transposons in arabidopsis
    Kato, M
    Miura, A
    Bender, J
    Jacobsen, SE
    Kakutani, T
    CURRENT BIOLOGY, 2003, 13 (05) : 421 - 426
  • [8] DNA methylation mutants in Physcomitrella patens elucidate individual roles of CG and non-CG methylation in genome regulation
    Domb, Katherine
    Katz, Aviva
    Harris, Keith D.
    Yaari, Rafael
    Kaisler, Efrat
    Nguyen, Vu H.
    Hong, Uyen V. T.
    Griess, Ofir
    Heskiau, Karina G.
    Ohad, Nir
    Zemach, Assaf
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (52) : 33700 - 33710
  • [9] Dysregulation of non-CG methylation by child abuse
    Lutz, P.
    Chay, M.
    Yang, J.
    Aguirre, M.
    van Kempen, L. C.
    Theroux, J.
    Kwan, T.
    Redensek, A.
    Ernst, C.
    Pastinen, T.
    Turecki, G.
    BIPOLAR DISORDERS, 2018, 20 : 17 - 17
  • [10] Dysregulation of Non-Cg Methylation by Child Abuse
    Lutz, Pierre-Eric
    Chay, Marc-Aurele
    Theroux, Jean-Francois
    Kwan, Tony
    Redensek, Adriana
    Mechawar, Naguib
    Pastinen, Tomi
    Turecki, Gustavo
    BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S7 - S7