DNA methylation can be detected and measured using sequencing instruments after sodium bisulfite conversion, but experiments can be expensive for large eukaryotic genomes. Sequencing nonuniformity and mapping biases can leave parts of the genome with low or no coverage, hampering the ability to obtain DNA methylation levels for all cytosines. To address these limitations, several computational methods have been proposed that predict DNA methylation from the DNA sequence around the cytosine or from the methylation levels of nearby cytosines. However, most of these methods focus entirely on CG methylation in humans and other mammals. In this work, we study, for the first time, the problem of predicting cytosine methylation in the CG, CHG and CHH contexts in six plant species, either from the primary DNA sequence around the cytosine or from the methylation levels of neighboring cytosines. Within this framework, we also study the cross-species prediction problem and the cross-context prediction problem (within the same species). Finally, we show that providing gene and repeat annotations allows existing classifiers to significantly improve their prediction accuracy. We introduce a new classifier called AMPS (annotation-based methylation prediction from sequence) that takes advantage of genomic annotations to achieve higher accuracy.
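The three sequence contexts named in the abstract follow the standard IUPAC convention, in which H stands for any base other than G (A, C or T). The sketch below illustrates how a cytosine could be assigned to a context and how the surrounding sequence could be turned into a numeric feature window. It is an illustration only, not the AMPS implementation: the function names `cytosine_context` and `one_hot_window`, the forward-strand-only handling, and the `flank` parameter are all assumptions made for this example.

```python
from typing import List, Optional

BASES = "ACGT"
H_BASES = {"A", "C", "T"}  # IUPAC code H: any base except G


def cytosine_context(seq: str, pos: int) -> Optional[str]:
    """Return 'CG', 'CHG' or 'CHH' for a forward-strand cytosine at pos.

    Returns None if pos is not a cytosine, or if the downstream bases
    are missing or ambiguous (e.g. N). Reverse-strand cytosines are not
    handled in this sketch.
    """
    seq = seq.upper()
    if not (0 <= pos < len(seq)) or seq[pos] != "C":
        return None
    downstream = seq[pos + 1 : pos + 3]
    if downstream[:1] == "G":
        return "CG"
    if len(downstream) == 2 and downstream[0] in H_BASES:
        if downstream[1] == "G":
            return "CHG"
        if downstream[1] in H_BASES:
            return "CHH"
    return None


def one_hot_window(seq: str, pos: int, flank: int = 3) -> List[List[int]]:
    """One-hot encode positions pos-flank .. pos+flank (columns A, C, G, T).

    Positions outside the sequence, or bases other than A/C/G/T,
    are encoded as all zeros.
    """
    seq = seq.upper()
    rows = []
    for i in range(pos - flank, pos + flank + 1):
        base = seq[i] if 0 <= i < len(seq) else "N"
        rows.append([1 if base == b else 0 for b in BASES])
    return rows


if __name__ == "__main__":
    example = "ACGTCAGCTTC"  # made-up sequence for illustration
    for p, b in enumerate(example):
        if b == "C":
            print(p, cytosine_context(example, p))
```

A window encoded this way could serve as the input to a sequence-based classifier; the window size used here is arbitrary and would need to be chosen for the task at hand.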
Affiliations:
Garvan Inst Med Res, Genom & Epigenet Div, Sydney, NSW 2010, Australia
Univ New South Wales, Fac Med, St Vincents Clin Sch, Sydney, NSW 2010, Australia
Ross, Samuel E.
Angeloni, Allegra
Affiliations:
Garvan Inst Med Res, Genom & Epigenet Div, Sydney, NSW 2010, Australia
Univ New South Wales, Fac Med, St Vincents Clin Sch, Sydney, NSW 2010, Australia
Geng, Fan-Suo
Affiliations:
Garvan Inst Med Res, Genom & Epigenet Div, Sydney, NSW 2010, Australia
Univ New South Wales, Fac Med, St Vincents Clin Sch, Sydney, NSW 2010, Australia
de Mendoza, Alex
Affiliations:
Queen Mary Univ London, Sch Biol & Chem Sci, London E1 4NS, England
Bogdanovic, Ozren
Affiliations:
Garvan Inst Med Res, Genom & Epigenet Div, Sydney, NSW 2010, Australia
Univ New South Wales, Sch Biotechnol & Biomol Sci, Sydney, NSW 2052, Australia
Affiliations:
Univ Calif San Diego, Bioinformat Program, La Jolla, CA 92093 USA
Salk Inst Biol Studies, Genom Anal Lab, La Jolla, CA 92037 USA
He, Yupeng
Ecker, Joseph R.
Affiliations:
Salk Inst Biol Studies, Genom Anal Lab, La Jolla, CA 92037 USA
Salk Inst Biol Studies, Howard Hughes Med Inst, La Jolla, CA 92037 USA
Annual Review of Genomics and Human Genetics, Vol 16, 2015, 16: 55-77