MLML2R: an R package for maximum likelihood estimation of DNA methylation and hydroxymethylation proportions

被引:7
|
作者
Kiihl, Samara F. [1 ]
Jose Martinez-Garrido, Maria [2 ]
Domingo-Relloso, Arce [2 ]
Bermudez, Jose [2 ]
Tellez-Plaza, Maria [3 ]
机构
[1] Univ Estadual Campinas, Dept Stat, BR-13083859 Campinas, SP, Brazil
[2] Univ Valencia, Dept Stat & Operat Res, E-46100 Valencia, Spain
[3] Hosp Clin Valencia, Inst Biomed Res, Valencia 46010, Spain
基金
巴西圣保罗研究基金会;
关键词
DNA hydroxymethylation; DNA methylation; Maximum likelihood; OXIDATIVE BISULFITE; BASE-RESOLUTION; 5-HYDROXYMETHYLCYTOSINE; 5-METHYLCYTOSINE;
D O I
10.1515/sagmb-2018-0031
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Accurately measuring epigenetic marks such as 5-methylcytosine (5-mC) and 5-hydroxymethylcytosine (5-hmC) at the single-nucleotide level, requires combining data from DNA processing methods including traditional (BS), oxidative (oxBS) or Tet-Assisted (TAB) bisulfite conversion. We introduce the R package MLML2R, which provides maximum likelihood estimates (MLE) of 5-mC and 5-hmC proportions. While all other available R packages provide 5-mC and 5-hmC MLEs only for the oxBS+BS combination, MLML2R also provides MLE for TAB combinations. For combinations of any two of the methods, we derived the pool-adjacent-violators algorithm (PAVA) exact constrained MLE in analytical form. For the three methods combination, we implemented both the iterative method by Qu et al. [Qu, J., M. Zhou, Q. Song, E. E. Hong and A. D. Smith (2013): "Mlml: consistent simultaneous estimates of dna methylation and hydroxymethylation," Bioinformatics, 29, 2645-2646.], and also a novel non iterative approximation using Lagrange multipliers. The newly proposed non iterative solutions greatly decrease computational time, common bottlenecks when processing high-throughput data. The MLML2R package is flexible as it takes as input both, preprocessed intensities from Infinium Methylation arrays and counts from Next Generation Sequencing technologies. The MLML2R package is freely available at https://CRAN. R-project.org/package=MLML2R.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] frailtypack: An R Package for the Analysis of Correlated Survival Data with Frailty Models Using Penalized Likelihood Estimation or Parametrical Estimation
    Rondeau, Virginie
    Mazroui, Yassin
    Gonzalez, Juan R.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2012, 47 (04): : 1 - 28
  • [32] funtooNorm: an R package for normalization of DNA methylation data when there are multiple cell or tissue types
    Klein, Kathleen Oros
    Grinek, Stepan
    Bernatsky, Sasha
    Bouchard, Luigi
    Ciampi, Antonio
    Colmegna, Ines
    Fortin, Jean-Philippe
    Gao, Long
    Hivert, Marie-France
    Hudson, Marie
    Kobor, Michael S.
    Labbe, Aurelie
    MacIsaac, Julia L.
    Meaney, Michael J.
    Morin, Alexander M.
    O'Donnell, Kieran J.
    Pastinen, Tomi
    Van Ijzendoorn, Marinus H.
    Voisin, Gregory
    Greenwood, Celia M. T.
    [J]. BIOINFORMATICS, 2016, 32 (04) : 593 - 595
  • [33] A New Estimation Algorithm for the GNSS-R Interference Pattern Technique: The Segmented Maximum Likelihood
    Ribot, Miguel Angel
    Botteron, Cyril
    Farine, Pierre-Andre
    [J]. PROCEEDINGS OF THE 28TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2015), 2015, : 3849 - 3858
  • [34] MethParquet: an R package for rapid and efficient DNA methylation association analysis adopting Apache Parquet
    Wang, Ziqing
    Cassidy, Michael
    Wallace, Danielle A.
    Sofer, Tamar
    [J]. BIOINFORMATICS, 2024, 40 (07)
  • [35] Improved GOES-R ABI image navigation and registration using maximum likelihood parameter estimation
    Gibbs, Bruce P.
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2020, 14 (03):
  • [36] Unsupervised mixture estimation via approximate maximum likelihood based on the Cramér - von Mises distance
    Bee M.
    [J]. Computational Statistics and Data Analysis, 2023, 185
  • [37] BSL: An R Package for Efficient Parameter Estimation for Simulation-Based Models via Bayesian Synthetic Likelihood
    An, Ziwen
    South, Leah F.
    Drovandi, Christopher
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2022, 101 (11): : 1 - 33
  • [38] ELMER v.2: an R/Bioconductor package to reconstruct gene regulatory networks from DNA methylation and transcriptome profiles
    Silva, Tiago C.
    Coetzee, Simon G.
    Gull, Nicole
    Yao, Lijing
    Hazelett, Dennis J.
    Noushmehr, Houtan
    Lin, De-Chen
    Berman, Benjamin P.
    [J]. BIOINFORMATICS, 2019, 35 (11) : 1974 - 1977
  • [39] scDeconv: an R package to deconvolve bulk DNA methylation data with scRNA-seq data and paired bulk RNA-DNA methylation data
    Liu, Yu
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)
  • [40] Easy and reliable maximum a posteriori Bayesian estimation of pharmacokinetic parameters with the open-source R package mapbayr
    Le Louedec, Felicien
    Puisset, Florent
    Thomas, Fabienne
    Chatelut, Etienne
    White-Koning, Melanie
    [J]. CPT-PHARMACOMETRICS & SYSTEMS PHARMACOLOGY, 2021, 10 (10): : 1208 - 1220