PRISM: methylation pattern-based, reference-free inference of subclonal makeup

Cited by: 7
Authors
Lee, Dohoon [1]
Lee, Sangseon [2]
Kim, Sun [1, 2, 3]
Affiliations
[1] Seoul Natl Univ, Interdisciplinary Program Bioinformat, Seoul 08826, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul 08826, South Korea
[3] Seoul Natl Univ, Bioinformat Inst, Seoul 08826, South Korea
Funding
National Research Foundation, Singapore;
Keywords
TUMOR GENE WT1; DNA METHYLATION; CLONAL EVOLUTION; HETEROGENEITY; DYNAMICS; CANCER; CELLS; CHEMOTHERAPY; METASTASIS; EXPRESSION;
DOI
10.1093/bioinformatics/btz327
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: Characterizing cancer subclones is crucial for the ultimate conquest of cancer. Thus, a number of bioinformatic tools have been developed to infer heterogeneous tumor populations based on genomic signatures such as mutations and copy number variations. Despite accumulating evidence for the significance of global DNA methylation reprogramming in certain cancer types, including myeloid malignancies, none of these tools is designed to exploit subclonally reprogrammed methylation patterns to reveal the constituent populations of a tumor. In accordance with the notion of global methylation reprogramming, our preliminary observations on acute myeloid leukemia (AML) samples implied the existence of subclonally occurring focal methylation aberrance throughout the genome.

Results: We present PRISM, a tool for inferring the composition of epigenetically distinct subclones of a tumor solely from methylation patterns obtained by reduced representation bisulfite sequencing. PRISM adopts DNA methyltransferase 1-like, hidden Markov model-based in silico proofreading to correct erroneous methylation patterns. With error-corrected methylation patterns, PRISM focuses on short individual genomic regions harboring dichotomous patterns, that is, patterns that can be split into fully methylated and fully unmethylated ones. The frequencies of these two patterns form a sufficient statistic for subclonal abundance. The set of statistics collected from all such genomic regions is modeled with a beta-binomial mixture. Fitting the mixture with the expectation-maximization algorithm then provides the inferred composition of subclones. Applying PRISM to two AML samples, we demonstrate that PRISM could infer the evolutionary history of malignant samples from an epigenetic point of view.

Availability and implementation: PRISM is freely available on GitHub (https://github.com/dohlee/prism).

Supplementary information: Supplementary data are available at Bioinformatics online.
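The mixture-fitting step described in the abstract can be illustrated with a minimal sketch. It is not PRISM's implementation. It assumes each region contributes a pair (k, n), where k is the number of fully methylated reads among n dichotomous reads, and it makes two simplifications that are not in the abstract: the beta-binomial concentration `conc` is held fixed, and the M-step for each component mean is a grid search. All function names (`betabinom_logpmf`, `fit_mixture`, `simulate`) and the simulated data are hypothetical.

```python
import math
import random

def log_beta(x, y):
    # Log of the Beta function via log-gamma.
    return math.lgamma(x) + math.lgamma(y) - math.lgamma(x + y)

def betabinom_logpmf(k, n, mu, conc):
    # Beta-binomial log-probability, parameterized by mean mu and
    # concentration conc (alpha = mu * conc, beta = (1 - mu) * conc).
    a, b = mu * conc, (1.0 - mu) * conc
    log_choose = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    return log_choose + log_beta(k + a, n - k + b) - log_beta(a, b)

def fit_mixture(counts, n_components, conc=50.0, n_iter=20):
    # EM for a beta-binomial mixture with fixed concentration (assumption).
    grid = [i / 50 for i in range(1, 50)]  # candidate means 0.02 .. 0.98
    mus = [(j + 1) / (n_components + 1) for j in range(n_components)]
    weights = [1.0 / n_components] * n_components
    for _ in range(n_iter):
        # E-step: responsibility of each component for each region.
        resp = []
        for k, n in counts:
            logs = [math.log(max(w, 1e-300)) + betabinom_logpmf(k, n, mu, conc)
                    for w, mu in zip(weights, mus)]
            top = max(logs)
            probs = [math.exp(v - top) for v in logs]
            total = sum(probs)
            resp.append([p / total for p in probs])
        # M-step: closed-form weights; means by grid search on the
        # responsibility-weighted log-likelihood.
        for j in range(n_components):
            weights[j] = sum(r[j] for r in resp) / len(counts)
            mus[j] = max(grid, key=lambda m: sum(
                r[j] * betabinom_logpmf(k, n, m, conc)
                for r, (k, n) in zip(resp, counts)))
    return sorted(zip(mus, weights))

def simulate(rng, mu, conc, depth, n_regions):
    # Draw (k, n) pairs from a beta-binomial by sampling p, then reads.
    out = []
    for _ in range(n_regions):
        p = rng.betavariate(mu * conc, (1.0 - mu) * conc)
        k = sum(rng.random() < p for _ in range(depth))
        out.append((k, depth))
    return out

rng = random.Random(1)
counts = simulate(rng, 0.2, 50.0, 40, 60) + simulate(rng, 0.6, 50.0, 40, 60)
fit = fit_mixture(counts, 2)
for mu, weight in fit:
    print(f"component mean={mu:.2f} weight={weight:.2f}")
```

In this toy setting, each fitted component mean stands in for the read fraction associated with one group of regions. How such means map to subclone abundances, and how the number of components is chosen, is left to the paper.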
Pages: i520-i529
Page count: 10