pETM: a penalized Exponential Tilt Model for analysis of correlated high-dimensional DNA methylation data

被引:13
|
作者
Sun, Hokeun [1 ]
Wang, Ya [2 ]
Chen, Yong [3 ]
Li, Yun [4 ,5 ,6 ]
Wang, Shuang [2 ]
机构
[1] Pusan Natl Univ, Dept Stat, Busan 609735, South Korea
[2] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY 10032 USA
[3] Univ Penn, Perelman Sch Med, Div Biostat, Philadelphia, PA 19103 USA
[4] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[5] Univ N Carolina, Dept Genet, Chapel Hill, NC 27599 USA
[6] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC 27599 USA
基金
新加坡国家研究基金会;
关键词
OVARIAN-CANCER; REGULARIZATION PATHS; LUNG-CANCER; CELL; GENES; EXPRESSION; IDENTIFICATION; REGRESSION; MARKERS; HYPERMETHYLATION;
D O I
10.1093/bioinformatics/btx064
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: DNA methylation plays an important role in many biological processes and cancer progression. Recent studies have found that there are also differences in methylation variations in different groups other than differences in methylation means. Several methods have been developed that consider both mean and variance signals in order to improve statistical power of detecting differentially methylated loci. Moreover, as methylation levels of neighboring CpG sites are known to be strongly correlated, methods that incorporate correlations have also been developed. We previously developed a network-based penalized logistic regression for correlated methylation data, but only focusing on mean signals. We have also developed a generalized exponential tilt model that captures both mean and variance signals but only examining one CpG site at a time. Results: In this article, we proposed a penalized Exponential Tilt Model (pETM) using network-based regularization that captures both mean and variance signals in DNA methylation data and takes into account the correlations among nearby CpG sites. By combining the strength of the two models we previously developed, we demonstrated the superior power and better performance of the pETM method through simulations and the applications to the 450K DNA methylation array data of the four breast invasive carcinoma cancer subtypes from The Cancer Genome Atlas (TCGA) project. The developed pETM method identifies many cancer-related methylation loci that were missed by our previously developed method that considers correlations among nearby methylation loci but not variance signals.
引用
收藏
页码:1765 / 1772
页数:8
相关论文
共 50 条
  • [41] Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data
    Xie, Benhuai
    Pan, Wei
    Shen, Xiaotong
    BIOINFORMATICS, 2010, 26 (04) : 501 - 508
  • [42] Penalized empirical likelihood for high-dimensional generalized linear models with longitudinal data
    Chen, Xia
    Tan, Xiaoyan
    Yan, Li
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (10) : 1515 - 1531
  • [43] Coordinate ascent for penalized semiparametric regression on high-dimensional panel count data
    Wu, Tong Tong
    He, Xin
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) : 25 - 33
  • [44] Low-dimensional confounder adjustment and high-dimensional penalized estimation for survival analysis
    Xiaochao Xia
    Binyan Jiang
    Jialiang Li
    Wenyang Zhang
    Lifetime Data Analysis, 2016, 22 : 547 - 569
  • [45] HIMA2: high-dimensional mediation analysis and its application in epigenome-wide DNA methylation data
    Perera, Chamila
    Zhang, Haixiang
    Zheng, Yinan
    Hou, Lifang
    Qu, Annie
    Zheng, Cheng
    Xie, Ke
    Liu, Lei
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [46] HIMA2: high-dimensional mediation analysis and its application in epigenome-wide DNA methylation data
    Chamila Perera
    Haixiang Zhang
    Yinan Zheng
    Lifang Hou
    Annie Qu
    Cheng Zheng
    Ke Xie
    Lei Liu
    BMC Bioinformatics, 23
  • [47] Low-dimensional confounder adjustment and high-dimensional penalized estimation for survival analysis
    Xia, Xiaochao
    Jiang, Binyan
    Li, Jialiang
    Zhang, Wenyang
    LIFETIME DATA ANALYSIS, 2016, 22 (04) : 547 - 569
  • [48] Online Variational Bayes Inference for High-Dimensional Correlated Data
    Kabisa, Sylvie
    Dunson, David B.
    Morris, Jeffrey S.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2016, 25 (02) : 426 - 444
  • [49] Estimating and Accounting for Unobserved Covariates in High-Dimensional Correlated Data
    McKennan, Chris
    Nicolae, Dan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (537) : 225 - 236
  • [50] High-dimensional data analysis and visualisation
    Chen, Cathy W. S.
    Lombardo, Rosaria
    Ripamonti, Enrico
    COMPUTATIONAL STATISTICS, 2024, 39 (01) : 1 - 2