pETM: a penalized Exponential Tilt Model for analysis of correlated high-dimensional DNA methylation data

被引:13
|
作者
Sun, Hokeun [1 ]
Wang, Ya [2 ]
Chen, Yong [3 ]
Li, Yun [4 ,5 ,6 ]
Wang, Shuang [2 ]
机构
[1] Pusan Natl Univ, Dept Stat, Busan 609735, South Korea
[2] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY 10032 USA
[3] Univ Penn, Perelman Sch Med, Div Biostat, Philadelphia, PA 19103 USA
[4] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[5] Univ N Carolina, Dept Genet, Chapel Hill, NC 27599 USA
[6] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC 27599 USA
基金
新加坡国家研究基金会;
关键词
OVARIAN-CANCER; REGULARIZATION PATHS; LUNG-CANCER; CELL; GENES; EXPRESSION; IDENTIFICATION; REGRESSION; MARKERS; HYPERMETHYLATION;
D O I
10.1093/bioinformatics/btx064
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: DNA methylation plays an important role in many biological processes and cancer progression. Recent studies have found that there are also differences in methylation variations in different groups other than differences in methylation means. Several methods have been developed that consider both mean and variance signals in order to improve statistical power of detecting differentially methylated loci. Moreover, as methylation levels of neighboring CpG sites are known to be strongly correlated, methods that incorporate correlations have also been developed. We previously developed a network-based penalized logistic regression for correlated methylation data, but only focusing on mean signals. We have also developed a generalized exponential tilt model that captures both mean and variance signals but only examining one CpG site at a time. Results: In this article, we proposed a penalized Exponential Tilt Model (pETM) using network-based regularization that captures both mean and variance signals in DNA methylation data and takes into account the correlations among nearby CpG sites. By combining the strength of the two models we previously developed, we demonstrated the superior power and better performance of the pETM method through simulations and the applications to the 450K DNA methylation array data of the four breast invasive carcinoma cancer subtypes from The Cancer Genome Atlas (TCGA) project. The developed pETM method identifies many cancer-related methylation loci that were missed by our previously developed method that considers correlations among nearby methylation loci but not variance signals.
引用
收藏
页码:1765 / 1772
页数:8
相关论文
共 50 条
  • [21] Penalized Cox's proportional hazards model for high-dimensional survival data with grouped predictors
    Dang, Xuan
    Huang, Shuai
    Qian, Xiaoning
    STATISTICS AND COMPUTING, 2021, 31 (06)
  • [22] Penalized weighted smoothed quantile regression for high-dimensional longitudinal data
    Song, Yanan
    Han, Haohui
    Fu, Liya
    Wang, Ting
    STATISTICS IN MEDICINE, 2024, 43 (10) : 2007 - 2042
  • [23] Penalized generalized empirical likelihood in high-dimensional weakly dependent data
    Zhang, Jia
    Shi, Haoming
    Tian, Lemeng
    Xiao, Fengjun
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 270 - 283
  • [24] Penalized Gaussian Process Regression and Classification for High-Dimensional Nonlinear Data
    Yi, G.
    Shi, J. Q.
    Choi, T.
    BIOMETRICS, 2011, 67 (04) : 1285 - 1294
  • [25] A penalized linear mixed model with generalized method of moments for prediction analysis on high-dimensional multi-omics data
    Wang, Xiaqiong
    Wen, Yalu
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)
  • [26] Exponential synchronization for nonidentical high-dimensional Kuramoto model
    Wei, Xinmiao
    Peng, Shanshan
    Zhu, Jiandong
    SYSTEMS & CONTROL LETTERS, 2023, 177
  • [27] Network-based regularization for matched case-control analysis of high-dimensional DNA methylation data
    Sun, Hokeun
    Wang, Shuang
    STATISTICS IN MEDICINE, 2013, 32 (12) : 2127 - 2139
  • [28] On the penalized maximum likelihood estimation of high-dimensional approximate factor model
    Wang, Shaoxin
    Yang, Hu
    Yao, Chaoli
    COMPUTATIONAL STATISTICS, 2019, 34 (02) : 819 - 846
  • [29] On the penalized maximum likelihood estimation of high-dimensional approximate factor model
    Shaoxin Wang
    Hu Yang
    Chaoli Yao
    Computational Statistics, 2019, 34 : 819 - 846
  • [30] High-dimensional inference for linear model with correlated errors
    Yuan, Panxu
    Guo, Xiao
    METRIKA, 2022, 85 (01) : 21 - 52