Penalized logistic regression for high-dimensional DNA methylation data with case-control studies

被引:74
|
作者
Sun, Hokeun [1 ]
Wang, Shuang [1 ]
机构
[1] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY 10032 USA
关键词
VARIABLE SELECTION; REGULARIZATION; LASSO;
D O I
10.1093/bioinformatics/bts145
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Results: Using simulation studies we demonstrated that the proposed procedure outperforms existing main-stream regularization methods such as lasso and elastic-net when data is correlated within a group. We also applied our method to identify important CpG sites and corresponding genes for ovarian cancer from over 20 000 CpGs generated from Illumina Infinium HumanMethylation27K Beadchip. Some genes identified are potentially associated with cancers.
引用
收藏
页码:1368 / 1375
页数:8
相关论文
共 50 条
  • [31] Performance Comparison of Penalized Regression Methods in Poisson Regression under High-Dimensional Sparse Data with Multicollinearity
    Choosawat, Chutikarn
    Reangsephet, Orawan
    Srisuradetchai, Patchanok
    Lisawadi, Supranee
    THAILAND STATISTICIAN, 2020, 18 (03): : 306 - 318
  • [32] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Yue Yong SHI
    Yu Ling JIAO
    Yong Xiu CAO
    Yan Yan LIU
    Acta Mathematica Sinica,English Series, 2018, 34 (12) : 1892 - 1906
  • [33] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Yue Yong SHI
    Yu Ling JIAO
    Yong Xiu CAO
    Yan Yan LIU
    Acta Mathematica Sinica, 2018, 34 (12) : 1892 - 1906
  • [34] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Yue Yong Shi
    Yu Ling Jiao
    Yong Xiu Cao
    Yan Yan Liu
    Acta Mathematica Sinica, English Series, 2018, 34 : 1892 - 1906
  • [35] Vanishing deviance problem in high-dimensional penalized Cox regression
    Yao, Sijie
    Li, Tingyi
    Cao, Biwei
    Wang, Xuefeng
    CANCER RESEARCH, 2023, 83 (07)
  • [36] High-Dimensional Censored Regression via the Penalized Tobit Likelihood
    Jacobson, Tate
    Zou, Hui
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2024, 42 (01) : 286 - 297
  • [37] Matched Forest: supervised learning for high-dimensional matched case-control studies
    Zadeh, Nooshin Shomal
    Lin, Sangdi
    Runger, George C.
    BIOINFORMATICS, 2020, 36 (05) : 1570 - 1576
  • [38] Semi-Supervised Factored Logistic Regression for High-Dimensional Neuroimaging Data
    Bzdok, Danilo
    Eickenberg, Michael
    Grisel, Olivier
    Thirion, Bertrand
    Varoquaux, Gael
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [39] Using principal components for estimating logistic regression with high-dimensional multicollinear data
    Aguilera, AM
    Escabias, M
    Valderrama, MJ
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (08) : 1905 - 1924
  • [40] A Note on Penalized Regression Spline Estimation in the Secondary Analysis of Case-Control Data
    Gazioglu S.
    Wei J.
    Jennings E.M.
    Carroll R.J.
    Statistics in Biosciences, 2013, 5 (2) : 250 - 260