Penalized logistic regression for high-dimensional DNA methylation data with case-control studies

被引:74
|
作者
Sun, Hokeun [1 ]
Wang, Shuang [1 ]
机构
[1] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY 10032 USA
关键词
VARIABLE SELECTION; REGULARIZATION; LASSO;
D O I
10.1093/bioinformatics/bts145
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Results: Using simulation studies we demonstrated that the proposed procedure outperforms existing main-stream regularization methods such as lasso and elastic-net when data is correlated within a group. We also applied our method to identify important CpG sites and corresponding genes for ovarian cancer from over 20 000 CpGs generated from Illumina Infinium HumanMethylation27K Beadchip. Some genes identified are potentially associated with cancers.
引用
收藏
页码:1368 / 1375
页数:8
相关论文
共 50 条
  • [21] Targeted Inference Involving High-Dimensional Data Using Nuisance Penalized Regression
    Sun, Qiang
    Zhang, Heping
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (535) : 1472 - 1486
  • [22] Coordinate ascent for penalized semiparametric regression on high-dimensional panel count data
    Wu, Tong Tong
    He, Xin
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) : 25 - 33
  • [23] PENALIZED LINEAR REGRESSION WITH HIGH-DIMENSIONAL PAIRWISE SCREENING
    Gong, Siliang
    Zhang, Kai
    Liu, Yufeng
    STATISTICA SINICA, 2021, 31 (01) : 391 - 420
  • [24] ADMM for High-Dimensional Sparse Penalized Quantile Regression
    Gu, Yuwen
    Fan, Jun
    Kong, Lingchen
    Ma, Shiqian
    Zou, Hui
    TECHNOMETRICS, 2018, 60 (03) : 319 - 331
  • [25] High-Dimensional Classification by Sparse Logistic Regression
    Abramovich, Felix
    Grinshtein, Vadim
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (05) : 3068 - 3079
  • [26] The Impact of Regularization on High-dimensional Logistic Regression
    Salehi, Fariborz
    Abbasi, Ehsan
    Hassibi, Babak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [27] USING OF STRATIFICATION AND THE LOGISTIC-REGRESSION MODEL IN THE ANALYSIS OF DATA OF CASE-CONTROL STUDIES
    GIMENO, SGA
    DESOUZA, JMP
    REVISTA DE SAUDE PUBLICA, 1995, 29 (04): : 283 - 289
  • [28] Fitting logistic regression models with contaminated case-control data
    Cheng, K. F.
    Chen, L. C.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2006, 136 (12) : 4147 - 4160
  • [29] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Shi, Yue Yong
    Jiao, Yu Ling
    Cao, Yong Xiu
    Liu, Yan Yan
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2018, 34 (12) : 1892 - 1906
  • [30] SCAD-penalized quantile regression for high-dimensional data analysis and variable selection
    Amin, Muhammad
    Song, Lixin
    Thorlie, Milton Abdul
    Wang, Xiaoguang
    STATISTICA NEERLANDICA, 2015, 69 (03) : 212 - 235