Adaptive Elastic Net Based on Modified PSO for Variable Selection in Cox Model With High-Dimensional Data: A Comprehensive Simulation Study

被引:1
|
作者
Sancar, Nuriye [1 ]
Onakpojeruo, Efe Precious [2 ]
Inan, Deniz [3 ]
Ozsahin, Dilber Uzun [2 ,4 ,5 ]
机构
[1] Near East Univ, Dept Math, TR-99138 Nicosia, Turkiye
[2] Near East Univ, Operat Res Ctr Healthcare, TR-99138 Nicosia, Northern Cyprus, Turkiye
[3] Marmara Univ, Dept Stat, TR-34722 Istanbul, Turkiye
[4] Univ Sharjah, Coll Hlth Sci, Dept Med Diagnost Imaging, Sharjah, U Arab Emirates
[5] Univ Sharjah, Res Inst Med & Hlth Sci, Sharjah, U Arab Emirates
关键词
Adaptive elastic net; cox model; high-dimensional data; modified particle swarm optimization; variable selection; PARTICLE SWARM OPTIMIZATION; PROPORTIONAL HAZARDS MODEL; PENALIZED LIKELIHOOD; LOGISTIC-REGRESSION; LASSO; REGULARIZATION;
D O I
10.1109/ACCESS.2023.3329386
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In contemporary research, high-dimensional data has become more popular in many scientific fields with the rapid advancement of technology in collecting and storing large datasets. As in any modeling process with high-dimensional data, it is very important to accurately identify a subset of the features and reduce the dimensionality in the Cox modeling process in the case of high-dimensionality. Numerous penalized techniques for the Cox model with high-dimensional data have been developed to handle the multicollinearity problem and decrease variability. Adaptive Elastic-net is one of the penalized methods used for feature selection that both handles the grouping effect and has the oracle property. However, providing these advantageous properties of Adaptive Elastic-net for variable selection in the Cox model depends on the optimal selection of hyperparameters, alpha , and lambda values. For this reason, the appropriate selection of these parameters is quite important. Hyperparameters are generally selected by maximizing k-fold cross-validated log partial likelihood based on grid search over ( alpha , lambda ) for the model. However, this method does not guarantee optimal alpha and lambda values. In grid search, hyperparameters are typically allowed to take values specified in a limited sequence in a grid. The purpose of this study is to propose a novel method to determine the optimum hyperparameters ( alpha , lambda ) pair of Adaptive Elastic-net for variable selection in the Cox model with high dimensional data based on modified particle swarm optimization (MPSO). The introduced metaheuristic-based method has been evaluated by extensive simulation studies by comparing it with different traditional penalized methods using various evaluation criteria under different scenarios. According to the comprehensive simulation study, the proposed method outperforms other penalized methods in terms of both variable selection and prediction and estimation accuracy performance for the Cox model in investigating the high-dimensional data.
引用
收藏
页码:127302 / 127316
页数:15
相关论文
共 50 条
  • [31] Variable selection for survival data with a class of adaptive elastic net techniques
    Md Hasinur Rahaman Khan
    J. Ewart H. Shaw
    [J]. Statistics and Computing, 2016, 26 : 725 - 741
  • [32] Bayesian variable selection in multinomial probit model for classifying high-dimensional data
    Aijun Yang
    Yunxian Li
    Niansheng Tang
    Jinguan Lin
    [J]. Computational Statistics, 2015, 30 : 399 - 418
  • [33] Bayesian variable selection in multinomial probit model for classifying high-dimensional data
    Yang, Aijun
    Li, Yunxian
    Tang, Niansheng
    Lin, Jinguan
    [J]. COMPUTATIONAL STATISTICS, 2015, 30 (02) : 399 - 418
  • [34] Variable selection through adaptive elastic net for proportional odds model
    Wang, Chunxiang
    Li, Nan
    Diao, Hongbin
    Lu, Lanqing
    [J]. JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2024, 7 (01) : 203 - 221
  • [35] Spike-and-slab type variable selection in the Cox proportional hazards model for high-dimensional features
    Wu, Ryan
    Ahn, Mihye
    Yang, Hojin
    [J]. JOURNAL OF APPLIED STATISTICS, 2022, 49 (09) : 2189 - 2207
  • [36] Variable Selection in a Log-Linear Birnbaum-Saunders Regression Model for High-Dimensional Survival Data via the Elastic-Net and Stochastic EM
    Zhang, Yukun
    Lu, Xuewen
    Desmond, Anthony F.
    [J]. TECHNOMETRICS, 2016, 58 (03) : 383 - 392
  • [37] High-dimensional variable selection via low-dimensional adaptive learning
    Staerk, Christian
    Kateri, Maria
    Ntzoufras, Ioannis
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (01): : 830 - 879
  • [38] Variable selection for high-dimensional partly linear additive Cox model with application to Alzheimer's disease
    Wu, Qiwei
    Zhao, Hui
    Zhu, Liang
    Sun, Jianguo
    [J]. STATISTICS IN MEDICINE, 2020, 39 (23) : 3120 - 3134
  • [39] Bayesian Variable Selection in Clustering High-Dimensional Data With Substructure
    Swartz, Michael D.
    Mo, Qianxing
    Murphy, Mary E.
    Lupton, Joanne R.
    Turner, Nancy D.
    Hong, Mee Young
    Vannucci, Marina
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2008, 13 (04) : 407 - 423
  • [40] Stochastic variational variable selection for high-dimensional microbiome data
    Dang, Tung
    Kumaishi, Kie
    Usui, Erika
    Kobori, Shungo
    Sato, Takumi
    Toda, Yusuke
    Yamasaki, Yuji
    Tsujimoto, Hisashi
    Ichihashi, Yasunori
    Iwata, Hiroyoshi
    [J]. MICROBIOME, 2022, 10 (01)