Adaptive Elastic Net Based on Modified PSO for Variable Selection in Cox Model With High-Dimensional Data: A Comprehensive Simulation Study

被引:1
|
作者
Sancar, Nuriye [1 ]
Onakpojeruo, Efe Precious [2 ]
Inan, Deniz [3 ]
Ozsahin, Dilber Uzun [2 ,4 ,5 ]
机构
[1] Near East Univ, Dept Math, TR-99138 Nicosia, Turkiye
[2] Near East Univ, Operat Res Ctr Healthcare, TR-99138 Nicosia, Northern Cyprus, Turkiye
[3] Marmara Univ, Dept Stat, TR-34722 Istanbul, Turkiye
[4] Univ Sharjah, Coll Hlth Sci, Dept Med Diagnost Imaging, Sharjah, U Arab Emirates
[5] Univ Sharjah, Res Inst Med & Hlth Sci, Sharjah, U Arab Emirates
关键词
Adaptive elastic net; cox model; high-dimensional data; modified particle swarm optimization; variable selection; PARTICLE SWARM OPTIMIZATION; PROPORTIONAL HAZARDS MODEL; PENALIZED LIKELIHOOD; LOGISTIC-REGRESSION; LASSO; REGULARIZATION;
D O I
10.1109/ACCESS.2023.3329386
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In contemporary research, high-dimensional data has become more popular in many scientific fields with the rapid advancement of technology in collecting and storing large datasets. As in any modeling process with high-dimensional data, it is very important to accurately identify a subset of the features and reduce the dimensionality in the Cox modeling process in the case of high-dimensionality. Numerous penalized techniques for the Cox model with high-dimensional data have been developed to handle the multicollinearity problem and decrease variability. Adaptive Elastic-net is one of the penalized methods used for feature selection that both handles the grouping effect and has the oracle property. However, providing these advantageous properties of Adaptive Elastic-net for variable selection in the Cox model depends on the optimal selection of hyperparameters, alpha , and lambda values. For this reason, the appropriate selection of these parameters is quite important. Hyperparameters are generally selected by maximizing k-fold cross-validated log partial likelihood based on grid search over ( alpha , lambda ) for the model. However, this method does not guarantee optimal alpha and lambda values. In grid search, hyperparameters are typically allowed to take values specified in a limited sequence in a grid. The purpose of this study is to propose a novel method to determine the optimum hyperparameters ( alpha , lambda ) pair of Adaptive Elastic-net for variable selection in the Cox model with high dimensional data based on modified particle swarm optimization (MPSO). The introduced metaheuristic-based method has been evaluated by extensive simulation studies by comparing it with different traditional penalized methods using various evaluation criteria under different scenarios. According to the comprehensive simulation study, the proposed method outperforms other penalized methods in terms of both variable selection and prediction and estimation accuracy performance for the Cox model in investigating the high-dimensional data.
引用
收藏
页码:127302 / 127316
页数:15
相关论文
共 50 条
  • [41] Bayesian variable selection in clustering high-dimensional data with substructure
    Michael D. Swartz
    Qianxing Mo
    Mary E. Murphy
    Joanne R. Lupton
    Nancy D. Turner
    Mee Young Hong
    Marina Vannucci
    [J]. Journal of Agricultural, Biological, and Environmental Statistics, 2008, 13 : 407 - 423
  • [42] Sparse Bayesian variable selection for classifying high-dimensional data
    Yang, Aijun
    Lian, Heng
    Jiang, Xuejun
    Liu, Pengfei
    [J]. STATISTICS AND ITS INTERFACE, 2018, 11 (02) : 385 - 395
  • [43] Variable selection and subgroup analysis for high-dimensional censored data
    Zhang, Yu
    Wang, Jiangli
    Zhang, Weiping
    [J]. STATISTICAL THEORY AND RELATED FIELDS, 2024, 8 (03) : 211 - 231
  • [44] Scalable Bayesian variable selection for structured high-dimensional data
    Chang, Changgee
    Kundu, Suprateek
    Long, Qi
    [J]. BIOMETRICS, 2018, 74 (04) : 1372 - 1382
  • [45] High-dimensional variable selection in regression and classification with missing data
    Gao, Qi
    Lee, Thomas C. M.
    [J]. SIGNAL PROCESSING, 2017, 131 : 1 - 7
  • [46] A Robust Supervised Variable Selection for Noisy High-Dimensional Data
    Kalina, Jan
    Schlenker, Anna
    [J]. BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [47] Variable selection for longitudinal data with high-dimensional covariates and dropouts
    Zheng, Xueying
    Fu, Bo
    Zhang, Jiajia
    Qin, Guoyou
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (04) : 712 - 725
  • [48] Estimation and variable selection for high-dimensional spatial data models
    Hou, Li
    Jin, Baisuo
    Wu, Yuehua
    [J]. JOURNAL OF ECONOMETRICS, 2024, 238 (02)
  • [49] Stochastic variational variable selection for high-dimensional microbiome data
    Tung Dang
    Kie Kumaishi
    Erika Usui
    Shungo Kobori
    Takumi Sato
    Yusuke Toda
    Yuji Yamasaki
    Hisashi Tsujimoto
    Yasunori Ichihashi
    Hiroyoshi Iwata
    [J]. Microbiome, 10
  • [50] Ranking-based Feature Selection with Wrapper PSO Search in High-Dimensional Data Classification
    Saw, Thinzar
    Oo, Win Mar
    [J]. IAENG International Journal of Computer Science, 2023, 50 (01)