Variable Selection via SCAD-Penalized Quantile Regression for High-Dimensional Count Data

被引:0
|
作者
Khan, Dost Muhammad [1 ]
Yaqoob, Anum [2 ]
Iqbal, Nadeem [3 ]
Wahid, Abdul [1 ]
Khalil, Umair [1 ]
Khan, Mukhtaj [3 ]
Abd Rahman, Mohd Amiruddin [4 ]
Mustafa, Mohd Shafie [5 ]
Khan, Zardad [1 ]
机构
[1] Abdul Wali Khan Univ Mardan, Dept Stat, Mardan 23200, Pakistan
[2] Allama Iqbal Open Univ, Dept Stat, Islamabad 44000, Pakistan
[3] Abdul Wali Khan Univ Mardan, Dept Comp Sci, Mardan 23200, Pakistan
[4] Univ Putra Malaysia, Dept Phys, Seri Kembangan 43400, Malaysia
[5] Univ Putra Malaysia, Dept Math, Seri Kembangan 43400, Malaysia
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Input variables; Data models; Biological system modeling; Adaptation models; Mathematical model; Fans; Estimation; Count data; high dimensional; jittering; quantile regression; variable selection; zeroinflated; SHRINKAGE; LIKELIHOOD; LASSO;
D O I
10.1109/ACCESS.2019.2948278
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article introduces a quantile penalized regression technique for variable selection and estimation of conditional quantiles of counts in sparse high-dimensional models. The direct estimation and variable selection of the quantile regression is not feasible due to the discreteness of the count data and non-differentiability of the objective function, therefore, some smoothness must be artificially imposed on the problem. To achieve the necessary smoothness, we use the Jittering process by adding a uniformly distributed noise to the response count variable. The proposed method is compared with the existing penalized regression methods in terms of prediction accuracy and variable selection. We compare the proposed approach in zero-inflated count data regression models and in the presence of outliers. The performance and implementation of the proposed method are illustrated by detailed simulation studies and real data applications.
引用
收藏
页码:153205 / 153216
页数:12
相关论文
共 50 条
  • [41] Penalized Gaussian Process Regression and Classification for High-Dimensional Nonlinear Data
    Yi, G.
    Shi, J. Q.
    Choi, T.
    [J]. BIOMETRICS, 2011, 67 (04) : 1285 - 1294
  • [42] Consistent tuning parameter selection in high-dimensional group-penalized regression
    Yaguang Li
    Yaohua Wu
    Baisuo Jin
    [J]. Science China Mathematics, 2019, 62 (04) : 751 - 770
  • [43] Consistent tuning parameter selection in high-dimensional group-penalized regression
    Yaguang Li
    Yaohua Wu
    Baisuo Jin
    [J]. Science China Mathematics, 2019, 62 : 751 - 770
  • [44] Consistent tuning parameter selection in high-dimensional group-penalized regression
    Li, Yaguang
    Wu, Yaohua
    Jin, Baisuo
    [J]. SCIENCE CHINA-MATHEMATICS, 2019, 62 (04) : 751 - 770
  • [45] Variable selection for ultra-high dimensional quantile regression with missing data and measurement error
    Bai, Yongxin
    Tian, Maozai
    Tang, Man-Lai
    Lee, Wing-Yan
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (01) : 129 - 150
  • [46] Bayesian quantile regression and variable selection for count data with an application to Youth Fitness Survey
    Lv, Jing
    Fu, Yingzi
    [J]. PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2016, : 14 - 18
  • [47] Consistent significance controlled variable selection in high-dimensional regression
    Zambom, Adriano Zanin
    Kim, Jongwook
    [J]. STAT, 2018, 7 (01):
  • [48] High-Dimensional Regression and Variable Selection Using CAR Scores
    Zuber, Verena
    Strimmer, Korbinian
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
  • [49] Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection
    Linero, Antonio R.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 626 - 636
  • [50] FACTOR MODELS AND VARIABLE SELECTION IN HIGH-DIMENSIONAL REGRESSION ANALYSIS
    Kneip, Alois
    Sarda, Pascal
    [J]. ANNALS OF STATISTICS, 2011, 39 (05): : 2410 - 2447