Tilted Correlation Screening Learning in High-Dimensional Data Analysis

被引:11
|
作者
Lin, Bingqing [1 ]
Pang, Zhen [1 ]
机构
[1] Nanyang Technol Univ, Sch Phys & Math Sci, Div Math Sci, Singapore 637371, Singapore
关键词
Bootstrap; Model averaging; TCS algorithm; Variable selection; VARIABLE SELECTION; MODEL SELECTION; REGRESSION; REGULARIZATION;
D O I
10.1080/10618600.2013.792266
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Statistical inference can be over optimistic and even misleading based on a selected model due to the uncertainty of the model selection procedure, especially in the high-dimensional data analysis. In this article, we propose a bootstrap-based tilted correlation screening learning (TCSL) algorithm to alleviate this uncertainty. The algorithm is inspired by the recently proposed variable selection method, TCS algorithm, which screens variables via tilted correlation. Our algorithm can reduce the prediction error and make the interpretation more reliable. The other gain of our algorithm is the reduced computational cost compared with the TCS algorithm when the dimension is large. Extensive simulation examples and the analysis of one real dataset are conducted to exhibit the good performance of our algorithm. Supplementary materials for this article are available online.
引用
收藏
页码:478 / 496
页数:19
相关论文
共 50 条
  • [1] Learning high-dimensional data
    Verleysen, M
    LIMITATIONS AND FUTURE TRENDS IN NEURAL COMPUTATION, 2003, 186 : 141 - 162
  • [2] Feature Screening for High-Dimensional Survival Data via Censored Quantile Correlation
    XU Kai
    HUANG Xudong
    JournalofSystemsScience&Complexity, 2021, 34 (03) : 1207 - 1224
  • [3] Feature Screening for High-Dimensional Survival Data via Censored Quantile Correlation
    Kai Xu
    Xudong Huang
    Journal of Systems Science and Complexity, 2021, 34 : 1207 - 1224
  • [4] Feature Screening for High-Dimensional Survival Data via Censored Quantile Correlation
    Xu, Kai
    Huang, Xudong
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (03) : 1207 - 1224
  • [5] Dynamic tilted current correlation for high dimensional variable screening
    Zhao, Bangxin
    Liu, Xin
    He, Wenqing
    Yi, Grace Y.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 182
  • [6] Learning high-dimensional multimedia data
    Xiaofeng Zhu
    Zhi Jin
    Rongrong Ji
    Multimedia Systems, 2017, 23 : 281 - 283
  • [7] Learning to visualise high-dimensional data
    Ahmad, K
    Vrusias, B
    EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION, PROCEEDINGS, 2004, : 507 - 512
  • [8] Learning high-dimensional multimedia data
    Zhu, Xiaofeng
    Jin, Zhi
    Ji, Rongrong
    MULTIMEDIA SYSTEMS, 2017, 23 (03) : 281 - 283
  • [9] Fast Robust Correlation for High-Dimensional Data
    Raymaekers, Jakob
    Rousseeuw, Peter J.
    TECHNOMETRICS, 2021, 63 (02) : 184 - 198
  • [10] Efficient Learning on High-dimensional Operational Data
    Samani, Forough Shahab
    Zhang, Hongyi
    Stadler, Rolf
    2019 15TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2019,