Model selection in high-dimensional noisy data: a simulation study

被引:2
|
作者
Romeo, Giovanni [1 ]
Thoresen, Magne [1 ]
机构
[1] Univ Oslo, Oslo Ctr Biostat & Epidemiol, Dept Biostat, Oslo, Norway
关键词
Measurement error; high-dimensional regression; lasso; matrix uncertainty selector; convex conditional lasso; variable selection; MEASUREMENT ERROR; LASSO;
D O I
10.1080/00949655.2019.1607345
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In many practical applications, high-dimensional regression analyses have to take into account measurement error in the covariates. It is thus necessary to extend regularization methods, that can handle the situation where the number of covariates p largely exceed the sample size n, to the case in which covariates are also mismeasured. A variety of methods are available in this context, but many of them rely on knowledge about the measurement error and the structure of its covariance matrix. In this paper, we set the goal to compare some of these methods, focusing on situations relevant for practical applications. In particular, we will evaluate these methods in setups in which the measurement error distribution and dependence structure are not known and have to be estimated from data. Our focus is on variable selection, and the evaluation is based on extensive simulations.
引用
收藏
页码:2031 / 2050
页数:20
相关论文
共 50 条
  • [1] Model Selection for High-Dimensional Data
    Owrang, Arash
    Jansson, Magnus
    [J]. 2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 606 - 609
  • [2] A Robust Supervised Variable Selection for Noisy High-Dimensional Data
    Kalina, Jan
    Schlenker, Anna
    [J]. BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [3] Simultaneous Feature and Model Selection for High-Dimensional Data
    Perolini, Alessandro
    Guerif, Sebastien
    [J]. 2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 47 - 50
  • [4] Clustering High-Dimensional Noisy Categorical Data
    Tian, Zhiyi
    Xu, Jiaming
    Tang, Jen
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024,
  • [5] Geometric classifiers for high-dimensional noisy data
    Ishii, Aki
    Yata, Kazuyoshi
    Aoshima, Makoto
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 188
  • [6] Variable Selection Methods in High-dimensional RegressionA Simulation Study
    Shahriari, Shirin
    Faria, Susana
    Goncalves, A. Manuela
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (10) : 2548 - 2561
  • [7] Improved Model for Attribute Selection on High-Dimensional Economic Data
    Somol, Petr
    Pudil, Pavel
    Castek, Ondrej
    Pokorna, Jana
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON MANAGEMENT, LEADERSHIP AND GOVERNANCE (ICMLG 2014), 2014, : 276 - 285
  • [8] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [9] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [10] Adaptive Elastic Net Based on Modified PSO for Variable Selection in Cox Model With High-Dimensional Data: A Comprehensive Simulation Study
    Sancar, Nuriye
    Onakpojeruo, Efe Precious
    Inan, Deniz
    Ozsahin, Dilber Uzun
    [J]. IEEE ACCESS, 2023, 11 : 127302 - 127316