High-dimensional sign-constrained feature selection and grouping

被引:0
|
作者
Shanshan Qin
Hao Ding
Yuehua Wu
Feng Liu
机构
[1] York University,Department of Mathematics and Statistics
[2] University of Technology Sydney,Australian Artificial Intelligence Institute
关键词
Difference convex programming; Feature grouping; Feature selection; High-dimensional; Non-negative;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a non-negative feature selection/feature grouping (nnFSG) method for general sign-constrained high-dimensional regression problems that allows regression coefficients to be disjointly homogeneous, with sparsity as a special case. To solve the resulting non-convex optimization problem, we provide an algorithm that incorporates the difference of convex programming, augmented Lagrange and coordinate descent methods. Furthermore, we show that the aforementioned nnFSG method recovers the oracle estimate consistently, and that the mean-squared errors are bounded. Additionally, we examine the performance of our method using finite sample simulations and applying it to a real protein mass spectrum dataset.
引用
收藏
页码:787 / 819
页数:32
相关论文
共 50 条
  • [1] High-dimensional sign-constrained feature selection and grouping
    Qin, Shanshan
    Ding, Hao
    Wu, Yuehua
    Liu, Feng
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (04) : 787 - 819
  • [2] Sign-constrained least squares estimation for high-dimensional regression
    Meinshausen, Nicolai
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 1607 - 1631
  • [3] High-dimensional feature selection via feature grouping: A Variable Neighborhood Search approach
    Garcia-Torres, Miguel
    Gomez-Vela, Francisco
    Melian-Batista, Belen
    Marcos Moreno-Vega, J.
    [J]. INFORMATION SCIENCES, 2016, 326 : 102 - 118
  • [4] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [5] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [6] Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data
    Aydore, Sergul
    Thirion, Bertrand
    Varoquaux, Gael
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    [J]. NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [8] Feature selection for high-dimensional data in astronomy
    Zheng, Hongwen
    Zhang, Yanxia
    [J]. ADVANCES IN SPACE RESEARCH, 2008, 41 (12) : 1960 - 1964
  • [9] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    [J]. NEUROCOMPUTING, 2013, 105 : 3 - 11
  • [10] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    [J]. JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17