Robust Grouped Variable Selection Using Distributionally Robust Optimization

被引:0
|
作者
Ruidi Chen
Ioannis Ch. Paschalidis
机构
[1] Boston University,
关键词
Regression; Classification; Grouped LASSO; Wasserstein metric; Spectral clustering; 90C47; 62J07; 62J12;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a distributionally robust optimization formulation with a Wasserstein-based uncertainty set for selecting grouped variables under perturbations on the data for both linear regression and classification problems. The resulting model offers robustness explanations for grouped least absolute shrinkage and selection operator algorithms and highlights the connection between robustness and regularization. We prove probabilistic bounds on the out-of-sample loss and the estimation bias, and establish the grouping effect of our estimator, showing that coefficients in the same group converge to the same value as the sample correlation between covariates approaches 1. Based on this result, we propose to use the spectral clustering algorithm with the Gaussian similarity function to perform grouping on the predictors, which makes our approach applicable without knowing the grouping structure a priori. We compare our approach to an array of alternatives and provide extensive numerical results on both synthetic data and a real large dataset of surgery-related medical records, showing that our formulation produces an interpretable and parsimonious model that encourages sparsity at a group level and is able to achieve better prediction and estimation performance in the presence of outliers.
引用
收藏
页码:1042 / 1071
页数:29
相关论文
共 50 条
  • [1] Robust Grouped Variable Selection Using Distributionally Robust Optimization
    Chen, Ruidi
    Paschalidis, Ioannis Ch
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2022, 194 (03) : 1042 - 1071
  • [2] Distributionally Robust Selection of the Best
    Fan, Weiwei
    Hong, L. Jeff
    Zhang, Xiaowei
    [J]. MANAGEMENT SCIENCE, 2020, 66 (01) : 190 - 208
  • [3] Adaptive Distributionally Robust Optimization
    Bertsimas, Dimitris
    Sim, Melvyn
    Zhang, Meilin
    [J]. MANAGEMENT SCIENCE, 2019, 65 (02) : 604 - 618
  • [4] BAYESIAN DISTRIBUTIONALLY ROBUST OPTIMIZATION
    Shapiro, Alexander
    Zhou, Enlu
    Lin, Yifan
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2023, 33 (02) : 1279 - 1304
  • [5] Distributionally Robust Portfolio Optimization
    Bardakci, I. E.
    Lagoa, C. M.
    [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 1526 - 1531
  • [6] Distributionally Robust Bayesian Optimization
    Kirschner, Johannes
    Bogunovic, Ilija
    Jegelka, Stefanie
    Krause, Andreas
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1921 - 1930
  • [7] Distributionally Robust Convex Optimization
    Wiesemann, Wolfram
    Kuhn, Daniel
    Sim, Melvyn
    [J]. OPERATIONS RESEARCH, 2014, 62 (06) : 1358 - 1376
  • [8] DISTRIBUTIONALLY ROBUST SPARSE PORTFOLIO SELECTION
    Sheng, Xiwen
    Zhang, Beibei
    Cheng, Yonghui
    Luan, Dongqing
    Ji, Ying
    [J]. MATHEMATICAL FOUNDATIONS OF COMPUTING, 2023,
  • [9] Regularization for Wasserstein distributionally robust optimization
    Azizian, Waiss
    Iutzeler, Franck
    Malick, Jerome
    [J]. ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2023, 29
  • [10] A framework of distributionally robust possibilistic optimization
    Guillaume, Romain
    Kasperski, Adam
    Zielinski, Pawel
    [J]. FUZZY OPTIMIZATION AND DECISION MAKING, 2024, 23 (02) : 253 - 278