SOCP relaxation bounds for the optimal subset selection problem applied to robust linear regression

被引:2
|
作者
Flores, Salvador [1 ]
机构
[1] Univ Chile, CNRS UMI 2807, Ctr Modelamiento Matemat, Santiago, Chile
关键词
Global optimization; Integer programming; High breakdown point regression; Branch and bound; Relaxation-linearization technique; TRIMMED SQUARES REGRESSION; ALGORITHM; BRANCH;
D O I
10.1016/j.ejor.2015.04.024
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper deals with the problem of finding the globally optimal subset of h elements from a larger set of n elements in d space dimensions so as to minimize a quadratic criterion, with an special emphasis on applications to computing the Least Trimmed Squares Estimator (LTSE) for robust regression. The computation of the LTSE is a challenging subset selection problem involving a nonlinear program with continuous and binary variables, linked in a highly nonlinear fashion. The selection of a globally optimal subset using the branch and bound (BB) algorithm is limited to problems in very low dimension, typically d,5 5, as the complexity of the problem increases exponentially with d. We introduce a bold pruning strategy in the BB algorithm that results in a significant reduction in computing time, at the price of a negligeable accuracy lost. The novelty of our algorithm is that the bounds at nodes of the BB tree come from pseudo-convexifications derived using a linearization technique with approximate bounds for the nonlinear terms. The approximate bounds are computed solving an auxiliary semidefinite optimization problem. We show through a computational study that our algorithm performs well in a wide set of the most difficult instances of the LTSE problem. (C) 2015 Elsevier B.V. and Association of European Operational Research Societies (EURO) within the International Federation of Operational Research Societies (IFORS). All rights reserved.
引用
收藏
页码:44 / 50
页数:7
相关论文
共 50 条
  • [41] Relaxation for an optimal design problem with linear growth and perimeter penalization
    Carita, Graca
    Zappale, Elvira
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY OF EDINBURGH SECTION A-MATHEMATICS, 2015, 145 (02) : 223 - 268
  • [42] Estimation of optimal backward perturbation bounds for the linear least squares problem
    Rune Karlson
    Bertil Waldén
    [J]. BIT Numerical Mathematics, 1997, 37 : 862 - 869
  • [43] OPTIMAL BACKWARD PERTURBATION BOUNDS FOR THE LINEAR LEAST-SQUARES PROBLEM
    WALDEN, B
    KARLSON, R
    SUN, JG
    [J]. NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 1995, 2 (03) : 271 - 286
  • [44] Estimation of optimal backward perturbation bounds for the linear least squares problem
    Karlson, R
    Walden, B
    [J]. BIT NUMERICAL MATHEMATICS, 1997, 37 (04) : 862 - 869
  • [45] Subset Selection in Linear Regression using Sequentially Normalized Least Squares: Asymptotic Theory
    Maatta, Jussi
    Schmidt, Daniel F.
    Roos, Teemu
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2016, 43 (02) : 382 - 395
  • [46] Variable selection in linear regression models: Choosing the best subset is not always the best choice
    Hanke, Moritz
    Dijkstra, Louis
    Foraita, Ronja
    Didelez, Vanessa
    [J]. BIOMETRICAL JOURNAL, 2023,
  • [47] Subset selection in multiple linear regression models: A hybrid of genetic and simulated annealing algorithms
    Orkcu, H. Hasan
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2013, 219 (23) : 11018 - 11028
  • [48] Variable selection in linear regression models: Choosing the best subset is not always the best choice
    Hanke, Moritz
    Dijkstra, Louis
    Foraita, Ronja
    Didelez, Vanessa
    [J]. BIOMETRICAL JOURNAL, 2024, 66 (01)
  • [49] Robust model selection in linear regression models using information complexity
    Guney, Yesim
    Bozdogan, Hamparsum
    Arslan, Olcay
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2021, 398
  • [50] Efficient and robust optimal design for quantile regression based on linear programming
    Peng, Cheng
    Kouri, Drew P.
    Uryasev, Stan
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 192