A MODEL OF DOUBLE DESCENT FOR HIGH-DIMENSIONAL LOGISTIC REGRESSION

被引:0
|
作者
Deng, Zeyu [1 ]
Kammoun, Abla [2 ]
Thrampoulidis, Christos [1 ]
机构
[1] Univ Calif Santa Barbara, Santa Barbara, CA 93106 USA
[2] King Abdullah Univ Sci & Technol, Thuwal, Saudi Arabia
关键词
Generalization error; Binary Classification; Overparameterization; Max-margin; Asymptotics;
D O I
10.1109/icassp40776.2020.9053524
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider a model for logistic regression where only a subset of features of size p is used for training a linear classifier over n training samples. The classifier is obtained by running gradient-descent (GD) on the logistic-loss. For this model, we investigate the dependence of the classification error on the overparameterization ratio kappa = p/n. First, building on known deterministic results on convergence properties of the GD, we uncover a phase-transition phenomenon for the case of Gaussian features: the classification error of GD is the same as that of the maximum-likelihood (ML) solution when kappa < kappa(star), and that of the max-margin (SVM) solution when kappa < kappa(star). Next, using the convex Gaussian min-max theorem (CGMT), we sharply characterize the performance of both the ML and SVM solutions. Combining these results, we obtain curves that explicitly characterize the test error of GD for varying values of kappa. The numerical results validate the theoretical predictions and unveil "double-descent" phenomena that complement similar recent observations in linear regression settings.
引用
收藏
页码:4267 / 4271
页数:5
相关论文
共 50 条
  • [1] High-Dimensional Analysis of Double Descent for Linear Regression with Random Projections
    Bach, Francis
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2024, 6 (01): : 26 - 50
  • [2] A model of double descent for high-dimensional binary linear classification
    Deng, Zeyu
    Kammoun, Abla
    Thrampoulidis, Christos
    [J]. INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2022, 11 (02) : 435 - 495
  • [3] High-Dimensional Classification by Sparse Logistic Regression
    Abramovich, Felix
    Grinshtein, Vadim
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (05) : 3068 - 3079
  • [4] The Impact of Regularization on High-dimensional Logistic Regression
    Salehi, Fariborz
    Abbasi, Ehsan
    Hassibi, Babak
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [5] Debiased inference for heterogeneous subpopulations in a high-dimensional logistic regression model
    Hyunjin Kim
    Eun Ryung Lee
    Seyoung Park
    [J]. Scientific Reports, 13
  • [6] Debiased inference for heterogeneous subpopulations in a high-dimensional logistic regression model
    Kim, Hyunjin
    Lee, Eun Ryung
    Park, Seyoung
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] Inference for the case probability in high-dimensional logistic regression
    Guo, Zijian
    Rakshit, Prabrisha
    Herman, Daniel S.
    Chen, Jinbo
    [J]. Journal of Machine Learning Research, 2021, 22
  • [8] Weak Signals in High-Dimensional Logistic Regression Models
    Reangsephet, Orawan
    Lisawadi, Supranee
    Ahmed, Syed Ejaz
    [J]. PROCEEDINGS OF THE THIRTEENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, VOL 1, 2020, 1001 : 121 - 133
  • [9] Improving Penalized Logistic Regression Model with Missing Values in High-Dimensional Data
    Alharthi, Aiedh Mrisi
    Lee, Muhammad Hisyam
    Algamal, Zakariya Yahya
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2022, 18 (02) : 40 - 54
  • [10] Robust adaptive LASSO in high-dimensional logistic regression
    Basu, Ayanendranath
    Ghosh, Abhik
    Jaenada, Maria
    Pardo, Leandro
    [J]. STATISTICAL METHODS AND APPLICATIONS, 2024,