Optimal Poisson Subsampling for Softmax Regression

Cited by: 1
Authors
Yao, Yaqiong [1]
Zou, Jiahui [2]
Wang, Haiying [1]
Affiliations
[1] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
[2] Capital Univ Econ & Business, Sch Stat, Beijing 100070, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Multinomial logistic regression; optimality criterion; optimal subsampling; APPROXIMATION; ALGORITHMS; MATRICES;
DOI
10.1007/s11424-023-1179-z
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
Softmax regression, also called multinomial logistic regression, is widely used in various fields for modeling the relationship between covariates and categorical responses with multiple levels. Increasing volumes of data bring new challenges for parameter estimation in softmax regression, and the optimal subsampling method is an effective way to address them. However, optimal subsampling with replacement requires access to all the sampling probabilities simultaneously to draw a subsample, and the resulting subsample may contain duplicate observations. In this paper, the authors consider Poisson subsampling for its higher estimation accuracy and its applicability in scenarios where the data exceed the memory limit. The authors derive the asymptotic properties of the general Poisson subsampling estimator and obtain optimal subsampling probabilities by minimizing the asymptotic variance-covariance matrix under both the A- and L-optimality criteria. The optimal subsampling probabilities contain unknown quantities from the full dataset, so the authors suggest an approximately optimal Poisson subsampling algorithm that contains two sampling steps, with the first step serving as a pilot phase. The authors demonstrate the performance of the proposed optimal Poisson subsampling algorithm through numerical simulations and real data examples.
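The generic Poisson subsampling mechanism mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the `scores` array, the `poisson_subsample` function, and the inverse-probability weights are hypothetical stand-ins, and the scores are not the A- or L-optimal probabilities derived in the paper. For simplicity the normalizing constant is computed from all scores at once, whereas the abstract indicates that such full-data quantities are approximated in a pilot step.

```python
import numpy as np

def poisson_subsample(scores, expected_size, rng):
    """Return indices and weights chosen by Poisson subsampling.

    scores: nonnegative importance scores (hypothetical; NOT the paper's
            A- or L-optimal subsampling probabilities).
    expected_size: target expected subsample size.
    """
    scores = np.asarray(scores, dtype=float)
    # Inclusion probabilities proportional to the scores, capped at 1.
    probs = np.minimum(1.0, expected_size * scores / scores.sum())
    # Each observation is kept or dropped by its own independent coin flip,
    # so the data can be scanned once and no index can appear twice.
    keep = rng.random(scores.shape[0]) < probs
    idx = np.flatnonzero(keep)
    # Inverse-probability weights for a weighted estimator on the subsample.
    weights = 1.0 / probs[idx]
    return idx, weights

rng = np.random.default_rng(0)
scores = rng.random(10_000) + 0.1  # synthetic scores for illustration only
idx, weights = poisson_subsample(scores, expected_size=500, rng=rng)
print(len(idx), len(set(idx.tolist())))  # subsample size is random; no duplicates
```

Because each inclusion decision depends only on that observation's own probability, the subsample size is random with mean close to `expected_size`, and the selection could in principle be applied chunk by chunk to data that do not fit in memory.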
Pages: 1609-1625
Number of pages: 17
Related papers
  • [1] Optimal Poisson Subsampling for Softmax Regression
    YAO Yaqiong
    ZOU Jiahui
    WANG Haiying
    [J]. Journal of Systems Science & Complexity, 2023, 36 (04) : 1609 - 1625
  • [2] Optimal Poisson Subsampling for Softmax Regression
    Yaqiong Yao
    Jiahui Zou
    Haiying Wang
    [J]. Journal of Systems Science and Complexity, 2023, 36 : 1609 - 1625
  • [3] Optimal subsampling for softmax regression
    Yaqiong Yao
    HaiYing Wang
    [J]. Statistical Papers, 2019, 60 : 585 - 599
  • [4] Optimal subsampling for softmax regression
    Yao, Yaqiong
    Wang, HaiYing
    [J]. STATISTICAL PAPERS, 2019, 60 (02) : 235 - 249
  • [5] Model constraints independent optimal subsampling probabilities for softmax regression
    Yao, Yaqiong
    Zou, Jiahui
    Wang, HaiYing
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2023, 225 : 188 - 201
  • [6] Optimal subsampling for softmax regression (vol 45, pg 726, 2019)
    Yao, Yaqiong
    Wang, HaiYing
    [J]. STATISTICAL PAPERS, 2019, 60 (05) : 1801 - 1801
  • [7] Optimal subsampling for functional quantile regression
    Yan, Qian
    Li, Hanyu
    Niu, Chengmei
    [J]. STATISTICAL PAPERS, 2023, 64 (06) : 1943 - 1968
  • [8] Optimal subsampling for functional quantile regression
    Qian Yan
    Hanyu Li
    Chengmei Niu
    [J]. Statistical Papers, 2023, 64 : 1943 - 1968
  • [9] Optimal Subsampling for Large Sample Logistic Regression
    Wang, HaiYing
    Zhu, Rong
    Ma, Ping
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 829 - 844
  • [10] Optimal subsampling for quantile regression in big data
    Wang, Haiying
    Ma, Yanyuan
    [J]. BIOMETRIKA, 2021, 108 (01) : 99 - 112