The COR criterion for optimal subset selection in distributed estimation

被引:1
|
作者
Guo, Guangbao [1 ]
Song, Haoyue [1 ]
Zhu, Lixing [2 ]
机构
[1] Shandong Univ Technol, Sch Math & Stat, Zibo, Peoples R China
[2] Beijing Normal Univ, Dept Stat, Zhuhai, Peoples R China
关键词
Distributed data; Optimal subset selection; Distributed estimation; COR criterion;
D O I
10.1007/s11222-024-10471-z
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The problem of selecting an optimal subset in distributed regression is a crucial issue, as each distributed data subset may contain redundant information, which can be attributed to various sources such as outliers, dispersion, inconsistent duplicates, too many independent variables, and excessive data points, among others. Efficient reduction and elimination of this redundancy can help alleviate inconsistency issues for statistical inference. Therefore, it is imperative to track redundancy while measuring and processing data. We develop a criterion for optimal subset selection that is related to Covariance matrices, Observation matrices, and Response vectors (COR). We also derive a novel distributed interval estimation for the proposed criterion and establish the existence of optimal subset length. Finally, numerical experiments are conducted to verify the experimental feasibility of the proposed criterion.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] The COR criterion for optimal subset selection in distributed estimation (vol 34, 163, 2024)
    Guo, Guangbao
    Song, Haoyue
    Zhu, Lixing
    [J]. STATISTICS AND COMPUTING, 2024, 34 (06)
  • [2] LIC criterion for optimal subset selection in distributed interval estimation
    Guo, Guangbao
    Sun, Yue
    Qian, Guoqi
    Wang, Qian
    [J]. JOURNAL OF APPLIED STATISTICS, 2023, 50 (09) : 1900 - 1920
  • [3] Communication-efficient estimation for distributed subset selection
    Yan Chen
    Ruipeng Dong
    Canhong Wen
    [J]. Statistics and Computing, 2023, 33
  • [4] Communication-efficient estimation for distributed subset selection
    Chen, Yan
    Dong, Ruipeng
    Wen, Canhong
    [J]. STATISTICS AND COMPUTING, 2023, 33 (06)
  • [5] LIC: An R package for optimal subset selection for distributed data
    Chang, Di
    Guo, Guangbao
    [J]. SoftwareX, 2024, 28
  • [6] Optimal partitioning in distributed state estimation considering a modified convergence criterion
    Asefi, Sajjad
    Gryazina, Elena
    Leite, Helder
    [J]. 2021 INTERNATIONAL CONFERENCE ON SMART ENERGY SYSTEMS AND TECHNOLOGIES (SEST), 2021,
  • [7] Selection of an optimal subset of sizes
    Jorjani, S
    Scott, CH
    Woodruff, DL
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 1999, 37 (16) : 3697 - 3710
  • [8] Optimization of the AUC criterion for rule subset selection
    Ishida, Celso Y.
    Pozo, Aurora T. R.
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 497 - 502
  • [9] Optimal Statistical Feature Subset Selection for Bearing Fault Detection and Severity Estimation
    Grover, Chhaya
    Turk, Neelam
    [J]. SHOCK AND VIBRATION, 2020, 2020
  • [10] Distributed Column Subset Selection on MapReduce
    Farahat, Ahmed K.
    Elgohary, Ahmed
    Ghodsi, Ali
    Kamel, Mohamed S.
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 171 - 180