The COR criterion for optimal subset selection in distributed estimation

Cited by: 1
Authors
Guo, Guangbao [1]
Song, Haoyue [1]
Zhu, Lixing [2]
Affiliations
[1] Shandong Univ Technol, Sch Math & Stat, Zibo, Peoples R China
[2] Beijing Normal Univ, Dept Stat, Zhuhai, Peoples R China
Keywords
Distributed data; Optimal subset selection; Distributed estimation; COR criterion
DOI
10.1007/s11222-024-10471-z
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Selecting an optimal subset in distributed regression is a crucial problem, because each distributed data subset may contain redundant information arising from sources such as outliers, dispersion, inconsistent duplicates, too many independent variables, and excessive data points. Efficiently reducing and eliminating this redundancy can help alleviate inconsistency issues in statistical inference, so it is imperative to track redundancy while measuring and processing data. We develop a criterion for optimal subset selection that is related to Covariance matrices, Observation matrices, and Response vectors (COR). We also derive a novel distributed interval estimation for the proposed criterion and establish the existence of an optimal subset length. Finally, numerical experiments are conducted to verify the feasibility of the proposed criterion.
Pages: 13