Kernel Density Estimation Based on the Distinct Units in Sampling with Replacement

被引:1
|
作者
Mostafa, Sayed A. [1 ]
Ahmad, Ibrahim A. [2 ]
机构
[1] North Carolina A&T State Univ, Greensboro, NC 27405 USA
[2] Oklahoma State Univ, Dept Stat, Stillwater, OK 74078 USA
关键词
Distinct units; kernel density estimation; kernel regression; random sample size; sampling with; without replacement; DESIGN;
D O I
10.1007/s13571-019-00223-9
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper considers the problem of estimating density functions using the kernel method based on the set of distinct units in sampling with replacement. Using a combined design-model-based inference framework, which accounts for the underlying superpopulation model as well as the randomization distribution induced by the sampling design, we derive asymptotic expressions for the bias and integrated mean squared error (MISE) of a Parzen-Rosenblatt-type kernel density estimator (KDE) based on the distinct units from sampling with replacement. We also prove the asymptotic normality of the distinct units KDE under both design-based and combined inference frameworks. Additionally, we give the asymptotic MISE formulas of several alternative estimators including the estimator based on the full with-replacement sample and estimators based on without-replacement sampling of similar cost. Using the MISE expressions, we discuss how the various estimators compare asymptotically. Moreover, we use Mote Carlo simulations to investigate the finite sample properties of these estimators. Our simulation results show that the distinct units KDE and the without-replacement KDEs perform similarly but are all always superior to the full with-replacement sample KDE. Furthermore, we briefly discuss a Nadaraya-Watson-type kernel regression estimator based on the distinct units from sampling with replacement, derive its MSE under the combined inference framework, and demonstrate its finite sample properties using a small simulation study. Finally, we extend the distinct units density and regression estimators to the case of two-stage sampling with replacement.
引用
收藏
页码:507 / 547
页数:41
相关论文
共 50 条
  • [1] Kernel Density Estimation Based on the Distinct Units in Sampling with Replacement
    Sayed A. Mostafa
    Ibrahim A. Ahmad
    [J]. Sankhya B, 2021, 83 : 507 - 547
  • [2] AVERAGING OVER DISTINCT UNITS IN SAMPLING WITH REPLACEMENT
    KORWAR, RM
    SERFLING, RJ
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (06): : 2132 - &
  • [3] ON AVERAGING OVER DISTINCT UNITS IN SAMPLING WITH REPLACEMENT
    SINHA, BK
    SEN, PK
    [J]. SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 1989, 51 : 65 - 83
  • [4] Kernel density estimation based sampling for imbalanced class distribution
    Kamalov, Firuz
    [J]. INFORMATION SCIENCES, 2020, 512 : 1192 - 1201
  • [5] On kernel density estimation based on different stratified sampling with optimal allocation
    Samawi, Hani
    Chatterjee, Arpita
    Yin, JingJing
    Rochani, Haresh
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (22) : 10973 - 10990
  • [6] A NOTE ON THE COMPARISON BETWEEN SIMPLE MEAN AND MEAN BASED ON DISTINCT UNITS IN SAMPLING WITH REPLACEMENT
    ASOK, C
    [J]. AMERICAN STATISTICIAN, 1980, 34 (03): : 158 - 158
  • [7] Load Sampling for SCUC Based on Principal Component Analysis and Kernel Density Estimation
    Lu, Dan
    Bao, Zhen
    Li, Zuyi
    [J]. 2016 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PESGM), 2016,
  • [8] A Kernel Density Estimation-Based Variation Sampling for Class Imbalance in Defect Prediction
    Zhang, Yuqing
    Yan, Xuefeng
    Khan, Arif Ali
    [J]. 2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 1058 - 1065
  • [9] Notes on kernel density based mode estimation using more efficient sampling designs
    Hani Samawi
    Haresh Rochani
    JingJing Yin
    Daniel Linder
    Robert Vogel
    [J]. Computational Statistics, 2018, 33 : 1071 - 1090
  • [10] Notes on kernel density based mode estimation using more efficient sampling designs
    Samawi, Hani
    Rochani, Haresh
    Yin, JingJing
    Linder, Daniel
    Vogel, Robert
    [J]. COMPUTATIONAL STATISTICS, 2018, 33 (02) : 1071 - 1090