Data-Based Optimal Bandwidth for Kernel Density Estimation of Statistical Samples

被引:1
|
作者
李振伟 [1 ,2 ]
何平 [1 ,3 ]
机构
[1] Center for Theoretical Physics and College of Physics,Jilin University
[2] Changchun Observatory,National Astronomical Observatories,CAS
[3] Center for High Energy Physics,Peking University
基金
美国国家科学基金会;
关键词
numerical methods; kernel density estimation; optimal bandwidth; large-scale structure of Universe;
D O I
暂无
中图分类号
P159 [宇宙学];
学科分类号
摘要
It is a common practice to evaluate probability density function or matter spatial density function from statistical samples. Kernel density estimation is a frequently used method, but to select an optimal bandwidth of kernel estimation, which is completely based on data samples, is a long-term issue that has not been well settled so far. There exist analytic formulae of optimal kernel bandwidth, but they cannot be applied directly to data samples,since they depend on the unknown underlying density functions from which the samples are drawn. In this work, we devise an approach to pick out the totally data-based optimal bandwidth. First, we derive correction formulae for the analytic formulae of optimal bandwidth to compute the roughness of the sample’s density function. Then substitute the correction formulae into the analytic formulae for optimal bandwidth, and through iteration we obtain the sample’s optimal bandwidth. Compared with analytic formulae, our approach gives very good results, with relative differences from the analytic formulae being only 2%~3% for sample size larger than 10;. This approach can also be generalized easily to cases of variable kernel estimations.
引用
收藏
页码:728 / 734
页数:7
相关论文
共 50 条
  • [31] Bandwidth selection in kernel density estimation for interval-grouped data
    Reyes, Miguel
    Francisco-Fernandez, Mario
    Cao, Ricardo
    TEST, 2017, 26 (03) : 527 - 545
  • [32] Plug-in Bandwidth Selection for Kernel Density Estimation with Discrete Data
    Chu, Chi-Yang
    Henderson, Daniel J.
    Parmeter, Christopher F.
    ECONOMETRICS, 2015, 3 (02): : 199 - 214
  • [33] Bandwidth selection for kernel density estimation with length-biased data
    Borrajo, M. I.
    Gonzalez-Manteiga, W.
    Martinez-Miranda, M. D.
    JOURNAL OF NONPARAMETRIC STATISTICS, 2017, 29 (03) : 636 - 668
  • [34] Bandwidth selection in kernel density estimation for interval-grouped data
    Miguel Reyes
    Mario Francisco-Fernández
    Ricardo Cao
    TEST, 2017, 26 : 527 - 545
  • [35] Optimal bandwidth selection in kernel density estimation for continuous time dependent processes
    El Heda, Khadijetou
    Louani, Djamal
    STATISTICS & PROBABILITY LETTERS, 2018, 138 : 9 - 19
  • [36] Kernel density estimation for linear processes: Asymptotic normality and optimal bandwidth derivation
    Hallin, M
    Tran, LT
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1996, 48 (03) : 429 - 449
  • [37] DATA-BASED BANDWIDTH SELECTION FOR KERNEL ESTIMATORS OF THE INTEGRAL OF F(2)(X)
    SHEATHER, SJ
    HETTMANSPERGER, TP
    DONALD, MR
    SCANDINAVIAN JOURNAL OF STATISTICS, 1994, 21 (03) : 265 - 275
  • [38] Optimal kernel density estimation by statistical modeling with random censoring of observations
    Abdushukurov, Abdurakhim A.
    Bozorov, Sukhrob B.
    Mansurov, Dilshod R.
    VESTNIK TOMSKOGO GOSUDARSTVENNOGO UNIVERSITETA-UPRAVLENIE VYCHISLITELNAJA TEHNIKA I INFORMATIKA-TOMSK STATE UNIVERSITY JOURNAL OF CONTROL AND COMPUTER SCIENCE, 2024, (67):
  • [39] Data-based optimal smoothing of a wavelet density estimator
    Luo, H
    Angus, JE
    MATHEMATICAL AND COMPUTER MODELLING, 1997, 26 (03) : 31 - 43
  • [40] Bandwidth selectors for multivariate kernel density estimation
    Duong, T
    BULLETIN OF THE AUSTRALIAN MATHEMATICAL SOCIETY, 2005, 71 (02) : 351 - 352