Kernel density estimation;
correlated data;
resampling;
bandwidth selection;
multistage sampling;
random effects;
DOI:
10.1080/03610926.2018.1563179
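Chinese Library Classification (CLC) number:
O21 [Probability theory and mathematical statistics];
C8 [Statistics];
Subject classification code: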
020208 ;
070103 ;
0714 ;
Abstract:
Multistage sampling is a common sampling technique employed in many studies. In this setting, observations are identically distributed but not independent, so many traditional kernel smoothing techniques, which assume that the data are independent and identically distributed (i.i.d.), may not produce reasonable density estimates. In this paper, we sample repeatedly with replacement from each cluster to create multiple i.i.d. samples, each containing one observation from each cluster, and then compute a kernel density estimate from each i.i.d. sample. These estimates are then combined to form an estimate of the marginal probability density function of the population.
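
The resampling scheme described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the abstract does not say how the per-sample estimates are combined or how the bandwidth is chosen, so the pointwise average, the Gaussian kernel, the normal-reference bandwidth rule, and all function names and simulated data below are assumptions made for the example.

```python
import numpy as np

def gaussian_kde_on_grid(sample, grid, bandwidth):
    """Evaluate a Gaussian kernel density estimate of `sample` on `grid`."""
    z = (grid[:, None] - sample[None, :]) / bandwidth
    kernel_sum = np.exp(-0.5 * z ** 2).sum(axis=1)
    return kernel_sum / (sample.size * bandwidth * np.sqrt(2.0 * np.pi))

def resampled_cluster_kde(clusters, grid, n_resamples=200, seed=0):
    """Combine KDEs built from samples holding one observation per cluster.

    Assumptions (not taken from the abstract): estimates are combined by
    a pointwise average, and each bandwidth uses a normal-reference rule.
    """
    rng = np.random.default_rng(seed)
    estimates = np.empty((n_resamples, grid.size))
    for b in range(n_resamples):
        # Draw one observation from every cluster; across repetitions this
        # amounts to sampling with replacement within each cluster.
        sample = np.array([rng.choice(c) for c in clusters])
        # Normal-reference bandwidth (assumed choice for this sketch).
        h = 1.06 * sample.std(ddof=1) * sample.size ** (-1 / 5)
        estimates[b] = gaussian_kde_on_grid(sample, grid, h)
    return estimates.mean(axis=0)

# Hypothetical two-stage data: 30 clusters sharing a random effect, so
# observations within a cluster are correlated.
data_rng = np.random.default_rng(1)
clusters = []
for _ in range(30):
    effect = data_rng.standard_normal()      # cluster-level random effect
    size = int(data_rng.integers(2, 11))     # unequal cluster sizes
    clusters.append(effect + 0.5 * data_rng.standard_normal(size))

grid = np.linspace(-6.0, 6.0, 241)
density = resampled_cluster_kde(clusters, grid)
```

Because each resampled data set contains exactly one observation per cluster, its members are independent of one another, which is what lets a standard i.i.d. kernel estimator be applied to each resample before the estimates are combined.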