Density estimation under a two-sample semiparametric model

被引:15
|
作者
Qin, J
Zhang, B
机构
[1] Univ Toledo, Dept Math, Toledo, OH 43606 USA
[2] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10021 USA
关键词
bandwidth; biased sampling problem; case-control data; kernel density estimator; logistic regression; retrospective sampling;
D O I
10.1080/10485250500039346
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider estimating a density function under a two-sample semiparametric model in which the log ratio of two density functions is a quadratic function of data. This two-sample semiparametric model, arising naturally from case-control studies and logistic discriminant analysis, can be regarded as a biased sampling model. Under this model, the difference between the two samples is quantified. A kernel-based density estimator is constructed by smoothing the increments of the maximum semiparametric likelihood estimator of the underlying distribution function. The required computation for our method can be accomplished by using the standard statistical software packages for categorical data analysis. We establish some asymptotic results on the proposed kernel density estimator. In addition, we present some results on a simulation study and on the analysis of two data sets to demonstrate the utility of the proposed density estimator.
引用
收藏
页码:665 / 683
页数:19
相关论文
共 50 条