discrete and continuous variables;
density estimation;
nonparametric smoothing;
cross-validation;
asymptotic normality;
D O I:
10.1016/S0047-259X(02)00025-8
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
In this paper we consider the problem of estimating an unknown joint distribution which is defined over mixed discrete and continuous variables. A nonparametric kernel approach is proposed with smoothing parameters obtained from the cross-validated minimization of the estimator's integrated squared error. We derive the rate of convergence of the cross-validated smoothing parameters to their 'benchmark' optimal values, and we also establish the asymptotic normality of the resulting nonparametric kernel density estimator. Monte Carlo simulations illustrate that the proposed estimator performs substantially better than the conventional nonparametric frequency estimator in a range of settings. The simulations also demonstrate that the proposed approach does not suffer from known limitations of the likelihood cross-validation method which breaks down with commonly used kernels when the continuous variables are drawn from fat-tailed distributions. An empirical application demonstrates that the proposed method can yield superior predictions relative to commonly used parametric models. (C) 2003 Elsevier Science (USA). All rights reserved.
机构:
NYU, Leonard N Stern Sch Business, Dept Stat & Operat Res, New York, NY 10012 USANYU, Leonard N Stern Sch Business, Dept Stat & Operat Res, New York, NY 10012 USA