Nonparametric estimation of distributions with categorical and continuous data

被引:100
|
作者
Li, Q [1 ]
Racine, J
机构
[1] Texas A&M Univ, Dept Econ, College Stn, TX 77843 USA
[2] Syracuse Univ, Dept Econ, Syracuse, NY 13244 USA
关键词
discrete and continuous variables; density estimation; nonparametric smoothing; cross-validation; asymptotic normality;
D O I
10.1016/S0047-259X(02)00025-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we consider the problem of estimating an unknown joint distribution which is defined over mixed discrete and continuous variables. A nonparametric kernel approach is proposed with smoothing parameters obtained from the cross-validated minimization of the estimator's integrated squared error. We derive the rate of convergence of the cross-validated smoothing parameters to their 'benchmark' optimal values, and we also establish the asymptotic normality of the resulting nonparametric kernel density estimator. Monte Carlo simulations illustrate that the proposed estimator performs substantially better than the conventional nonparametric frequency estimator in a range of settings. The simulations also demonstrate that the proposed approach does not suffer from known limitations of the likelihood cross-validation method which breaks down with commonly used kernels when the continuous variables are drawn from fat-tailed distributions. An empirical application demonstrates that the proposed method can yield superior predictions relative to commonly used parametric models. (C) 2003 Elsevier Science (USA). All rights reserved.
引用
收藏
页码:266 / 292
页数:27
相关论文
共 50 条
  • [41] Nonparametric estimation of component distributions in a multivariate mixture
    Hall, P
    Zhou, XH
    [J]. ANNALS OF STATISTICS, 2003, 31 (01): : 201 - 224
  • [42] NONPARAMETRIC ESTIMATION OF DENSITY FOR REGULARLY VARYING DISTRIBUTIONS
    BOFINGER, E
    [J]. AUSTRALIAN JOURNAL OF STATISTICS, 1975, 17 (03): : 192 - 195
  • [43] Nonparametric Estimation of Distributions in Random Effects Models
    Hart, Jeffrey D.
    Canette, Isabel
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (02) : 461 - 478
  • [44] Nonparametric estimation of waiting time distributions in a Markov model based on current status data
    Datta, Somnath
    Lan, Ling
    Sundaram, Rajeshwari
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (09) : 2885 - 2897
  • [45] Parametric and semi-nonparametric model strategies for the estimation of distributions of chemical contaminant data
    Nysen, Ruth
    Faes, Christel
    Ferrari, Pietro
    Verger, Philippe
    Aerts, Marc
    [J]. ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2015, 22 (02) : 423 - 444
  • [46] SMOLUCHOWSKI PROCESSES AND NONPARAMETRIC ESTIMATION OF FUNCTIONALS OF PARTICLE DISPLACEMENT DISTRIBUTIONS FROM COUNT DATA
    Goldenshluger, Alexander
    Jacobovic, Royi
    [J]. ANNALS OF APPLIED PROBABILITY, 2024, 34 (1B): : 1224 - 1270
  • [47] Parametric and semi-nonparametric model strategies for the estimation of distributions of chemical contaminant data
    Ruth Nysen
    Christel Faes
    Pietro Ferrari
    Philippe Verger
    Marc Aerts
    [J]. Environmental and Ecological Statistics, 2015, 22 : 423 - 444
  • [48] Nonparametric estimation for dependent data
    Johannes, Jan
    Rao, Suhasini Subba
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2011, 23 (03) : 661 - 681
  • [49] Nonparametric estimation with aggregated data
    Linton, O
    Whang, YJ
    [J]. ECONOMETRIC THEORY, 2002, 18 (02) : 420 - 468
  • [50] MAXIMUM-LIKELIHOOD ESTIMATION FOR MIXED CONTINUOUS AND CATEGORICAL-DATA WITH MISSING VALUES
    LITTLE, RJA
    SCHLUCHTER, MD
    [J]. BIOMETRIKA, 1985, 72 (03) : 497 - 512