A nonparametric test for equality of distributions with mixed categorical and continuous data

被引:104
|
作者
Li, Qi [1 ,2 ]
Maasoumi, Esfandiar [3 ]
Racine, Jeffrey S. [4 ]
机构
[1] Texas A&M Univ, Dept Econ, College Stn, TX 77843 USA
[2] Tsinghua Univ, Dept Econ, Beijing 100084, Peoples R China
[3] Emory Clin, Dept Econ, Atlanta, GA 30322 USA
[4] McMaster Univ, Dept Econ, Hamilton, ON L85 4M4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Mixed discrete and continuous variables; Density testing; Nonparametric smoothing; Cross-validation; CENTRAL-LIMIT-THEOREM; MULTIVARIATE BINARY DISCRIMINATION; OF-FIT TESTS; KERNEL-METHOD; DENSITY ESTIMATORS; CROSS-VALIDATION; METRIC ENTROPY; GOODNESS; INDEPENDENCE; STATISTICS;
D O I
10.1016/j.jeconom.2008.10.007
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper we consider the problem of testing for equality of two density or two conditional density functions defined over mixed discrete and continuous variables. We smooth both the discrete and continuous variables. with the smoothing parameters chosen via least-squares cross-validation. The test statistics are shown to have (asymptotic) normal null distributions. However, we advocate the use of bootstrap methods in order to better approximate their null distribution in finite-sample settings and we provide asymptotic validity of the proposed bootstrap method. Simulations show that the proposed tests have better power than both conventional frequency-based tests and smoothing tests based on ad hoc smoothing parameter selection, while a demonstrative empirical application to the joint distribution of earnings and educational attainment underscores the utility of the proposed approach in mixed data settings. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:186 / 200
页数:15
相关论文
共 50 条
  • [1] Nonparametric estimation of distributions with categorical and continuous data
    Li, Q
    Racine, J
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2003, 86 (02) : 266 - 292
  • [2] Nonparametric Estimation of Conditional CDF and Quantile Functions With Mixed Categorical and Continuous Data
    Li, Qi
    Racine, Jeffrey S.
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2008, 26 (04) : 423 - 434
  • [3] Nonparametric estimation of regression functions with both categorical and continuous data
    Racine, J
    Li, Q
    [J]. JOURNAL OF ECONOMETRICS, 2004, 119 (01) : 99 - 130
  • [4] A nonparametric test for the equality of counting processes with panel count data
    Balakrishnan, N.
    Zhao, Xingqiu
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (01) : 135 - 142
  • [5] Probabilistic Mixed Topological Map for Categorical and Continuous Data
    Rogovschi, Nicoleta
    Lebbah, Mustapha
    Bennani, Younes
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 224 - +
  • [6] A nonparametric test for proving noninferiority in clinical trials with ordered categorical data
    Munzel, U
    Hauschke, D
    [J]. PHARMACEUTICAL STATISTICS, 2003, 2 (01) : 31 - 37
  • [7] A nonparametric adaptive EWMA control chart for monitoring mixed continuous and categorical data using self-starting strategy
    Xue, Li
    Wang, Qiuyu
    An, Lisheng
    He, Zhen
    Feng, Sen
    Zhu, Jie
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 188
  • [8] Generalized nonparametric smoothing with mixed discrete and continuous data
    Li, Degui
    Simar, Leopold
    Zelenyuk, Valentin
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 100 : 424 - 444
  • [9] QUADRATIC LOCATION DISCRIMINANT FUNCTIONS FOR MIXED CATEGORICAL AND CONTINUOUS DATA
    KRZANOWSKI, WJ
    [J]. STATISTICS & PROBABILITY LETTERS, 1994, 19 (02) : 91 - 95
  • [10] AN HYBRID APPROACH TO FEATURE SELECTION FOR MIXED CATEGORICAL AND CONTINUOUS DATA
    Doquire, Gauthier
    Verleysen, Michel
    [J]. KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 394 - 401