A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates

被引:0
|
作者
Merchant, Naveed [1 ]
Hart, Jeffrey D. [1 ]
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77840 USA
关键词
Bayes factors; permutation tests; cross-validation; consistent tests; Kolmogorov-Smirnov test;
D O I
10.3390/e24081071
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
A new nonparametric test of equality of two densities is investigated. The test statistic is an average of log-Bayes factors, each of which is constructed from a kernel density estimate. Prior densities for the bandwidths of the kernel estimates are required, and it is shown how to choose priors so that the log-Bayes factors can be calculated exactly. Critical values of the test statistic are determined by a permutation distribution, conditional on the data. An attractive property of the methodology is that a critical value of 0 leads to a test for which both type I and II error probabilities tend to 0 as sample sizes tend to infinity. Existing results on Kullback-Leibler loss of kernel estimates are crucial to obtaining these asymptotic results, and also imply that the proposed test works best with heavy-tailed kernels. Finite sample characteristics of the test are studied via simulation, and extensions to multivariate data are straightforward, as illustrated by an application to bivariate connectionist data.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [1] A Kernel Two-Sample Test
    Gretton, Arthur
    Borgwardt, Karsten M.
    Rasch, Malte J.
    Schoelkopf, Bernhard
    Smola, Alexander
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 723 - 773
  • [2] Bayesian Kernel Two-Sample Testing
    Zhang, Qinyi
    Wild, Veit
    Filippi, Sarah
    Flaxman, Seth
    Sejdinovic, Dino
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (04) : 1164 - 1176
  • [3] The Bayesian two-sample t test
    Gönen, M
    Johnson, WO
    Lu, YG
    Westfall, PH
    AMERICAN STATISTICIAN, 2005, 59 (03): : 252 - 257
  • [4] A Kernel Two-Sample Test for Functional Data
    Wynne, George
    Duncan, Andrew B.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [5] A Differentially Private Kernel Two-Sample Test
    Raj, Anant
    Law, Ho Chung Leon
    Sejdinovic, Dino
    Park, Mijung
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 11906 : 697 - 724
  • [6] A Kernel Two-Sample Test for Functional Data
    Wynne, George
    Duncan, Andrew B.
    Journal of Machine Learning Research, 2022, 23 : 1 - 51
  • [7] Network Traffic Fingerprinting Based on Approximated Kernel Two-Sample Test
    Kohout, Jan
    Pevny, Tomas
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (03) : 788 - 801
  • [8] Two-Sample Test with Kernel Projected Wasserstein Distance
    Wang, Jie
    Gao, Rui
    Xie, Yao
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [9] A Pooled Two-Sample Median Test Based on Density Estimation
    Bichutskiy, Vadim Y.
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2011, 10 (02) : 692 - 698
  • [10] A permutation-free kernel two-sample test
    Shekhar, Shubhanshu
    Kim, Ilmun
    Ramdas, Aaditya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,