Two-Sample Test with Kernel Projected Wasserstein Distance

被引:0
|
作者
Wang, Jie [1 ]
Gao, Rui [2 ]
Xie, Yao [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Univ Texas Austin, Austin, TX 78712 USA
关键词
ANOMALY DETECTION; CONVERGENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a kernel projected Wasserstein distance for the two-sample test, an essential building block in statistics and machine learning: given two sets of samples, to determine whether they are from the same distribution. This method operates by finding the nonlinear mapping in the data space which maximizes the distance between projected distributions. In contrast to existing works about projected Wasserstein distance, the proposed method circumvents the curse of dimensionality more efficiently. We present practical algorithms for computing this distance function together with the non-asymptotic uncertainty quantification of empirical estimates. Numerical examples validate our theoretical results and demonstrate good performance of the proposed method.
引用
收藏
页数:34
相关论文
共 50 条
  • [21] SPECTRAL REGULARIZED KERNEL TWO-SAMPLE TESTS
    Hagrass, Omar
    Sriperumbudur, Bharath K.
    Li, Bing
    [J]. ANNALS OF STATISTICS, 2024, 52 (03): : 1076 - 1101
  • [22] Kernel two-sample tests for manifold data
    Cheng, Xiuyuan
    Xie, Yao
    [J]. BERNOULLI, 2024, 30 (04) : 2572 - 2597
  • [23] High dimensional two-sample test based on the inter-point distance
    Shin-ichi Tsukada
    [J]. Computational Statistics, 2019, 34 : 599 - 615
  • [24] High dimensional two-sample test based on the inter-point distance
    Tsukada, Shin-ichi
    [J]. COMPUTATIONAL STATISTICS, 2019, 34 (02) : 599 - 615
  • [25] Two-sample goodness-of-fit tests on the flat torus based on Wasserstein distance and their relevance to structural biology*
    Gonzalez-Delgado, Javier
    Gonzalez-Sanz, Alberto
    Cortes, Juan
    Neuvial, Pierre
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (01): : 1547 - 1586
  • [26] A reproducing kernel Hilbert space log-rank test for the two-sample problem
    Fernandez, Tamara
    Rivera, Nicolas
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2021, 48 (04) : 1384 - 1432
  • [27] Manifold energy two-sample test
    Chu, Lynna
    Dai, Xiongtao
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (01): : 145 - 166
  • [28] On a new multivariate two-sample test
    Baringhaus, L
    Franz, C
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 88 (01) : 190 - 206
  • [29] Revisiting the two-sample runs test
    Baringhaus, Ludwig
    Henze, Norbert
    [J]. TEST, 2016, 25 (03) : 432 - 448
  • [30] The Bayesian two-sample t test
    Gönen, M
    Johnson, WO
    Lu, YG
    Westfall, PH
    [J]. AMERICAN STATISTICIAN, 2005, 59 (03): : 252 - 257