Two-sample Testing Using Deep Learning

被引:0
|
作者
Kirchler, Matthias [1 ,2 ]
Khorasani, Shahryar [1 ]
Kloft, Marius [2 ,3 ]
Lippert, Christoph [1 ,4 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst Digital Engn, Potsdam, Germany
[2] Tech Univ Kaiserslautern, Kaiserslautern, Germany
[3] Univ Southern Calif, Los Angeles, CA 90007 USA
[4] Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY USA
基金
加拿大健康研究院; 美国国家卫生研究院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a two-sample testing procedure based on learned deep neural network representations. To this end, we define two test statistics that perform an asymptotic location test on data samples mapped onto a hidden layer. The tests are consistent and asymptotically control the type-1 error rate. Their test statistics can be evaluated in linear time (in the sample size). Suitable data representations are obtained in a data-driven way, by solving a supervised or unsupervised transfer-learning task on an auxiliary (potentially distinct) data set. If no auxiliary data is available, we split the data into two chunks: one for learning representations and one for computing the test statistic. In experiments on audio samples, natural images and three-dimensional neuroimaging data our tests yield significant decreases in type-2 error rate (up to 35 percentage points) compared to state-of-the-art two-sample tests such as kernel-methods and classifier two-sample tests.*
引用
收藏
页码:1387 / 1397
页数:11
相关论文
共 50 条
  • [21] Bayesian multiple testing for two-sample multivariate endpoints
    Gönen, M
    Westfall, PH
    Johnson, WO
    BIOMETRICS, 2003, 59 (01) : 76 - 82
  • [22] Two-Sample Testing can be as hard as Structure Learning in Ising Models: Minimax Lower Bounds
    Gangrade, Aditya
    Nazer, Bobak
    Saligrama, Venkatesh
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6931 - 6935
  • [23] Two-sample t α -test for testing hypotheses in small-sample experiments
    Tan, Yuan-De
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2023, 19 (01): : 1 - 19
  • [24] A randomized Baumgartner statistic for multivariate two-sample testing hypothesis
    Murakami, Hidetoshi
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (01) : 189 - 201
  • [25] A permutation approach for testing heterogeneity in two-sample categorical variables
    Rosa Arboretti Giancristofaro
    Stefano Bonnini
    Fortunato Pesarin
    Statistics and Computing, 2009, 19 : 209 - 216
  • [26] Fast Two-Sample Testing with Analytic Representations of Probability Measures
    Chwialkowski, Kacper
    Ramdas, Aaditya
    Sejdinovic, Dino
    Gretton, Arthur
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [27] On Wasserstein Two-Sample Testing and Related Families of Nonparametric Tests
    Ramdas, Aaditya
    Trillos, Nicolas Garcia
    Cuturi, Marco
    ENTROPY, 2017, 19 (02):
  • [28] Union–intersection permutation solution for two-sample equivalence testing
    Fortunato Pesarin
    Luigi Salmaso
    Eleonora Carrozzo
    Rosa Arboretti
    Statistics and Computing, 2016, 26 : 693 - 701
  • [29] A nonparametric two-sample hypothesis testing problem for random graphs
    Tang, Minh
    Athreya, Avanti
    Sussman, Daniel L.
    Lyzinski, Vince
    Priebe, Carey E.
    BERNOULLI, 2017, 23 (03) : 1599 - 1630
  • [30] Asymptotically Optimal One- and Two-Sample Testing With Kernels
    Zhu, Shengyu
    Chen, Biao
    Chen, Zhitang
    Yang, Pengfei
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (04) : 2074 - 2092