Two-sample Testing Using Deep Learning

被引:0
|
作者
Kirchler, Matthias [1 ,2 ]
Khorasani, Shahryar [1 ]
Kloft, Marius [2 ,3 ]
Lippert, Christoph [1 ,4 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst Digital Engn, Potsdam, Germany
[2] Tech Univ Kaiserslautern, Kaiserslautern, Germany
[3] Univ Southern Calif, Los Angeles, CA 90007 USA
[4] Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY USA
基金
加拿大健康研究院; 美国国家卫生研究院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a two-sample testing procedure based on learned deep neural network representations. To this end, we define two test statistics that perform an asymptotic location test on data samples mapped onto a hidden layer. The tests are consistent and asymptotically control the type-1 error rate. Their test statistics can be evaluated in linear time (in the sample size). Suitable data representations are obtained in a data-driven way, by solving a supervised or unsupervised transfer-learning task on an auxiliary (potentially distinct) data set. If no auxiliary data is available, we split the data into two chunks: one for learning representations and one for computing the test statistic. In experiments on audio samples, natural images and three-dimensional neuroimaging data our tests yield significant decreases in type-2 error rate (up to 35 percentage points) compared to state-of-the-art two-sample tests such as kernel-methods and classifier two-sample tests.*
引用
收藏
页码:1387 / 1397
页数:11
相关论文
共 50 条
  • [1] Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data
    Liu, Feng
    Xu, Wenkai
    Lu, Jie
    Sutherland, Danica J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Addressing maximization bias in reinforcement learning with two-sample testing
    Waltz, Martin
    Okhrin, Ostap
    ARTIFICIAL INTELLIGENCE, 2024, 336
  • [3] Bayesian Kernel Two-Sample Testing
    Zhang, Qinyi
    Wild, Veit
    Filippi, Sarah
    Flaxman, Seth
    Sejdinovic, Dino
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (04) : 1164 - 1176
  • [4] Nonparametric Two-Sample Testing by Betting
    Shekhar, Shubhanshu
    Ramdas, Aaditya
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (02) : 1178 - 1203
  • [5] Two-sample testing in high dimensions
    Stadler, Nicolas
    Mukherjee, Sach
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2017, 79 (01) : 225 - 246
  • [6] Testing variability in the two-sample case
    Ramsey, Philip H.
    Ramsey, Patricia P.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2007, 36 (02) : 233 - 248
  • [7] Two-sample testing for random graphs
    Wen, Xiaoyi
    STATISTICAL ANALYSIS AND DATA MINING, 2024, 17 (04)
  • [8] CLASSIFICATION ACCURACY AS A PROXY FOR TWO-SAMPLE TESTING
    Kim, Ilmun
    Ramdas, Aaditya
    Singh, Aarti
    Wasserman, Larry
    ANNALS OF STATISTICS, 2021, 49 (01): : 411 - 434
  • [9] Two-sample testing with local community depth
    Evans, Ciaran
    Berenhaut, Kenneth S.
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [10] Practical Methods for Graph Two-Sample Testing
    Ghoshdastidar, Debarghya
    von Luxburg, Ulrike
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31