A Chasm Between Identity and Equivalence Testing with Conditional Queries

Cited by: 2
Authors
Acharya, Jayadev [1]
Canonne, Clement L. [2]
Kamath, Gautam [3]
Affiliations
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] Simons Inst Theory Comp, Berkeley, CA USA
Keywords
property testing; distribution testing; conditional sampling; lower bounds; equivalence testing; uniformity testing; support size estimation; BOUNDS
DOI
10.4086/toc.2018.v014a019
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
A recent model for property testing of probability distributions (Chakraborty et al., ITCS 2013, Canonne et al., SICOMP 2015) enables tremendous savings in the sample complexity of testing algorithms, by allowing them to condition the sampling on subsets of the domain. In particular, Canonne, Ron, and Servedio (SICOMP 2015) showed that, in this setting, testing identity of an unknown distribution $D$ (i.e., whether $D = D^*$ for an explicitly known $D^*$) can be done with a constant number of queries (i.e., samples), independent of the support size $n$, in contrast to the $\Omega(\sqrt{n})$ samples required in the standard sampling model. However, it was unclear whether the same stark contrast exists for the case of testing equivalence, where both distributions are unknown. Indeed, while Canonne et al. established a $\mathrm{poly}(\log n)$-query upper bound for equivalence testing, very recently brought down to $\tilde{O}(\log\log n)$ by Falahatgar et al. (COLT 2015), whether a dependence on the domain size $n$ is necessary was still open, and explicitly posed by Fischer at the Bertinoro Workshop on Sublinear Algorithms (2014). In this article, we answer the question in the affirmative, showing that any testing algorithm for equivalence must make $\Omega(\sqrt{\log\log n})$ queries in the conditional sampling model. Interestingly, this demonstrates a gap between identity and equivalence testing, absent in the standard sampling model (where both problems have sampling complexity $n^{\Theta(1)}$). We also obtain results on the query complexity of uniformity testing and support-size estimation with conditional samples. In particular, we answer a question of Chakraborty et al. (ITCS 2013) showing that non-adaptive uniformity testing indeed requires $\Omega(\log n)$ queries in the conditional model. This is an exponential improvement on their previous lower bound of $\Omega(\log\log n)$, and matches up to polynomial factors their $\mathrm{poly}(\log n)$ upper bound.
For the problem of support-size estimation, we provide both adaptive and non-adaptive algorithms, with query complexities $\mathrm{poly}(\log\log n)$ and $\mathrm{poly}(\log n)$, respectively, and complement them with a lower bound of $\Omega(\log n)$ conditional queries for non-adaptive algorithms.
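To make the conditional sampling model concrete, the following is a minimal sketch of a conditional-sample oracle, not code from the article. The function name `cond_sample`, the dictionary representation of the distribution, and the choice to raise an error when the query set has zero probability mass are all assumptions made for illustration; the abstract does not specify how that case is handled.

```python
import random
from typing import Dict, Iterable, Optional


def cond_sample(
    dist: Dict[int, float],
    subset: Iterable[int],
    rng: Optional[random.Random] = None,
) -> int:
    """Return one draw from `dist` conditioned on the query set `subset`.

    `dist` maps domain elements to probabilities (hypothetical representation).
    Assumption: a query set with zero mass raises ValueError.
    """
    rng = rng or random.Random()
    # Keep only the elements of the query set that carry probability mass.
    elems = [x for x in subset if dist.get(x, 0.0) > 0.0]
    if not elems:
        raise ValueError("query set has zero probability mass under dist")
    weights = [dist[x] for x in elems]
    # random.choices normalizes the weights, which is exactly the conditioning step.
    return rng.choices(elems, weights=weights, k=1)[0]
```

A tester in this model interacts with the unknown distribution only through such queries. For example, repeatedly calling `cond_sample(dist, {i, j})` on a two-element set reveals the ratio of the probabilities of `i` and `j`, which plain sampling cannot do cheaply when both probabilities are small.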
Pages: 46