A Chasm Between Identity and Equivalence Testing with Conditional Queries

Cited by: 2

Authors:

Acharya, Jayadev [1]

Canonne, Clement L. [2]

Kamath, Gautam [3]

Affiliations:

[1] Cornell Univ, Ithaca, NY 14853 USA

[2] Stanford Univ, Stanford, CA 94305 USA

[3] Simons Inst Theory Comp, Berkeley, CA USA

Source:

THEORY OF COMPUTING | 2018 / Volume 14

Keywords:

property testing; distribution testing; conditional sampling; lower bounds; equivalence testing; uniformity testing; support size estimation; BOUNDS;

DOI:

10.4086/toc.2018.v014a019

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

A recent model for property testing of probability distributions (Chakraborty et al., ITCS 2013; Canonne et al., SICOMP 2015) enables tremendous savings in the sample complexity of testing algorithms, by allowing them to condition the sampling on subsets of the domain. In particular, Canonne, Ron, and Servedio (SICOMP 2015) showed that, in this setting, testing identity of an unknown distribution $D$ (i.e., whether $D = D^*$ for an explicitly known $D^*$) can be done with a constant number of queries (i.e., samples), independent of the support size $n$, in contrast to the required $\Omega(\sqrt{n})$ in the standard sampling model. However, it was unclear whether the same stark contrast exists for the case of testing equivalence, where both distributions are unknown. Indeed, while Canonne et al. established a $\mathrm{poly}(\log n)$-query upper bound for equivalence testing, very recently brought down to $\tilde{O}(\log\log n)$ by Falahatgar et al. (COLT 2015), whether a dependence on the domain size $n$ is necessary was still open, and explicitly posed by Fischer at the Bertinoro Workshop on Sublinear Algorithms (2014). In this article, we answer the question in the affirmative, showing that any testing algorithm for equivalence must make $\Omega(\sqrt{\log\log n})$ queries in the conditional sampling model. Interestingly, this demonstrates a gap between identity and equivalence testing, absent in the standard sampling model (where both problems have sampling complexity $n^{\Theta(1)}$). We also obtain results on the query complexity of uniformity testing and support-size estimation with conditional samples. In particular, we answer a question of Chakraborty et al. (ITCS 2013), showing that non-adaptive uniformity testing indeed requires $\Omega(\log n)$ queries in the conditional model. This is an exponential improvement on their previous lower bound of $\Omega(\log\log n)$, and matches up to polynomial factors their $\mathrm{poly}(\log n)$ upper bound. For the problem of support-size estimation, we provide both adaptive and non-adaptive algorithms, with query complexities $\mathrm{poly}(\log\log n)$ and $\mathrm{poly}(\log n)$, respectively, and complement them with a lower bound of $\Omega(\log n)$ conditional queries for non-adaptive algorithms.
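
To make the query model in the abstract concrete, the following is a minimal sketch of a conditional sampling oracle: given a query subset of the domain, it returns one draw from the distribution restricted to that subset. The function name `cond_sample`, the list representation of the distribution, and the fallback to a uniform draw when the queried subset has zero mass are illustrative assumptions, not details taken from the article.

```python
import random


def cond_sample(probs, query_set, rng=random):
    """Hypothetical conditional-sampling oracle (illustrative only).

    probs     -- list where probs[i] is the probability of domain element i
    query_set -- non-empty collection of domain elements to condition on
    rng       -- the random module or a random.Random instance

    Returns one element of query_set drawn with probability proportional
    to its mass under probs.
    """
    elems = sorted(set(query_set))
    if not elems:
        raise ValueError("query set must be non-empty")
    weights = [probs[i] for i in elems]
    if sum(weights) == 0:
        # Assumed convention: a zero-mass query set yields a uniform draw.
        return rng.choice(elems)
    return rng.choices(elems, weights=weights, k=1)[0]


if __name__ == "__main__":
    # Toy distribution on a domain of size 4 with mass only on {0, 1}.
    d = [0.5, 0.5, 0.0, 0.0]
    # Conditioning on {0, 2} can only ever return element 0.
    print(cond_sample(d, {0, 2}))
```

A tester in this model chooses the query sets itself; an adaptive tester may pick each set based on earlier answers, whereas a non-adaptive one must fix all of its query sets in advance.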


Pages: 46
