Many important high-dimensional function classes admit fast agnostic learning algorithms when strong assumptions, such as Gaussianity or uniformity over the domain, can be made on the distribution of examples. But how can one be sufficiently confident that the data indeed satisfies the distributional assumption, so that one can trust the output quality of the agnostic learning algorithm? We propose a model for systematically studying the design of tester-learner pairs (A, T), such that if the distribution on examples in the data passes the tester T, then one can safely trust the output of the agnostic learner A on the data.

To demonstrate the power of the model, we apply it to the classical problem of agnostically learning halfspaces under the standard Gaussian distribution and present a tester-learner pair with a combined run-time of $n^{\tilde{O}(1/\epsilon^4)}$. This qualitatively matches the best known ordinary agnostic learning algorithms for this task. In contrast, finite-sample Gaussian distribution testers do not exist for the $L_1$ and EMD distance measures. It was previously known that halfspaces are well-approximated by low-degree polynomials relative to the Gaussian distribution. A key step in our analysis is showing that this remains the case even relative to distributions whose low-degree moments approximately match those of a Gaussian. We also go beyond spherically symmetric distributions and give a tester-learner pair for halfspaces under the uniform distribution on $\{0,1\}^n$ with a combined run-time of $n^{O(1/\epsilon^4)}$. This is achieved using polynomial approximation theory and the critical-index machinery of [Diakonikolas, Gopalan, Jaiswal, Servedio, and Viola 2009].

Can one design agnostic learning algorithms under distributional assumptions and count on future technical work to produce, as a matter of course, tester-learner pairs with similar run-times? Our answer is a resounding no: we show that there exist well-studied settings for which $2^{\tilde{O}(\sqrt{n})}$-time agnostic learning algorithms are available, yet the combined run-time of any tester-learner pair must be as high as $2^{\Omega(n)}$. On that account, the design of tester-learner pairs is a research direction in its own right, independent of standard agnostic learning. Specifically, our lower bounds apply to the problems of agnostically learning convex sets under the Gaussian distribution and monotone Boolean functions under the uniform distribution over $\{0,1\}^n$.
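
To make the moment-matching condition concrete, the sketch below is a minimal illustrative tester, not the algorithm from the paper: it only compares the empirical single-coordinate moments of the sample against the moments of a standard Gaussian, whereas the analysis above concerns moments of general low-degree polynomials. The names `moment_tester`, `degree`, and `tol` are our own illustrative choices.

```python
import numpy as np

def gaussian_moment(k):
    """E[g^k] for a standard Gaussian g: 0 for odd k, (k-1)!! for even k."""
    if k % 2 == 1:
        return 0.0
    m = 1.0
    for j in range(k - 1, 0, -2):  # double factorial (k-1)(k-3)...1
        m *= j
    return m

def moment_tester(samples, degree=4, tol=0.1):
    """Accept iff, for each coordinate, every empirical moment up to
    `degree` is within `tol` of the corresponding standard-Gaussian moment.

    Illustrative only: a full tester would also check moments of
    low-degree monomials spanning several coordinates.
    """
    for k in range(1, degree + 1):
        target = gaussian_moment(k)
        empirical = (samples ** k).mean(axis=0)  # one estimate per coordinate
        if np.any(np.abs(empirical - target) > tol):
            return False  # reject: a low-degree moment deviates from Gaussian
    return True  # accept: low-degree moments look Gaussian

# Usage: accepts genuine Gaussian data, rejects e.g. uniform data
# (uniform on [-1, 1] has second moment 1/3 instead of 1).
rng = np.random.default_rng(0)
print(moment_tester(rng.standard_normal((200_000, 10))))     # typically True
print(moment_tester(rng.uniform(-1.0, 1.0, (200_000, 10))))  # False
```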