Testing Distributional Assumptions of Learning Algorithms

被引:1
|
作者
Rubinfeld, Ronitt [1 ]
Vasilyan, Arsen [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
关键词
agnostic learning; distribution testing; learning theory; K-WISE; INDEPENDENCE; HALFSPACES;
D O I
10.1145/3564246.3585117
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There are many important high dimensional function classes that have fast agnostic learning algorithms when strong assumptions on the distribution of examples can be made, such as Gaussianity or uniformity over the domain. But how can one be suffuciently confident that the data indeed satisfies the distributional assumption, so that one can trust in the output quality of the agnostic learning algorithm? We propose a model by which to systematically study the design of tester-learner pairs (A, T), such that if the distribution on examples in the data passes the tester T then one can safely trust the output of the agnostic learner A on the data. To demonstrate the power of the model, we apply it to the classical problem of agnostically learning halfspaces under the standard Gaussian distribution and present a tester-learner pair with a combined run-time of n(($) over tilde (1/epsilon 4)). This qualitatively matches that of the best known ordinary agnostic learning algorithms for this task. In contrast, finite sample Gaussian distribution testers do not exist for the L-1 and EMD distance measures. Previously it was known that half-spaces are well-approximated with low-degree polynomials relative to the Gaussian distribution. A key step in our analysis is showing that this is the case even relative to distributions whose low-degree moments approximately match those of a Gaussian. We also go beyond spherically-symmetric distributions, and give a tester-learner pair for halfspaces under the uniform distribution on {0, 1}(n) with combined run-time of n(O(1/epsilon 4)). This is achieved using polynomial approximation theory and critical index machinery of [Diakonikolas, Gopalan, Jaiswal, Servedio, and Viola 2009]. Can one design agnostic learning algorithms under distributional assumptions and count on future technical work to produce, as a matter of course, tester-learner pairs with similar run-time? Our answer is a resounding no, aswe showthere exist somewell-studied settings for which 2(($) over tilde(root n)) run-time agnostic learning algorithms are available, yet the combined run-times of tester-learner pairs must be as high as 2(Omega(n)). On that account, the design of testerlearner pairs is a research direction in its own right independent of standard agnostic learning. To be specific, our lower bounds apply to the problems of agnostically learning convex sets under the Gaussian distribution and for monotone Boolean functions under the uniform distribution over {0, 1}(n).
引用
收藏
页码:1643 / 1656
页数:14
相关论文
共 50 条
  • [1] Testing distributional assumptions: A GMM aproach
    Bontemps, Christian
    Meddahi, Nour
    [J]. JOURNAL OF APPLIED ECONOMETRICS, 2012, 27 (06) : 978 - 1012
  • [2] Testing distributional assumptions using a continuum of moments
    Amengual, Dante
    Carrasco, Marine
    Sentana, Enrique
    [J]. JOURNAL OF ECONOMETRICS, 2020, 218 (02) : 655 - 689
  • [3] Testing distributional assumptions in CUB models for the analysis of rating data
    Di Iorio, Francesca
    Lucchetti, Riccardo
    Simone, Rosaria
    [J]. ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2024, 108 (03) : 669 - 701
  • [4] Learning Online Algorithms with Distributional Advice
    Diakonikolas, Ilias
    Kontonis, Vasilis
    Tzamos, Christos
    Vakilian, Ali
    Zarifis, Nikos
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] TESTING THE DISTRIBUTIONAL ASSUMPTIONS OF LEAST-SQUARES LINEAR-REGRESSION
    MARSHALL, P
    SZIKSZAI, T
    LEMAY, V
    KOZAK, A
    [J]. FORESTRY CHRONICLE, 1995, 71 (02): : 213 - 218
  • [6] Sensitivity analysis: Distributional assumptions and confounding assumptions
    Weele, Tyler J. Vander
    [J]. BIOMETRICS, 2008, 64 (02) : 645 - 649
  • [7] THE VAGARIES OF MAKING DISTRIBUTIONAL ASSUMPTIONS
    PAULY, MV
    [J]. HEALTH AFFAIRS, 1995, 14 (03) : 284 - 285
  • [8] ON DISTRIBUTIONAL ASSUMPTIONS IN DEMAND THEORY
    GRODAL, B
    [J]. NATIONALOKONOMISK TIDSSKRIFT, 1992, 130 (02): : 327 - 337
  • [9] AN OBJECTIVE GRAPHICAL-METHOD FOR TESTING NORMAL DISTRIBUTIONAL ASSUMPTIONS USING PROBABILITY PLOTS
    MAGE, DT
    [J]. AMERICAN STATISTICIAN, 1982, 36 (02): : 116 - 120