Learning mixtures of product distributions over discrete domains

被引:28
|
作者
Feldman, Jon [1 ,2 ]
O'Donnell, Ryan [3 ]
Servedio, Rocco A. [4 ]
机构
[1] Google, New York Off, New York, NY 10011 USA
[2] Columbia Univ, Dept Ind Engn & Operat Res, New York, NY 10027 USA
[3] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[4] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
computational learning theory; PAC learning; mixture distributions; product distributions;
D O I
10.1137/060670705
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of learning mixtures of product distributions over discrete domains in the distribution learning framework introduced by Kearns et al. [Proceedings of the 26th Annual Symposium on Theory of Computing (STOC), Montreal, QC, 1994, ACM, New York, pp. 273-282]. We give a poly(n/epsilon)-time algorithm for learning a mixture of k arbitrary product distributions over the n-dimensional Boolean cube {0, 1}(n) to accuracy epsilon, for any constant k. Previous polynomial-time algorithms could achieve this only for k = 2 product distributions; our result answers an open question stated independently in [M. Cryan, Learning and Approximation Algorithms for Problems Motivated by Evolutionary Trees, Ph. D. thesis, University of Warwick, Warwick, UK, 1999] and [Y. Freund and Y. Mansour, Proceedings of the 12th Annual Conference on Computational Learning Theory, 1999, pp. 183-192]. We further give evidence that no polynomial-time algorithm can succeed when k is superconstant, by reduction from a difficult open problem in PAC (probably approximately correct) learning finally, we generalize our poly(n/epsilon)-time algorithm to learn any mixture of k = O(1) product distributions over {0, 1,..., b - 1}(n), for any b = O(1).
引用
收藏
页码:1536 / 1564
页数:29
相关论文
共 50 条
  • [1] Learning mixtures of product distributions over discrete domains
    Feldman, J
    O'Donnell, R
    Servedio, RA
    [J]. 46TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2005, : 501 - 510
  • [2] Learning mixtures of structured distributions over discrete domains
    Chan, Siu-On
    Diakonikolas, Ilias
    Servedio, Rocco A.
    Sun, Xiaorui
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA 2013), 2013, : 1380 - 1394
  • [3] Learning Arbitrary Statistical Mixtures of Discrete Distributions
    Li, Jian
    Rabani, Yuval
    Schulman, Leonard J.
    Swamy, Chaitanya
    [J]. STOC'15: PROCEEDINGS OF THE 2015 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2015, : 743 - 752
  • [4] Learning Mixtures of Smooth Product Distributions: Identifiability and Algorithm
    Kargas, Nikos
    Sidiropoulos, Nicholas D.
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 388 - 396
  • [5] PAC learning of probability distributions over a discrete domain
    Magnoni, L
    Mirolli, M
    Montagna, F
    Simi, G
    [J]. THEORETICAL COMPUTER SCIENCE, 2003, 299 (1-3) : 37 - 63
  • [6] MIXTURES OF SOME DISCRETE DISTRIBUTIONS
    GUPTA, RC
    [J]. SOUTH AFRICAN STATISTICAL JOURNAL, 1974, 8 (02) : 83 - 92
  • [7] Testing Mixtures of Discrete Distributions
    Aliakbarpour, Maryam
    Kumar, Ravi
    Rubinfeld, Ronitt
    [J]. CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
  • [8] ERLANG MIXTURES OF SOME DISCRETE DISTRIBUTIONS
    Roy, M. K.
    Haque, M. E.
    Roy, D. C.
    [J]. PAKISTAN JOURNAL OF STATISTICS, 2008, 24 (01): : 45 - 56
  • [9] On spectral learning of mixtures of distributions
    Achlioptas, D
    McSherry, R
    [J]. LEARNING THEORY, PROCEEDINGS, 2005, 3559 : 458 - 469
  • [10] A REPRESENTATION FOR DISCRETE-DISTRIBUTIONS BY EQUIPROBABLE MIXTURES
    PETERSON, AV
    KRONMAL, RA
    [J]. JOURNAL OF APPLIED PROBABILITY, 1980, 17 (01) : 102 - 111