Learning mixtures of structured distributions over discrete domains

被引:0
|
作者
Chan, Siu-On [1 ]
Diakonikolas, Ilias [2 ]
Servedio, Rocco A. [3 ]
Sun, Xiaorui [3 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[3] Columbia Univ, New York, NY 10027 USA
关键词
MAXIMUM-LIKELIHOOD-ESTIMATION; LOG-CONCAVE; DENSITY; PROBABILITY; MONOTONE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Let C be a class of probability distributions over the discrete domain [n] = {1, ..., n}: We show that if C satisfies a rather general condition - essentially, that each distribution in C can be well-approximated by a variable-width histogram with few bins - then there is a highly efficient (both in terms of running time and sample complexity) algorithm that can learn any mixture of k unknown distributions from C. We analyze several natural types of distributions over [n], including log-concave, monotone hazard rate and unimodal distributions, and show that they have the required structural property of being well-approximated by a histogram with few bins. Applying our general algorithm, we obtain near-optimally efficient algorithms for all these mixture learning problems as described below. More precisely, Log-concave distributions: We learn any mixture of k log-concave distributions over [n] using k.(O) over tilde (1/epsilon(4)) samples (independent of n) and running in time (O) over tilde (k log(n)/epsilon(4)) bit-operations (note that reading a single sample from [n] takes Theta (log n) bit operations). For the special case k = 1 we give an efficient algorithm using (O) over tilde (1/epsilon(3)) samples; this generalizes the main result of [DDS12b] from the class of Poisson Binomial distributions to the much broader class of all log-concave distributions. Our upper bounds are not far from optimal since any algorithm for this learning problem requires Omega (k/epsilon(5/2)) samples. Monotone hazard rate (MHR) distributions: We learn any mixture of k MHR distributions over [n] using O (k log(n/epsilon)/epsilon(4)) samples and running in time (O) over tilde (k log(2) (n)/epsilon(4)) bit-operations. Any algorithm for this learning problem must use Omega (k log(n)/epsilon(3)) samples. Unimodal distributions: We give an algorithm that learns any mixture of k unimodal distributions over [n] using O (k log(n)/epsilon(4)) samples and running in time (O) over tilde (k log(2) (n)/epsilon(4)) bit-operations. Any algorithm for this problem must use Omega (k log(n)/epsilon(3)) samples.
引用
下载
收藏
页码:1380 / 1394
页数:15
相关论文
共 50 条
  • [31] On learning mixtures of heavy-tailed distributions
    Dasgupta, A
    Hopcroft, J
    Kleinberg, J
    Sandler, M
    46th Annual IEEE Symposium on Foundations of Computer Science, Proceedings, 2005, : 491 - 500
  • [32] Faithfulness and learning hypergraphs from discrete distributions
    Klimova, Anna
    Uhler, Caroline
    Rudas, Tamas
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 87 : 57 - 72
  • [33] Conjugated Discrete Distributions for Distributional Reinforcement Learning
    Lindenberg, Bjorn
    Lindahl, Jonas Nordqvistand Karl-Olof
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7516 - 7524
  • [34] Modules over discrete valuation domains. II
    Krylov P.A.
    Tuganbaev A.A.
    Journal of Mathematical Sciences, 2008, 151 (5) : 3255 - 3371
  • [35] Modules over discrete valuation domains. i
    Krylov P.A.
    Tuganbaev A.A.
    Journal of Mathematical Sciences, 2007, 145 (4) : 4997 - 5117
  • [36] Modules over Discrete Valuation Domains. III
    Krylov P.A.
    Tuganbaev A.A.
    Journal of Mathematical Sciences, 2021, 258 (2) : 199 - 221
  • [37] Efficiently Learning Structured Distributions from Untrusted Batches
    Chen, Sitan
    Li, Jerry
    Moitra, Ankur
    PROCEEDINGS OF THE 52ND ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '20), 2020, : 960 - 973
  • [38] Learning and Applying a Function Over Distributions
    Healey, Glenn
    Zhao, Shiyuan
    IEEE ACCESS, 2020, 8 : 172196 - 172203
  • [39] Bayesian analysis for mixtures of discrete distributions with a non-parametric component
    Alhaji, Baba B.
    Dai, Hongsheng
    Hayashi, Yoshiko
    Vinciotti, Veronica
    Harrison, Andrew
    Lausen, Berthold
    JOURNAL OF APPLIED STATISTICS, 2016, 43 (08) : 1369 - 1385
  • [40] Confidence intervals of the hazard rate function for discrete distributions using mixtures
    Karlis, Dimitris
    Patilea, Valentin
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (11) : 5388 - 5401