Learning mixtures of structured distributions over discrete domains

被引：0

作者：

Chan, Siu-On ^{[1
]}

Diakonikolas, Ilias ^{[2
]}

Servedio, Rocco A. ^{[3
]}

Sun, Xiaorui ^{[3
]}

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland

[3] Columbia Univ, New York, NY 10027 USA

来源：

PROCEEDINGS OF THE TWENTY-FOURTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA 2013) | 2013年

关键词：

MAXIMUM-LIKELIHOOD-ESTIMATION; LOG-CONCAVE; DENSITY; PROBABILITY; MONOTONE;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Let C be a class of probability distributions over the discrete domain [n] = {1, ..., n}: We show that if C satisfies a rather general condition - essentially, that each distribution in C can be well-approximated by a variable-width histogram with few bins - then there is a highly efficient (both in terms of running time and sample complexity) algorithm that can learn any mixture of k unknown distributions from C. We analyze several natural types of distributions over [n], including log-concave, monotone hazard rate and unimodal distributions, and show that they have the required structural property of being well-approximated by a histogram with few bins. Applying our general algorithm, we obtain near-optimally efficient algorithms for all these mixture learning problems as described below. More precisely, Log-concave distributions: We learn any mixture of k log-concave distributions over [n] using k.(O) over tilde (1/epsilon(4)) samples (independent of n) and running in time (O) over tilde (k log(n)/epsilon(4)) bit-operations (note that reading a single sample from [n] takes Theta (log n) bit operations). For the special case k = 1 we give an efficient algorithm using (O) over tilde (1/epsilon(3)) samples; this generalizes the main result of [DDS12b] from the class of Poisson Binomial distributions to the much broader class of all log-concave distributions. Our upper bounds are not far from optimal since any algorithm for this learning problem requires Omega (k/epsilon(5/2)) samples. Monotone hazard rate (MHR) distributions: We learn any mixture of k MHR distributions over [n] using O (k log(n/epsilon)/epsilon(4)) samples and running in time (O) over tilde (k log(2) (n)/epsilon(4)) bit-operations. Any algorithm for this learning problem must use Omega (k log(n)/epsilon(3)) samples. Unimodal distributions: We give an algorithm that learns any mixture of k unimodal distributions over [n] using O (k log(n)/epsilon(4)) samples and running in time (O) over tilde (k log(2) (n)/epsilon(4)) bit-operations. Any algorithm for this problem must use Omega (k log(n)/epsilon(3)) samples.

引用

下载

页码：1380 / 1394

页数：15

共 50 条

[31] On learning mixtures of heavy-tailed distributions
Dasgupta, A
Hopcroft, J
Kleinberg, J
Sandler, M
46th Annual IEEE Symposium on Foundations of Computer Science, Proceedings, 2005, : 491 - 500
[32] Faithfulness and learning hypergraphs from discrete distributions
Klimova, Anna
Uhler, Caroline
Rudas, Tamas
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 87 : 57 - 72
[33] Conjugated Discrete Distributions for Distributional Reinforcement Learning
Lindenberg, Bjorn
Lindahl, Jonas Nordqvistand Karl-Olof
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7516 - 7524
[34] Modules over discrete valuation domains. II
Krylov P.A.
Tuganbaev A.A.
Journal of Mathematical Sciences, 2008, 151 (5) : 3255 - 3371
[35] Modules over discrete valuation domains. i
Krylov P.A.
Tuganbaev A.A.
Journal of Mathematical Sciences, 2007, 145 (4) : 4997 - 5117
[36] Modules over Discrete Valuation Domains. III
Krylov P.A.
Tuganbaev A.A.
Journal of Mathematical Sciences, 2021, 258 (2) : 199 - 221
[37] Efficiently Learning Structured Distributions from Untrusted Batches
Chen, Sitan
Li, Jerry
Moitra, Ankur
PROCEEDINGS OF THE 52ND ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '20), 2020, : 960 - 973
[38] Learning and Applying a Function Over Distributions
Healey, Glenn
Zhao, Shiyuan
IEEE ACCESS, 2020, 8 : 172196 - 172203
[39] Bayesian analysis for mixtures of discrete distributions with a non-parametric component
Alhaji, Baba B.
Dai, Hongsheng
Hayashi, Yoshiko
Vinciotti, Veronica
Harrison, Andrew
Lausen, Berthold
JOURNAL OF APPLIED STATISTICS, 2016, 43 (08) : 1369 - 1385
[40] Confidence intervals of the hazard rate function for discrete distributions using mixtures
Karlis, Dimitris
Patilea, Valentin
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (11) : 5388 - 5401

← 1 2 3 4 5 →