NONEXCHANGEABLE RANDOM PARTITION MODELS FOR MICROCLUSTERING

被引:2
|
作者
Di Benedetto, Giuseppe [1 ]
Caron, Francois [1 ]
Teh, Yee Whye [1 ]
机构
[1] Univ Oxford, Dept Stat, Oxford, England
来源
ANNALS OF STATISTICS | 2021年 / 49卷 / 04期
基金
欧盟第七框架计划; 英国工程与自然科学研究理事会;
关键词
Power-law; random partitions; completely random measure; stochastic process; sparse random graph; NORMALIZED RANDOM MEASURES; PRIORS;
D O I
10.1214/20-AOS2003
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many popular random partition models, such as the Chinese restaurant process and its two-parameter extension, fall in the class of exchangeable random partitions, and have found wide applicability in various fields. While the exchangeability assumption is sensible in many cases, it implies that the size of the clusters necessarily grows linearly with the sample size, and such feature may be undesirable for some applications. We present here a flexible class of nonexchangeable random partition models, which are able to generate partitions whose cluster sizes grow sublinearly with the sample size, and where the growth rate is controlled by one parameter. Along with this result, we provide the asymptotic behaviour of the number of clusters of a given size, and show that the model can exhibit a power-law behaviour, controlled by another parameter. The construction is based on completely random measures and a Poisson embedding of the random partition, and inference is performed using a Sequential Monte Carlo algorithm. Experiments on real data sets emphasise the usefulness of the approach compared to a two-parameter Chinese restaurant process.
引用
收藏
页码:1931 / 1957
页数:27
相关论文
共 50 条
  • [21] PARTITION MODELS
    HARTIGAN, JA
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1990, 19 (08) : 2745 - 2756
  • [22] De Finetti-type theorems for nonexchangeable 0–1 random variables
    Ricardo Vélez Ibarrola
    Tomás Prieto-Rumeau
    TEST, 2011, 20 : 293 - 310
  • [23] An Asymptotic Analysis of Random Partition Based Minibatch Momentum Methods for Linear Regression Models
    Gao, Yuan
    Zhu, Xuening
    Qi, Haobo
    Li, Guodong
    Zhang, Riquan
    Wang, Hansheng
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (03) : 1083 - 1096
  • [24] RANDOM PARTITION MODELS AND COMPLEMENTARY CLUSTERING OF ANGLO-SAXON PLACE-NAMES
    Zanella, Giacomo
    ANNALS OF APPLIED STATISTICS, 2015, 9 (04): : 1792 - 1822
  • [25] Variable selection in covariate dependent random partition models: an application to urinary tract infection
    Barcella, William
    De Iorio, Maria
    Baio, Gianluca
    Malone-Lee, James
    STATISTICS IN MEDICINE, 2016, 35 (08) : 1373 - 1389
  • [26] On the multiplicity of parts in a random partition
    Corteel, S
    Pittel, B
    Savage, CD
    Wilf, HS
    RANDOM STRUCTURES & ALGORITHMS, 1999, 14 (02) : 185 - 197
  • [27] De Finetti-type theorems for nonexchangeable 0-1 random variables
    Velez Ibarrola, Ricardo
    Prieto-Rumeau, Tomas
    TEST, 2011, 20 (02) : 293 - 310
  • [28] THEORY OF PARTITION MODELS
    KATSMAN, VY
    SOVIET JOURNAL OF COMPUTER AND SYSTEMS SCIENCES, 1991, 29 (01): : 64 - 73
  • [29] A De Finetti-type theorem for nonexchangeable finite-valued random variables
    Velez Ibarrola, Ricardo
    Prieto-Rumeau, Tomas
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2008, 347 (02) : 407 - 415
  • [30] On the number of summands in a random prime partition
    Dimbinaina Ralaivaosaona
    Monatshefte für Mathematik, 2012, 166 : 505 - 524