NONEXCHANGEABLE RANDOM PARTITION MODELS FOR MICROCLUSTERING

被引:2
|
作者
Di Benedetto, Giuseppe [1 ]
Caron, Francois [1 ]
Teh, Yee Whye [1 ]
机构
[1] Univ Oxford, Dept Stat, Oxford, England
来源
ANNALS OF STATISTICS | 2021年 / 49卷 / 04期
基金
欧盟第七框架计划; 英国工程与自然科学研究理事会;
关键词
Power-law; random partitions; completely random measure; stochastic process; sparse random graph; NORMALIZED RANDOM MEASURES; PRIORS;
D O I
10.1214/20-AOS2003
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many popular random partition models, such as the Chinese restaurant process and its two-parameter extension, fall in the class of exchangeable random partitions, and have found wide applicability in various fields. While the exchangeability assumption is sensible in many cases, it implies that the size of the clusters necessarily grows linearly with the sample size, and such feature may be undesirable for some applications. We present here a flexible class of nonexchangeable random partition models, which are able to generate partitions whose cluster sizes grow sublinearly with the sample size, and where the growth rate is controlled by one parameter. Along with this result, we provide the asymptotic behaviour of the number of clusters of a given size, and show that the model can exhibit a power-law behaviour, controlled by another parameter. The construction is based on completely random measures and a Poisson embedding of the random partition, and inference is performed using a Sequential Monte Carlo algorithm. Experiments on real data sets emphasise the usefulness of the approach compared to a two-parameter Chinese restaurant process.
引用
收藏
页码:1931 / 1957
页数:27
相关论文
共 50 条
  • [41] Hidden Markov partition models
    Farcomeni, Alessio
    STATISTICS & PROBABILITY LETTERS, 2011, 81 (12) : 1766 - 1770
  • [42] MODELS FOR ESTIMATION OF PARTITION CONSTANTS
    CARREIRA, LA
    HILAL, SH
    KARICKHOFF, SW
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1994, 208 : 46 - ENVR
  • [43] Spatial Product Partition Models
    Page, Garritt L.
    Quintana, Fernando A.
    BAYESIAN ANALYSIS, 2016, 11 (01): : 265 - 298
  • [44] The limiting distribution of the trace of a random plane partition
    Kamenov, E. P.
    Mutafchien, L. R.
    ACTA MATHEMATICA HUNGARICA, 2007, 117 (04) : 293 - 314
  • [45] On the Maximal Multiplicity of Parts in a Random Integer Partition
    Ljuben R. Mutafchiev
    The Ramanujan Journal, 2005, 9 : 305 - 316
  • [46] On the size of the Durfee square of a random integer partition
    Mutafchiev, LR
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2002, 142 (01) : 173 - 184
  • [47] Core size of a random partition for the Plancherel measure
    Rostam, Salim
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2023, 59 (04): : 2151 - 2188
  • [48] On the average time complexity of computation with random partition
    Liao, Mingxue
    Lv, Pin
    COMPUTING, 2024, 106 (03) : 741 - 758
  • [49] Cluster analysis via random partition distributions
    Dahl, David B.
    Andros, Jacob
    Carter, J. Brandon
    STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (02) : 135 - 148
  • [50] Asymptotics of the partition function of a random matrix model
    Bleher, PM
    Its, AR
    ANNALES DE L INSTITUT FOURIER, 2005, 55 (06) : 1943 - 2000