Zipf's law unzipped

被引:80
|
作者
Baek, Seung Ki [1 ]
Bernhardsson, Sebastian [2 ]
Minnhagen, Petter [1 ]
机构
[1] Umea Univ, Dept Phys, Integrated Sci Lab, S-90187 Umea, Sweden
[2] Niels Bohr Inst, Ctr Models Life, DK-2100 Copenhagen O, Denmark
来源
NEW JOURNAL OF PHYSICS | 2011年 / 13卷
基金
瑞典研究理事会;
关键词
DISTRIBUTIONS;
D O I
10.1088/1367-2630/13/4/043004
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Why does Zipf's law give a good description of data from seemingly completely unrelated phenomena? Here it is argued that the reason is that they can all be described as outcomes of a ubiquitous random group division: the elements can be citizens of a country and the groups family names, or the elements can be all the words making up a novel and the groups the unique words, or the elements could be inhabitants and the groups the cities in a country and so on. A random group formation (RGF) is presented from which a Bayesian estimate is obtained based on minimal information: it provides the best prediction for the number of groups with k elements, given the total number of elements, groups and the number of elements in the largest group. For each specification of these three values, the RGF predicts a unique group distribution N(k) proportional to exp(-bk)/k(gamma), where the power-law index gamma is a unique function of the same three values. The universality of the result is made possible by the fact that no system-specific assumptions are made about the mechanism responsible for the group division. The direct relation between gamma and the total number of elements, groups and the number of elements in the largest group is calculated. The predictive power of the RGF model is demonstrated by direct comparison with data from a variety of systems. It is shown that gamma usually takes values in the interval 1 <= gamma <= 2 and that the value for a given phenomenon depends in a systematic way on the total size of the dataset. The results are put in the context of earlier discussions on Zipf's and Gibrat's laws, N(k) proportional to k(-2) and the connection between growth models and RGF is elucidated.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] On Zipf's law and the bias of Zipf regressions
    Schluter, Christian
    [J]. EMPIRICAL ECONOMICS, 2021, 61 (02) : 529 - 548
  • [2] On Zipf’s law and the bias of Zipf regressions
    Christian Schluter
    [J]. Empirical Economics, 2021, 61 : 529 - 548
  • [3] Universality of Zipf's law
    Corominas-Murtra, Bernat
    Sole, Ricard V.
    [J]. PHYSICAL REVIEW E, 2010, 82 (01):
  • [4] Unzipping Zipf's law
    Lestrade, Sander
    [J]. PLOS ONE, 2017, 12 (08):
  • [5] Zipf's law in percolation
    Watanabe, MS
    [J]. PHYSICAL REVIEW E, 1996, 53 (04) : 4187 - 4190
  • [6] Zipf's law in multifragmentation
    Campi, X
    Krivine, H
    [J]. PHYSICAL REVIEW C, 2005, 72 (05):
  • [7] Zipf's Law in Passwords
    Wang, Ding
    Cheng, Haibo
    Wang, Ping
    Huang, Xinyi
    Jian, Gaopeng
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (11) : 2776 - 2791
  • [8] Unzipping Zipf's law
    Lada Adamic
    [J]. Nature, 2011, 474 : 164 - 165
  • [9] Universality of Zipf's law
    Kawamura, K
    Hatano, N
    [J]. JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2002, 71 (05) : 1211 - 1213
  • [10] Mandelbrot's Model for Zipf's Law Can Mandelbrot's Model Explain Zipf's Law for Language?
    Manin, D. Yu.
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2009, 16 (03) : 274 - 285