Distinct flavors of Zipf's law and its maximum likelihood fitting: Rank-size and size-distribution representations

被引:17
|
作者
Corral, Alvaro [1 ,2 ,3 ,4 ]
Serra, Isabel [1 ,5 ]
Ferrer-i-Cancho, Ramon [6 ]
机构
[1] Ctr Recerca Matemat, Edif C,Campus Bellaterra, E-08193 Barcelona, Spain
[2] Univ Autonoma Barcelona, Fac Ciencies, Dept Matemat, E-08193 Barcelona, Spain
[3] Barcelona Grad Sch Math, Edif C,Campus Bellaterra, E-08193 Barcelona, Spain
[4] Complex Sci Hub Vienna, Josefstadter Str 39, A-1080 Vienna, Austria
[5] Barcelona Supercomp Ctr BSC CNS, Comp Architecture & Operating Syst Grp, E-08034 Barcelona, Spain
[6] Univ Politecn Cataluna, Dept Ciencies Comp, Complex & Quantitat Linguist Lab, E-08034 Barcelona, Catalonia, Spain
关键词
84;
D O I
10.1103/PhysRevE.102.052113
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
In recent years, researchers have realized the difficulties of fitting power-law distributions properly. These difficulties are higher in Zipfian systems, due to the discreteness of the variables and to the existence of two representations for these systems, i.e., two versions depending on the random variable to fit: rank or size. The discreteness implies that a power law in one of the representations is not a power law in the other, and vice versa. We generate synthetic power laws in both representations and apply a state-of-the-art fitting method to each of the two random variables. The method (based on maximum likelihood plus a goodness-of-fit test) does not fit the whole distribution but the tail, understood as the part of a distribution above a cutoff that separates non-power-law behavior from power-law behavior. We find that, no matter which random variable is power-law distributed, using the rank as the random variable is problematic for fitting, in general (although it may work in some limit cases). One of the difficulties comes from recovering the "hidden" true ranks from the empirical ranks. On the contrary, the representation in terms of the distribution of sizes allows one to recover the true exponent (with some small bias when the underlying size distribution is a power law only asymptotically).
引用
收藏
页数:17
相关论文
共 22 条
  • [1] A new rank-size distribution of Zipf's Law and its applications
    Jiang, GH
    Shan, S
    Jiang, L
    Xu, XS
    [J]. SCIENTOMETRICS, 2002, 54 (01) : 119 - 130
  • [2] A new rank-size distribution of Zipf's law and its applications
    Jiang, GH
    Shan, S
    Xu, XS
    [J]. 8TH INTERNATIONAL CONFERENCE ON SCIENTOMETRICS AND INFORMETRICS, VOLS 1 AND 2 - ISSI-2001, PROCEEDINGS, 2001, : 287 - 295
  • [3] A new rank-size distribution of Zipf"s Law and its applications
    Guohua Jiang
    Shi Shan
    Lan Jiang
    Xuesong Xu
    [J]. Scientometrics, 2002, 54 : 119 - 130
  • [4] Why Does Zipf's Law Break Down in Rank-Size Distribution of Cities?
    Kuninaka, Hiroto
    Matsushita, Mitsugu
    [J]. JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2008, 77 (11)
  • [5] The return of Zipf: Towards a further understanding of the rank-size distribution
    Brakman, S
    Garretsen, H
    Van Marrewijk, C
    van den Berg, M
    [J]. JOURNAL OF REGIONAL SCIENCE, 1999, 39 (01) : 183 - 213
  • [6] Rank-size form of Zipf's law and its probability interpretation based on random graph dynamics
    Sun, Jingwei
    Shan, Shi
    Zhang, Yifei
    [J]. Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2010, 35 (SPECIAL ISSUE 2): : 169 - 171
  • [7] Further Analysis of the Zipf's Law: Does the Rank-Size Rule Really Exist?
    Nota, Fungisai
    Song, Shunfeng
    [J]. JOURNAL OF URBAN MANAGEMENT, 2012, 1 (02) : 19 - 31
  • [8] Zipf's rank-size rule: a comment on Chen and Zhou
    Harris, B
    [J]. ENVIRONMENT AND PLANNING B-PLANNING & DESIGN, 2004, 31 (06): : 931 - 932
  • [9] Hierarchy, cities size distribution and Zipf's law
    F. Semboloni
    [J]. The European Physical Journal B, 2008, 63 : 295 - 301
  • [10] SIZE DISTRIBUTION OF CITIES, ZIPF'S LAW REVISITED
    Roca Cladera, Josep
    Arellano Ramos, Blanca
    [J]. ACE-ARCHITECTURE CITY AND ENVIRONMENT, 2011, 6 (16): : 199 - 222