On the origin of long-range correlations in texts

被引:75
|
作者
Altmann, Eduardo G. [1 ]
Cristadoro, Giampaolo [2 ]
Esposti, Mirko Degli [2 ]
机构
[1] Max Planck Inst Phys Komplexer Syst, D-01187 Dresden, Germany
[2] Univ Bologna, Dipartimento Matemat, I-40126 Bologna, Italy
关键词
complex systems; language dynamics; long correlations; statistical physics; burstiness; FRACTAL CORRELATIONS; KEYWORD DETECTION; 1/F NOISE; LANGUAGE;
D O I
10.1073/pnas.1117723109
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The complexity of human interactions with social and natural phenomena is mirrored in the way we describe our experiences through natural language. In order to retain and convey such a high dimensional information, the statistical properties of our linguistic output has to be highly correlated in time. An example are the robust observations, still largely not understood, of correlations on arbitrary long scales in literary texts. In this paper we explain how long-range correlations flow from highly structured linguistic levels down to the building blocks of a text (words, letters, etc..). By combining calculations and data analysis we show that correlations take form of a bursty sequence of events once we approach the semantically relevant topics of the text. The mechanisms we identify are fairly general and can be equally applied to other hierarchical settings.
引用
收藏
页码:11582 / 11587
页数:6
相关论文
共 50 条
  • [1] Quantifying origin and character of long-range correlations in narrative texts
    Drozdz, Stanislaw
    Oswiecimka, Pawel
    Kulig, Andrzej
    Kwapien, Jaroslaw
    Bazarnik, Katarzyna
    Grabska-Gradzinska, Iwona
    Rybicki, Jan
    Stanuszek, Marek
    INFORMATION SCIENCES, 2016, 331 : 32 - 44
  • [2] LONG-RANGE CORRELATIONS BETWEEN LETTERS AND SENTENCES IN TEXTS
    EBELING, W
    NEIMAN, A
    PHYSICA A, 1995, 215 (03): : 233 - 241
  • [3] LANGUAGE AND CODIFICATION DEPENDENCE OF LONG-RANGE CORRELATIONS IN TEXTS
    Amit, M.
    Shmerler, Y.
    Eisenberg, E.
    Abraham, M.
    Shnerb, N.
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 1994, 2 (01) : 7 - 13
  • [4] Origin of long-range azimuthal correlations in hadronic collisions
    Torrieri, Giorgio
    PHYSICAL REVIEW C, 2014, 89 (02):
  • [5] On the origin of long-range azimuthal correlations in hadronic collisions
    Torrieri, Giorgio
    XXXVI BRAZILIAN WORKSHOP ON NUCLEAR PHYSICS, 2014, 1625 : 66 - 72
  • [6] LONG-RANGE CORRELATIONS
    BOTKE, JC
    PHYSICAL REVIEW LETTERS, 1973, 31 (10) : 658 - 661
  • [7] Hierarchical structures induce long-range dynamical correlations in written texts
    Alvarez-Lacalle, E.
    Dorow, B.
    Eckmann, J. -P.
    Moses, E.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (21) : 7956 - 7961
  • [8] Computer and natural language texts - A comparison based on long-range correlations
    Kokol, P
    Podgorelec, V
    Zorman, M
    Kokol, T
    Njivar, T
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1999, 50 (14): : 1295 - 1301
  • [9] Modeling Long-Range Dynamic Correlations of Words in Written Texts with Hawkes Processes
    Ogura, Hiroshi
    Hanada, Yasutaka
    Amano, Hiromi
    Kondo, Masato
    ENTROPY, 2022, 24 (07)
  • [10] Degeneracy and long-range correlations
    Delignieres, D.
    Marmelat, V.
    CHAOS, 2013, 23 (04)