Network analysis of named entity co-occurrences in written texts

Cited by: 9
Author
Amancio, Diego Raphael [1]
Affiliation
[1] Univ Sao Paulo, Inst Math & Comp Sci, Sao Paulo, Brazil
Funding
São Paulo Research Foundation, Brazil;
Keywords
COMPLEX; LANGUAGE;
DOI
10.1209/0295-5075/114/58005
Chinese Library Classification
O4 [Physics];
Discipline classification code
0702 ;
Abstract
The use of methods borrowed from statistics and physics to analyze written texts has allowed the discovery of unprecedented patterns of human behavior and cognition by establishing links between model features and language structure. While current models have been useful for unveiling patterns via analysis of syntactic and semantic networks, only a few works have probed the relevance of investigating the structure arising from the relationships between relevant entities such as characters, locations and organizations. In this study, we represent entities appearing in the same context as a co-occurrence network, where links are established according to a null model based on random, shuffled texts. Computational simulations performed on novels revealed that the proposed model displays interesting topological features, such as the small-world feature, characterized by high values of the clustering coefficient. The effectiveness of our model was verified in a practical pattern recognition task in real networks. When compared with traditional word adjacency networks, our model displayed optimized results in identifying unknown references in texts. Because the proposed representation plays a complementary role in characterizing unstructured documents via topological analysis of named entities, we believe that it could be useful to improve the characterization of written texts (and related systems), especially if combined with traditional approaches based on statistical and deeper paradigms. Copyright (C) EPLA, 2016
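The abstract describes linking entities through a null model built from shuffled texts, then measuring the clustering coefficient. The following is a minimal sketch of that general idea, not the paper's actual procedure: the token window (`window`), the z-score threshold (`z_min`), the number of shuffles, and all function names are assumptions introduced here for illustration.

```python
import random
from collections import Counter
from itertools import combinations
from statistics import mean, pstdev


def cooccurrence_counts(tokens, entities, window=5):
    """Count unordered pairs of distinct entities mentioned within `window` tokens."""
    positions = [(i, t) for i, t in enumerate(tokens) if t in entities]
    counts = Counter()
    for a in range(len(positions)):
        i, e1 = positions[a]
        for b in range(a + 1, len(positions)):
            j, e2 = positions[b]
            if j - i > window:
                break  # later mentions are even farther away
            if e1 != e2:
                counts[frozenset((e1, e2))] += 1
    return counts


def significant_links(tokens, entities, window=5, n_shuffles=200, z_min=2.0, seed=0):
    """Keep pairs whose observed count exceeds the shuffled-text expectation.

    Assumption: a pair is linked when observed > mean + z_min * std over
    shuffled copies of the text. The paper may use a different criterion.
    """
    rng = random.Random(seed)
    observed = cooccurrence_counts(tokens, entities, window)
    null = {pair: [] for pair in observed}
    shuffled = list(tokens)
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)  # destroys word order, keeps frequencies
        counts = cooccurrence_counts(shuffled, entities, window)
        for pair in null:
            null[pair].append(counts.get(pair, 0))
    edges = set()
    for pair, obs in observed.items():
        mu, sigma = mean(null[pair]), pstdev(null[pair])
        if obs > mu + z_min * sigma:
            edges.add(pair)
    return edges


def average_clustering(edges):
    """Mean local clustering coefficient of an undirected, unweighted graph."""
    adj = {}
    for pair in edges:
        u, v = tuple(pair)
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    coeffs = []
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            coeffs.append(0.0)
            continue
        closed = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        coeffs.append(2 * closed / (k * (k - 1)))
    return mean(coeffs) if coeffs else 0.0
```

In this sketch the entity set is assumed to be given (for example, from a named entity recognizer), and shuffling whole tokens is only one possible way to randomize a text.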
Pages: 6
Related papers
50 in total
  • [31] Code Smell Co-occurrences: A Systematic Mapping
    Neto, Antonio
    Bezerra, Carla
    Martins, Julio
    [J]. 36TH BRAZILIAN SYMPOSIUM ON SOFTWARE ENGINEERING, SBES 2022, 2022, : 331 - 336
  • [32] Corpus of Syntactic Co-Occurrences: A Delayed Promise
    Klyshinsky, Eduard S.
    Lukashevich, Natalia Y.
    [J]. ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE, 2018, 789 : 121 - 127
  • [33] Named Entity Recognition Experiments on Turkish Texts
    Kuecuek, Dilek
    Yazici, Adnan
    [J]. FLEXIBLE QUERY ANSWERING SYSTEMS: 8TH INTERNATIONAL CONFERENCE, FQAS 2009, 2009, 5822 : 524 - 535
  • [34] Novel multilayer network analysis to assess variation in the spatial co-occurrences of close kin in wild caribou populations
    Jones, Teri B.
    Manseau, Micheline
    Merriell, Brandon
    Pittoello, Gigi
    Hervieux, Dave
    Wilson, Paul J.
    [J]. GLOBAL ECOLOGY AND CONSERVATION, 2023, 47
  • [35] Named Entity Recognition for Digitised Historical Texts
    Grover, Claire
    Givon, Sharon
    Tobin, Richard
    Ball, Julian
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1343 - 1346
  • [36] Visualization of health-subject analysis based on query term co-occurrences
    Zhang, Jin
    Wolfram, Dietmar
    Wang, Peiling
    Hong, Yi
    Gillis, Rick
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2008, 59 (12): : 1933 - 1947
  • [37] Analyzing Relatedness by Toponym Co-Occurrences on Web Pages
    Liu, Yu
    Wang, Fahui
    Kang, Chaogui
    Gao, Yong
    Lu, Yongmei
    [J]. TRANSACTIONS IN GIS, 2014, 18 (01) : 89 - 107
  • [38] Disentangling categorical relationships through a graph of co-occurrences
    Martinez-Romo, Juan
    Araujo, Lourdes
    Borge-Holthoefer, Javier
    Arenas, Alex
    Capitan, Jose A.
    Cuesta, Jose A.
    [J]. PHYSICAL REVIEW E, 2011, 84 (04)
  • [39] Diversification Improvements Through News Article Co-occurrences
    Yaros, John Robert
    Imielinski, Tomasz
    [J]. 2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING & ECONOMICS (CIFER), 2014, : 130 - 137
  • [40] Attitudes From Mere Co-Occurrences Are Guided by Differentiation
    Alves, Hans
    Hoegden, Fabia
    Gast, Anne
    Aust, Frederik
    Unkelbach, Christian
    [J]. JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 2020, 119 (03) : 560 - 581