Homophily and missing links in citation networks

Cited by: 38
Authors
Ciotti, Valerio [1, 2]
Bonaventura, Moreno [1, 2]
Nicosia, Vincenzo [2]
Panzarasa, Pietro [1]
Latora, Vito [2, 3, 4]
Affiliations
[1] Queen Mary Univ London, Sch Business & Management, Mile End Rd, London E1 4NS, England
[2] Queen Mary Univ London, Sch Math Sci, Mile End Rd, London E1 4NS, England
[3] Univ Catania, Dipartimento Fis & Astron, Via S Sofia, I-95123 Catania, Italy
[4] Ist Nazl Fis Nucl, Sez Catania, Via S Sofia, I-95123 Catania, Italy
Source
EPJ DATA SCIENCE | 2016 / Vol. 5
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
citation networks; homophily; link prediction; bibliometric techniques; SUPREME-COURT; SCIENCE;
DOI
10.1140/epjds/s13688-016-0068-2
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701; 070101;
Abstract
Citation networks have been widely used to study the evolution of science through the lenses of the underlying patterns of knowledge flows among academic papers, authors, research sub-fields, and scientific journals. Here we focus on citation networks to cast light on the salience of homophily, namely the principle that similarity breeds connection, for knowledge transfer between papers. To this end, we assess the degree to which citations tend to occur between papers that are concerned with seemingly related topics or research problems. Drawing on a large data set of articles published in the journals of the American Physical Society between 1893 and 2009, we propose a novel method for measuring the similarity between articles through the statistical validation of the overlap between their bibliographies. Results suggest that the probability of a citation made by one article to another is indeed an increasing function of the similarity between the two articles. Our study also enables us to uncover missing citations between pairs of highly related articles, and may thus help identify barriers to effective knowledge flows. By quantifying the proportion of missing citations, we conduct a comparative assessment of distinct journals and research sub-fields in terms of their ability to facilitate or impede the dissemination of knowledge. Findings indicate that Electromagnetism and Interdisciplinary Physics are the two sub-fields in physics with the smallest percentage of missing citations. Moreover, knowledge transfer seems to be more effectively facilitated by journals of wide visibility, such as Physical Review Letters, than by lower-impact ones. Our study has important implications for authors, editors and reviewers of scientific journals, as well as public preprint repositories, as it provides a procedure for recommending relevant yet missing references and properly integrating bibliographies of papers.
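The abstract does not say which statistical test is used to validate the overlap between two bibliographies. As a minimal sketch only, assuming a hypergeometric null model (one common choice for testing set overlap, not confirmed by this record), the idea can be illustrated as follows. The function name `overlap_p_value` and the parameter `universe_size` are hypothetical names introduced for this example.

```python
from math import comb


def overlap_p_value(refs_a, refs_b, universe_size):
    """Probability of seeing at least the observed bibliography overlap by chance.

    Assumption (not taken from the record): the null model draws the references
    of paper B uniformly at random, without replacement, from `universe_size`
    citable papers, which gives a hypergeometric distribution for the overlap.
    """
    a, b = set(refs_a), set(refs_b)
    n_a, n_b = len(a), len(b)
    if universe_size < max(n_a, n_b):
        raise ValueError("universe_size must be at least as large as each bibliography")

    observed = len(a & b)                 # number of shared references
    total = comb(universe_size, n_b)      # all possible bibliographies of size n_b

    # Upper tail: overlaps of size `observed` or larger.
    tail = sum(
        comb(n_a, x) * comb(universe_size - n_a, n_b - x)
        for x in range(observed, min(n_a, n_b) + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Toy bibliographies identified by integer paper IDs (made-up data).
    p = overlap_p_value({1, 2, 3}, {2, 3, 4}, universe_size=10)
    print(f"p-value for the observed overlap: {p:.4f}")
```

Under this sketch, a small p-value means the two papers share more references than chance alone would explain, so the pair could be treated as similar. A pair that is similar in this sense but has no citation between its papers would be a candidate missing link.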
Pages: 14
Related papers
50 in total
  • [1] Homophily and missing links in citation networks
    Valerio Ciotti
    Moreno Bonaventura
    Vincenzo Nicosia
    Pietro Panzarasa
    Vito Latora
    [J]. EPJ Data Science, 5
  • [2] Ruling out static latent homophily in citation networks
    Wittek, Peter
    Daranyi, Sandor
    Nelhans, Gustaf
    [J]. SCIENTOMETRICS, 2017, 110 (02) : 765 - 777
  • [3] Ruling out static latent homophily in citation networks
    Peter Wittek
    Sándor Darányi
    Gustaf Nelhans
    [J]. Scientometrics, 2017, 110 : 765 - 777
  • [4] Missing Links in Multiple Trade Networks
    Foschi, Rachele
    Riccaboni, Massimo
    Schiavo, Stefano
    [J]. 2013 INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2013, : 580 - 585
  • [5] Networks - Teasing out the missing links
    Redner, Sid
    [J]. NATURE, 2008, 453 (7191) : 47 - 48
  • [6] Missing and forbidden links in mutualistic networks
    Olesen, Jens M.
    Bascompte, Jordi
    Dupont, Yoko L.
    Elberling, Heidi
    Rasmussen, Claus
    Jordano, Pedro
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2011, 278 (1706) : 725 - 732
  • [7] Finding missing links in interaction networks
    Terry, J. Christopher D.
    Lewis, Owen T.
    [J]. ECOLOGY, 2020, 101 (07)
  • [8] Outlier detection in networks with missing links
    Gaucher, Solenne
    Klopp, Olga
    Robin, Genevieve
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 164
  • [9] Missing nodes and links in mycorrhizal networks
    Öpik, Maarja
    Moora, Mari
    [J]. NEW PHYTOLOGIST, 2012, 194 (02) : 304 - 306
  • [10] Mapping flows on sparse networks with missing links
    Smiljanic, Jelena
    Edler, Daniel
    Rosvall, Martin
    [J]. PHYSICAL REVIEW E, 2020, 102 (01)