Estimating Homophily in Social Networks Using Dyadic Predictions

Cited by: 1
Authors
Berry, George [1]
Sirianni, Antonio [2]
Weber, Ingmar [3]
An, Jisun [4]
Macy, Michael [1]
Affiliations
[1] Cornell Univ, Dept Sociol, Ithaca, NY 14853 USA
[2] Dartmouth Coll, Dept Sociol, Hanover, NH 03755 USA
[3] Qatar Comp Res Inst, Doha, Qatar
[4] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
Keywords
homophily; networks; machine learning; quantitative methodology; SEGREGATION; CORE
DOI
10.15195/v8.a14
Chinese Library Classification
C91 [Sociology]
Discipline classification codes
030301; 1204
Abstract
Predictions of node categories are commonly used to estimate homophily and other relational properties in networks. However, little is known about the validity of using predictions for this task. We show that estimating homophily in a network is a problem of predicting categories of dyads (edges) in the graph. Homophily estimates are unbiased when predictions of dyad categories are unbiased. Node-level prediction models, such as the use of names to classify ethnicity or gender, do not generally produce unbiased predictions of dyad categories and therefore produce biased homophily estimates. Bias comes from three sources: sampling bias, correlation between model errors and node degree, and correlation between node-level model errors along dyads. We examine three methods for estimating homophily: predicting node categories, predicting dyad categories, and a hybrid "ego-alter" approach. This analysis indicates that only the dyadic prediction approach is unbiased, whereas the node-level approach produces both high bias and high overall error. We find that node-level classification performance is not a reliable indicator of accuracy for homophily. Although this article focuses on a particular version of homophily, results generalize to heterophilous cases and other dyadic measures. We conclude with suggestions for research design. Code for this article is available at https://github.com/georgeberry/autocorr.
Pages: 285-307
Page count: 23