Estimation and correction of bias in network simulations based on respondent-driven sampling data

被引:1
|
作者
Zhu, Lin [1 ,2 ]
Menzies, Nicolas A. [1 ]
Wang, Jianing [3 ]
Linas, Benjamin P. [3 ,4 ]
Goodreau, Steven M. [5 ]
Salomon, Joshua A. [2 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Global Hlth & Populat, Boston, MA USA
[2] Stanford Univ, Dept Med, Sch Med, Stanford, CA 94305 USA
[3] Boston Med Ctr, Dept Med, Infect Dis Sect, Boston, MA USA
[4] Boston Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA USA
[5] Stanford Univ, Univ Washington, Ctr Studies Demog & Ecol, Dept Epidemiol,Dept Anthropol, Seattle, WA USA
关键词
HEPATITIS-C TRANSMISSION; INJECTION-DRUG USERS; SOCIAL NETWORK; PREEXPOSURE PROPHYLAXIS; RISK; RECRUITMENT; INFERENCE; SNOWBALL; CENTERS; MODELS;
D O I
10.1038/s41598-020-63269-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Respondent-driven sampling (RDS) is widely used for collecting data on hard-to-reach populations, including information about the structure of the networks connecting the individuals. Characterizing network features can be important for designing and evaluating health programs, particularly those that involve infectious disease transmission. While the validity of population proportions estimated from RDS-based datasets has been well studied, little is known about potential biases in inference about network structure from RDS. We developed a mathematical and statistical platform to simulate network structures with exponential random graph models, and to mimic the data generation mechanisms produced by RDS. We used this framework to characterize biases in three important network statistics - density/mean degree, homophily, and transitivity. Generalized linear models were used to predict the network statistics of the original network from the network statistics of the sample network and observable sample design features. We found that RDS may introduce significant biases in the estimation of density/mean degree and transitivity, and may exaggerate homophily when preferential recruitment occurs. Adjustments to network-generating statistics derived from the prediction models could substantially improve validity of simulated networks in terms of density, and could reduce bias in replicating mean degree, homophily, and transitivity from the original network.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] AN EMPIRICAL EVALUATION OF RESPONDENT-DRIVEN SAMPLING
    McCreesh, N.
    Frost, S.
    Seeley, J.
    Katongole, J.
    Tarsh, M. Ndagire
    Ndungutse, R.
    Jichi, F.
    Maher, D.
    Sonnenberg, P.
    Copas, A.
    Hayes, R. J.
    White, R. G.
    SEXUALLY TRANSMITTED INFECTIONS, 2011, 87 : A15 - A16
  • [32] Respondent-Driven Sampling and Spatial Autocorrelation
    Morris, E. Scott
    Thakar, Vaishnavi
    Griffith, Daniel A.
    ADVANCES IN GEOCOMPUTATION, 2017, : 241 - 251
  • [33] Respondent-driven sampling on directed networks
    Lu, Xin
    Malmros, Jens
    Liljeros, Fredrik
    Britton, Tom
    ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 292 - 322
  • [34] Web-based network sampling - Efficiency and efficacy of respondent-driven sampling for online research
    Wejnert, Cyprian
    Heckathorn, Douglas D.
    SOCIOLOGICAL METHODS & RESEARCH, 2008, 37 (01) : 105 - 134
  • [35] VERIFICATION OF RANDOM SELECTION ASSUMPTION IN RESPONDENT-DRIVEN SAMPLING IN EGOCENTRIC SOCIAL NETWORK DATA
    Liu, H.
    Li, J.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2011, 173 : S110 - S110
  • [36] Social network analysis with respondent-driven sampling data: A study of racial integration on campus
    Wejnert, Cyprian
    SOCIAL NETWORKS, 2010, 32 (02) : 112 - 124
  • [37] Inferring bivariate association from respondent-driven sampling data
    Kim, Dongah
    Gile, Krista J.
    Guarino, Honoria
    Mateu-Gelabert, Pedro
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2021, 70 (02) : 415 - 433
  • [38] A single weighting approach to analyze respondent-driven sampling data
    Selvaraj, Vadivoo
    Boopathi, Kangusamy
    Paranjape, Ramesh
    Mehendale, Sanjay
    INDIAN JOURNAL OF MEDICAL RESEARCH, 2016, 144 : 447 - 459
  • [39] Bias-variance and breadth-depth tradeoffs in respondent-driven sampling
    Nesterko, Sergiy
    Blitzstein, Joseph
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (01) : 89 - 102
  • [40] Respondent-Driven Sampling - Testing Assumptions: Sampling with Replacement
    Barash, Vladimir D.
    Cameron, Christopher J.
    Spiller, Michael W.
    Heckathorn, Douglas D.
    JOURNAL OF OFFICIAL STATISTICS, 2016, 32 (01) : 29 - 73