Estimation and correction of bias in network simulations based on respondent-driven sampling data

Cited by: 1
Authors
Zhu, Lin [1 ,2 ]
Menzies, Nicolas A. [1 ]
Wang, Jianing [3 ]
Linas, Benjamin P. [3 ,4 ]
Goodreau, Steven M. [5 ]
Salomon, Joshua A. [2 ]
Affiliations
[1] Harvard TH Chan Sch Publ Hlth, Dept Global Hlth & Populat, Boston, MA USA
[2] Stanford Univ, Dept Med, Sch Med, Stanford, CA 94305 USA
[3] Boston Med Ctr, Dept Med, Infect Dis Sect, Boston, MA USA
[4] Boston Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA USA
[5] Univ Washington, Ctr Studies Demog & Ecol, Dept Epidemiol, Dept Anthropol, Seattle, WA USA
Keywords
HEPATITIS-C TRANSMISSION; INJECTION-DRUG USERS; SOCIAL NETWORK; PREEXPOSURE PROPHYLAXIS; RISK; RECRUITMENT; INFERENCE; SNOWBALL; CENTERS; MODELS;
DOI
10.1038/s41598-020-63269-0
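Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code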
07 ; 0710 ; 09 ;
Abstract
Respondent-driven sampling (RDS) is widely used for collecting data on hard-to-reach populations, including information about the structure of the networks connecting the individuals. Characterizing network features can be important for designing and evaluating health programs, particularly those that involve infectious disease transmission. While the validity of population proportions estimated from RDS-based datasets has been well studied, little is known about potential biases in inference about network structure from RDS. We developed a mathematical and statistical platform to simulate network structures with exponential random graph models, and to mimic the data generation mechanisms produced by RDS. We used this framework to characterize biases in three important network statistics - density/mean degree, homophily, and transitivity. Generalized linear models were used to predict the network statistics of the original network from the network statistics of the sample network and observable sample design features. We found that RDS may introduce significant biases in the estimation of density/mean degree and transitivity, and may exaggerate homophily when preferential recruitment occurs. Adjustments to network-generating statistics derived from the prediction models could substantially improve validity of simulated networks in terms of density, and could reduce bias in replicating mean degree, homophily, and transitivity from the original network.
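The simulate-then-sample idea in the abstract can be illustrated with a minimal sketch. It is not the authors' platform: a Bernoulli random graph stands in for an exponential random graph model draw, the recruitment rule (a fixed number of coupons passed to randomly chosen unsampled neighbours) is an assumed simplification of RDS, and all function names are hypothetical. The generalized linear model correction step is not shown. The script compares the population mean degree with two sample-based quantities: the mean of the sampled individuals' full degrees, and the mean degree counting only ties among sampled individuals.

```python
import random


def random_graph(n, p, rng):
    """Undirected Bernoulli random graph as an adjacency dict.

    Assumption: this is only a stand-in for a network drawn from an ERGM.
    """
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                adj[i].add(j)
                adj[j].add(i)
    return adj


def rds_sample(adj, n_seeds, coupons, target, rng):
    """Simplified RDS: recruit without replacement until `target` is reached.

    Each participant passes up to `coupons` coupons to randomly chosen
    neighbours who are not yet in the sample (hypothetical recruitment rule).
    """
    seeds = rng.sample(sorted(adj), n_seeds)
    sampled = list(seeds)
    in_sample = set(seeds)
    queue = list(seeds)
    while queue and len(sampled) < target:
        recruiter = queue.pop(0)
        candidates = sorted(adj[recruiter] - in_sample)
        rng.shuffle(candidates)
        for recruit in candidates[:coupons]:
            if len(sampled) >= target:
                break
            in_sample.add(recruit)
            sampled.append(recruit)
            queue.append(recruit)
    return sampled


def mean_degree(adj, nodes=None):
    """Mean degree over `nodes`, counting every tie present in `adj`."""
    nodes = list(adj) if nodes is None else nodes
    return sum(len(adj[v]) for v in nodes) / len(nodes)


def induced_mean_degree(adj, nodes):
    """Mean degree counting only ties between members of `nodes`."""
    keep = set(nodes)
    return sum(len(adj[v] & keep) for v in nodes) / len(nodes)


rng = random.Random(0)
network = random_graph(500, 0.02, rng)
sample = rds_sample(network, n_seeds=5, coupons=3, target=150, rng=rng)

true_md = mean_degree(network)
reported_md = mean_degree(network, sample)
observed_md = induced_mean_degree(network, sample)

print(f"population mean degree:            {true_md:.2f}")
print(f"mean full degree of sampled nodes: {reported_md:.2f}")
print(f"mean degree within the sample:     {observed_md:.2f}")
```

By construction, the within-sample mean degree can never exceed the mean full degree of the sampled nodes, because ties to unsampled individuals are dropped. Repeating the run over many simulated networks and sampling designs would give the paired population and sample statistics that a prediction model of the kind described in the abstract could be fitted to.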
Pages: 11
Related papers
50 in total
  • [41] Towards the Estimation of Effect Measures in Studies Using Respondent-Driven Sampling
    Michael A. Rotondi
    Journal of Urban Health, 2014, 91 : 592 - 597
  • [42] POPULATION SIZE ESTIMATION USING MULTIPLE RESPONDENT-DRIVEN SAMPLING SURVEYS
    Kim, Brian J.
    Handcock, Mark S.
    JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2021, 9 (01) : 94 - 120
  • [43] Towards the Estimation of Effect Measures in Studies Using Respondent-Driven Sampling
    Rotondi, Michael A.
    JOURNAL OF URBAN HEALTH-BULLETIN OF THE NEW YORK ACADEMY OF MEDICINE, 2014, 91 (03) : 592 - 597
  • [44] Caution on the Interpretation of Respondent-Driven Sampling Results
    Guimaraes, Mark D. C.
    AIDS RESEARCH AND HUMAN RETROVIRUSES, 2020, 36 (04) : 253 - 253
  • [45] Correcting for differential recruitment in respondent-driven sampling data using ego-network information
    Beaudry, Isabelle S.
    Gile, Krista J.
    ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (02) : 2678 - 2713
  • [46] Respondent-driven sampling in a syringe exchange setting
    Hakansson, Anders
    Isendahl, Pernilla
    Wallin, Camilla
    Berglund, Mats
    SCANDINAVIAN JOURNAL OF PUBLIC HEALTH, 2012, 40 (08) : 725 - 729
  • [47] Respondent-driven Sampling for Characterizing Unstructured Overlays
    Rasti, Amir H.
    Torkjazi, Mojtaba
    Rejaie, Reza
    Duffield, Nick
    Willinger, Walter
    Stutzbach, Daniel
    IEEE INFOCOM 2009 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-5, 2009 : 2701 - +
  • [48] Estimating hidden population size using Respondent-Driven Sampling data
    Handcock, Mark S.
    Gile, Krista J.
    Mar, Corinne M.
    ELECTRONIC JOURNAL OF STATISTICS, 2014, 8 : 1491 - 1521
  • [49] Using geographical data and rolling statistics for diagnostics of respondent-driven sampling
    Kim, Brian
    Ogwal, Moses
    Sande, Enos
    Kiyingi, Herbert
    Serwadda, David
    Hladik, Wolfgang
    SOCIAL NETWORKS, 2022, 69 : 74 - 83
  • [50] Consistency for the tree bootstrap in respondent-driven sampling
    Green, A. K. B.
    McCormick, T. H.
    Raftery, A. E.
    BIOMETRIKA, 2020, 107 (02) : 497 - 504