Asymptotic seed bias in respondent-driven sampling

被引:0
|
作者
Yan, Yuling [1 ]
Hanlon, Bret [2 ]
Roch, Sebastien [3 ]
Rohe, Karl [4 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Sherrerd Hall,Charlton St, Princeton, NJ 08544 USA
[2] Univ Wisconsin, Dept Biostat & Med Informat, 610 Walnut St, Madison, WI 53726 USA
[3] Univ Wisconsin, Dept Math, 480 Lincoln Dr, Madison, WI 53792 USA
[4] Univ Wisconsin, Dept Stat, 1300 Univ Ave, Madison, WI 53706 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2020年 / 14卷 / 01期
关键词
Limit distribution; Calton-Watson process; Volz-Heckathorn estimator; LIMIT-THEOREMS; BLOCKMODELS;
D O I
10.1214/20-EJS1698
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Respondent-driven sampling (RDS) collects a sample of individuals in a networked population by incentivizing the sampled individuals to refer their contacts into the sample. This iterative process is initialized from some seed node(s). Sometimes, this selection creates a large amount of seed bias. Other times, the seed bias is small. This paper gains a deeper understanding of this bias by characterizing its effect on the limiting distribution of various RDS estimators. Using classical tools and results from multi-type branching processes [12], we show that the seed bias is negligible for the Generalized Least Squares (GLS) estimator and non-negligible for both the inverse probability weighted and Volz-Heckathorn (VH) estimators. In particular, we show that (i) above a critical threshold, VH converge to a non-trivial mixture distribution, where the mixture component depends on the seed node, and the mixture distribution is possibly multi-modal. Moreover, (ii) GLS converges to a Gaussian distribution independent of the seed node, under a certain condition on the Markov process. Numerical experiments with both simulated data and empirical social networks suggest that these results appear to hold beyond the Markov conditions of the theorems.
引用
收藏
页码:1577 / 1610
页数:34
相关论文
共 50 条
  • [1] Bias decomposition and estimator performance in respondent-driven sampling
    Sirianni, Antonio D.
    Cameron, Christopher J.
    Shi, Yongren
    Heckathorn, Douglas D.
    [J]. SOCIAL NETWORKS, 2021, 64 : 109 - 121
  • [2] Respondent-driven sampling
    Schonlau, Matthias
    Liebau, Elisabeth
    [J]. STATA JOURNAL, 2012, 12 (01): : 72 - 93
  • [3] Assessing respondent-driven sampling
    Goel, Sharad
    Salganik, Matthew J.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (15) : 6743 - 6747
  • [4] The sensitivity of respondent-driven sampling
    Lu, Xin
    Bengtsson, Linus
    Britton, Tom
    Camitz, Martin
    Kim, Beom Jun
    Thorson, Anna
    Liljeros, Fredrik
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2012, 175 : 191 - 216
  • [5] Diagnostics for respondent-driven sampling
    Gile, Krista J.
    Johnston, Lisa G.
    Salganik, Matthew J.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2015, 178 (01) : 241 - 269
  • [6] Evaluation of Respondent-driven Sampling
    McCreesh, Nicky
    Frost, Simon D. W.
    Seeley, Janet
    Katongole, Joseph
    Tarsh, Matilda N.
    Ndunguse, Richard
    Jichi, Fatima
    Lunel, Natasha L.
    Maher, Dermot
    Johnston, Lisa G.
    Sonnenberg, Pam
    Copas, Andrew J.
    Hayes, Richard J.
    White, Richard G.
    [J]. EPIDEMIOLOGY, 2012, 23 (01) : 138 - 147
  • [7] ASSESSING RESPONDENT-DRIVEN SAMPLING
    Goel, S.
    [J]. SEXUALLY TRANSMITTED INFECTIONS, 2011, 87 : A15 - A15
  • [8] RESPONDENT-DRIVEN SAMPLING AND AN UNUSUAL EPIDEMIC
    Malmros, J.
    Liljeros, F.
    Britton, T.
    [J]. JOURNAL OF APPLIED PROBABILITY, 2016, 53 (02) : 518 - 530
  • [9] Nonparametric identification for respondent-driven sampling
    Aronow, Peter M.
    Crawford, Forrest W.
    [J]. STATISTICS & PROBABILITY LETTERS, 2015, 106 : 100 - 102
  • [10] THE GRAPHICAL STRUCTURE OF RESPONDENT-DRIVEN SAMPLING
    Crawford, Forrest W.
    [J]. SOCIOLOGICAL METHODOLOGY, VOL 46, 2016, 46 : 187 - 211