Multilevel modelling of complex survey data

被引:432
|
作者
Rabe-Hesketh, Sophia
Skrondal, Anders
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ London, Inst Educ, London WC1N 1AZ, England
[3] London Sch Econ & Polit Sci, London, England
[4] Norwegian Inst Publ Hlth, Oslo, Norway
关键词
adaptive quadrature; generalized linear latent and mixed model; generalized linear mixed model; gllamm program; multilevel model; probability weighting; 'Program for international student assessment'; pseudolikelihood; sandwich estimator; stratification;
D O I
10.1111/j.1467-985X.2006.00426.x
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Multilevel modelling is sometimes used for data from complex surveys involving multistage sampling, unequal sampling probabilities and stratification. We consider generalized linear mixed models and particularly the case of dichotomous responses. A pseudolikelihood approach for accommodating inverse probability weights in multilevel models with an arbitrary number of levels is implemented by using adaptive quadrature. A sandwich estimator is used to obtain standard errors that account for stratification and clustering. When level 1 weights are used that vary between elementary units in clusters, the scaling of the weights becomes important. We point out that not only variance components but also regression coefficients can be severely biased when the response is dichotomous. The pseudolikelihood methodology is applied to complex survey data on reading proficiency from the American sample of the 'Program for international student assessment' 2000 study, using the Stata program gllamm which can estimate a wide range of multilevel and latent variable models. Performance of pseudo-maximum-likelihood with different methods for handling level 1 weights is investigated in a Monte Carlo experiment. Pseudo-maximum-likelihood estimators of (conditional) regression coefficients perform well for large cluster sizes but are biased for small cluster sizes. In contrast, estimators of marginal effects perform well in both situations. We conclude that caution must be exercised in pseudo-maximum-likelihood estimation for small cluster sizes when level 1 weights are used.
引用
收藏
页码:805 / 827
页数:23
相关论文
共 50 条