Multilevel modelling of complex survey data

被引：432

作者：

Rabe-Hesketh, Sophia

Skrondal, Anders

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

[2] Univ London, Inst Educ, London WC1N 1AZ, England

[3] London Sch Econ & Polit Sci, London, England

[4] Norwegian Inst Publ Hlth, Oslo, Norway

来源：

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY | 2006年 / 169卷

关键词：

adaptive quadrature; generalized linear latent and mixed model; generalized linear mixed model; gllamm program; multilevel model; probability weighting; 'Program for international student assessment'; pseudolikelihood; sandwich estimator; stratification;

D O I：

10.1111/j.1467-985X.2006.00426.x

中图分类号：

O1 [数学]; C [社会科学总论];

学科分类号：

03 ; 0303 ; 0701 ; 070101 ;

摘要：

Multilevel modelling is sometimes used for data from complex surveys involving multistage sampling, unequal sampling probabilities and stratification. We consider generalized linear mixed models and particularly the case of dichotomous responses. A pseudolikelihood approach for accommodating inverse probability weights in multilevel models with an arbitrary number of levels is implemented by using adaptive quadrature. A sandwich estimator is used to obtain standard errors that account for stratification and clustering. When level 1 weights are used that vary between elementary units in clusters, the scaling of the weights becomes important. We point out that not only variance components but also regression coefficients can be severely biased when the response is dichotomous. The pseudolikelihood methodology is applied to complex survey data on reading proficiency from the American sample of the 'Program for international student assessment' 2000 study, using the Stata program gllamm which can estimate a wide range of multilevel and latent variable models. Performance of pseudo-maximum-likelihood with different methods for handling level 1 weights is investigated in a Monte Carlo experiment. Pseudo-maximum-likelihood estimators of (conditional) regression coefficients perform well for large cluster sizes but are biased for small cluster sizes. In contrast, estimators of marginal effects perform well in both situations. We conclude that caution must be exercised in pseudo-maximum-likelihood estimation for small cluster sizes when level 1 weights are used.

引用

页码：805 / 827

页数：23

共 50 条

[1] MULTILEVEL MODELLING OF SURVEY DATA
Rozi, S.
Mahmud, S.
Lancaster, G.
[J]. JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2011, 65 : A207 - A207
[2] Modelling overdispersion for complex survey data
Molina, EA
Smith, TMF
Sugden, RA
[J]. INTERNATIONAL STATISTICAL REVIEW, 2001, 69 (03) : 373 - 384
[3] A pseudo maximum likelihood approach to multilevel modelling of survey data
Kovacevic, MS
Rai, SN
[J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2003, 32 (01) : 103 - 121
[4] Estimating a Multilevel Model with Complex Survey Data: Demonstration using TIMSS
Lorah, Julie
[J]. JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2019, 18 (02)
[5] Fitting multilevel models in complex survey data with design weights: Recommendations
Carle, Adam C.
[J]. BMC MEDICAL RESEARCH METHODOLOGY, 2009, 9
[6] Fitting multilevel models in complex survey data with design weights: Recommendations
Adam C Carle
[J]. BMC Medical Research Methodology, 9
[7] Multilevel modelling of medical data
Goldstein, H
Browne, W
Rasbash, J
[J]. STATISTICS IN MEDICINE, 2002, 21 (21) : 3291 - 3315
[8] Modelling multilevel data under complex sampling designs: An empirical likelihood approach
Oguz-Alper, Melike
Berger, Yves G.
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 145
[9] MULTILEVEL MODELING OF SURVEY DATA
GOLDSTEIN, H
[J]. STATISTICIAN, 1991, 40 (02): : 235 - 244
[10] Multilevel analysis of survey data
van Oyen, Herman
[J]. INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2009, 54 (03) : 129 - 130

← 1 2 3 4 5 →