Hypotheses Testing from Complex Survey Data Using Bootstrap Weights: A Unified Approach

被引:0
|
作者
Kim, Jae Kwang [1 ]
Rao, J. N. K. [2 ]
Wang, Zhonglei [3 ,4 ]
机构
[1] Iowa State Univ, Dept Stat, Ames, IA USA
[2] Carleton Univ, Sch Math & Stat, Ottawa, ON, Canada
[3] Xiamen Univ, Wang Yanan Inst Studies Econ WISE, Sch Econ, Xiamen, Fujian, Peoples R China
[4] Xiamen Univ, Wang Yanan Inst Studies Econ WISE, Sch Econ, Xiamen 361005, Fujian, Peoples R China
基金
国家重点研发计划; 加拿大自然科学与工程研究理事会;
关键词
Quasi-likelihood-ratio test; Quasi-score test; Wald test; Wilks' theorem; CHI-SQUARED TESTS; GOODNESS-OF-FIT; VARYING PROBABILITIES; ASYMPTOTIC THEORY; SAMPLING DESIGNS; GENERALIZED BOOTSTRAP; VARIANCE; REPLACEMENT; ESTIMATOR; INFERENCE;
D O I
10.1080/01621459.2023.2183130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Standard statistical methods without taking proper account of the complexity of a survey design can lead to erroneous inferences when applied to survey data due to unequal selection probabilities, clustering, and other design features. In particular, the Type I error rates of hypotheses tests using standard methods can be much larger than the nominal significance level. Methods incorporating design features in testing hypotheses have been proposed, including Wald tests and quasi-score tests that involve estimated covariance matrices of parameter estimates. In this article, we present a unified approach to hypothesis testing without requiring estimated covariance matrices or design effects, by constructing bootstrap approximations to quasi-likelihood ratio statistics and quasi-score statistics and establishing its asymptotic validity. The proposed method can be easily implemented without a specific software designed for complex survey sampling. We also consider hypothesis testing for categorical data and present a bootstrap procedure for testing simple goodness of fit and independence in a two-way table. In simulation studies, the Type I error rates of the proposed approach are much closer to their nominal significance level compared with the naive likelihood ratio test and quasi-score test. An application to an educational survey under a logistic regression model is also presented. for this article are available online.
引用
收藏
页码:1229 / 1239
页数:11
相关论文
共 50 条